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SOME PSYCHOSOCIAL CONSEQUENCES OF A 
CHANGE TO A REPLACEMENT BIRTHRATE 2 
IN THE UNITED STATES : 


JAMES M. RICHARDS, Ja. 


The Johns Hopkins University 


Consequences of a change to a replacement birthrate and a stable population 
would include changes in the population age distribution. The present study 
employed estimates of such age distribution changes to estimate changes in 
several psychological characteristics of the United States population, including 
average abilities, personality patterns, and incidence of important life transi- 
tions. Results indicated that these age distribution changes alone would pro- 
duce no major changes in the average psychological characteristics of the 
population of the United States. Therefore, age distribution-related psycho- 
logical changes do not appear to pose any serious objection to a replacement 


birthrate in the United States. 


This study assumes the essential correctness 
of the view that human survival and the op- 
portunity for a decent life for all depend on 
finding solutions to the interrelated problems 
of stabilizing population, increasing food sup- 
ply, and living more in harmony with the 
natural environment and available resources 
(Ehrlich & Ehrlich, 1972; Hutchinson, 1969; 
NAS-NRC Committee on Resources and 
Man, 1969; Kirk, 1973; Looft, 1971; Mea- 
dows, Meadows, Randers, & Behrens, 1972). 
Because most proponents of this view regard 
continuing population growth as a genuine 
and critical emergency, and because the most 
plausible way to stabilize population is by 
first attaining a replacement birthrate,? most 
of their efforts have been devoted to under- 
standing fertility (Fawcett, 1973; Pohlman, 
1969) and to reducing the birthrate. Most 
recent trends in the United States, however, 
suggest that a birthrate at (or below) re- 
placement level is likely in the near future. 
Therefore, it seems appropriate for advocates 


1 Requests for reprints should be sent to James 
M. Richards, Jr., Center For Social Organization of 
Schools, The Johns Hopkins University, Baltimore, 
Maryland 21218. 

2A stable population could also be attained, for 
example, by having the number of births each year 
exactly equal the number of deaths (Frejka, 1968). 
This possibility seems quite improbable given the 
current age distribution of the United States be- 
cause the birthrate would be low when many 
women are in the prime child-bearing years and 
high when few women are in those years. 


of such a transition to devote some effort to 

anticipating the consequences of this change 

in order to minimize any socially destructive 

consequences. Knowledge of such conse- 
quences should contribute to the scientific 
understanding of the psychology of popula- 
tion (American Psychological Association 
Task Force, 1972). 

The present study is intended to be one 
step in such anticipation of consequences. 
Specifically, a replacement birthrate would 
produce gradual changes in the population age 
distribution. The purpose of this research was 
to estimate the changes in several psycho- 
logical characteristics of the United States 
population, including average abilities, per- 
sonality patterns, and incidence of important 
life transitions, that would occur as a conse- — 
quence of such changes in the age distribu- — 
tion. 


METHOD 


Using the 1969 age distribution as a base, age dis. 
tributions were computed at five-year intervals for 
150 years after attainment of a replacement birth- — 
rate. These age distributions were combined with 28 
estimates of the relationship between age and various 
psychological characteristics to yield estimates of 
population psychological characteristics for the same 
five-year intervals. For each characteristic, the 
general procedure was (a) to multiply the typical 
score for people in each of several age ranges by í 
the number of people in that age range in a given 
—ͤ— ; on 

Tables and figures, however, show results only 
to the point at which values stabilize. 1 


a 


T year, (b) to sum across age ranges, and (c) to di- 
vide by the total number of people in the popula- 
non in that year. These computations were severely 
handicapped by the paucity of hard data about the 
relationship between age through the entire life 
span and any psychological characteristics (Bloom, 

1964). 

By deliberate choice, the calculations involved a 
small number of assumptions or parameter esti- 
mates; that is, the calculations involved only esti- 
mated age distributions and estimated relationships 
between age and various psychological character- 
istics. The effect of such social changes as increases 
in average educational level were ignored. This 
procedure assumed that there is an inherent matura- 
tional relationship between age and these psycho- 
logical characteristics. Such an assumption appears 
essential to the purposes of this study. 

In order to compute the age distributions, birth- 
rates specific to women of the United States in 
various age groups were derived by assuming that 
these women continue to have their children at the 
same relative ages, but give birth to only enough 

female children to yield a net reproduction rate 
er 1.00. It was also assumed that the death rates 

for the entire United States population sank to the 

“level of the white population's death rates at the 
dame time a replacement birthrate was attained. 
(For ages above 70, death rates of the white popu- 
lation were estimated from overall rates.) The use 
ot the white population’s death rates for the entire 
United States population was a partial response to 
the charge that measures to stabilize population are 


James M. RICHARDS, JR. 


TABLE 1 


ESTIMATED AGE DISTRIBUTIONS AT FIVE-YEAR INTERVALS AssUMING A REPLACEMENT 
BIRTHRATE WAS ATTAINED IN 1969 


genocidal, Advocates of stabilization may react 
tionally to such maligning of their motives, bul 
is important to consider what serious arguments 
underlie the rhetoric. It is clearly true that 
rates for such groups as blacks, Chi 

American Indians are substantially higher 
death rates for whites. If death rates for 
groups stayed the same while birthrates for 
of specific ages sank to what would be a 
ment rate for whites, the consequences wot 
fact be gradual disappearance of these groups 
the population. In other words, such 
death rates provide clear cut evidence of con 
social injustice in the United States; and Í 
important to attain both stable population 
social justice. The use of death rates of the 
population provides a minimal demographic 
tion of social justice for this study. To faci 
computation these birth and death rates were 
pressed as probabilities, the age distributions 
calculated through simple probability multi 
and the effects of immigration and emigration 
not considered. Therefore the age distributions 
differ slightly from those that could be prodt 
a more complex mathematical model for demograp 
(Coale, 1972). “J 


RESULTS AND DISCUSSION 


Demographic changes. The age dis 
tions in various years after attainment of 
replacement birthrate are summariz 


Age distributions 


Males 

| 
10-19 | 20-29 | 30-39 | 40-49 | 50-59 | 60-69 | 70-up | 0-9 | 10-19 | 20-29 | 30-39 | 40-40 | 50-59 
19.9 15.0 11.1 11.8 10.0 7.0 5.2 19.0 19.2 14.9 11.4 12.5 10.9 
21,0 | 17.6 | 12.2 | 112 | 10.6 | 7.6 | 5.3 16.1 20.3 | 173 12.4 11.7 11.7 
19.7 | 196 | 14.7 | 10.8 | 10.8 | 8.0 | 5.6 | 15.7] 18.9 5% | 147 | 11.2 | 119 
120.7 17.3 | 11.8 | 10.2 | 8:5 | 61 | 17.6} 16.1 | 202 | 17.1 | 12.1 | 11.2 
16.3 | 19.4 | 19,2 | 142 | 98 | 86 | 64 18.8 15.7 | 18.8 | 19:0 | 14:5 | 10.7 
1813 16.5 20.3 | 16.7 | 10.8 | 8.2 | 6.9 18.8 17:5 | 160 | 20.0 | 168 | 11.6 
19:6 180 | 19.0 19% 151 | 7.9 | 7.0 % 18.8 | 15.6 | 18.7 | 18.6 | 13.9 
19.8 18.0 16.2 | 197 | 15.3 | 8.7 | 6.8 |17.0] 18.7 | 174 | 15.9 | 19.6 | 16.1 
115 198 15.8 | 184 | 17.0 | 10.5 | 6.7 | 17.2) 17.7 | 187 | 15.4 | 18.3 | 17.8 
178 | 132 | 177 183 18.0 | 12-3 | 71 |178 | 16.9 % 17.3 | 15.6 | 18.8 
18.5 174 18.9 | 15:3 | 168 | 13.6 | 82 % %% | 17:6 | 18:5. | 15.2 | 17-5 
18.5 | 174 | 18.9 TrA 143 | 144 | 95 | 181) 17.7 | 168 | 18.5 | 17.0 | 149 
18-9 | 126 | 479°) 183 | 140 | ß | 182 | 145 
18.7 2 17% | 183 | 15.7 | ib „ 18.0 | 17.6 | 16.7 | 181 | 16.2 
18.3 18.5 175 17.3 16.8 | 11.2 | 11.4 17% 17.6 | 180 | 16.9 | 17.1 | 17.4 
182 18 1355 16.5 | 16.7 | 12,6 | 10.0 |17.8 17.4 | 17.9 | 17.5 | 164 | 17.3 
Si 3 ER 1528 ee 16.7 | 15.8 | 13.4 NT ATO f 17:5- | ATS 17.9 106| 164 
ah 2 18.5 | 17. A} 17.3 | 15.1 | 13.4 | 10.4 17.8 | 17.7 | 17.3 | 17.7 | 17.2 | 15.7 
de 18. X 0 | 17.7 | 17.7 | 15.3 | 12.6 | 10.7 |17.7 | 17.8 | 17.4 | 17.4 | 17.6 | 15.8 
r A 8.4] 18.5 | 18.2 | 17.5 | 17.5 | 15.8 | 12.1 | 10.9 |176| 17:8 | 17.6 | 171 | 174 | 164 
206 18.5 18.3 | 18.3 | 17.6.) 17.2 | 16.2 | 12.2 | 10.6 17.) 17.6 | 17.7 | 17.3 | 17.0 | 16.8 
23374 | 18.6} 18.3 | 18.2 | 17.9 | 17.0 | 16.1 | 12.7 | 10.2 178 17.6 | 17:7 | 17.5 | 16.8 | 16.6 
2079 | 18.5} 18.4 | 18.1 | 18.0 | 17.1 | 15.7 | 12.9 | 10.2 178 17.6 | 17/5 | 17.6 | 16.9 | 16.3 
18.5 18.4 | 18.0 | 17.9 | 17.3 | 15.5 | 12.8 | 10.4 | 17.7 | 17:7 | 17/5 | 17.5 | 17.2 | 164 


sak ole. Computations were carried out through the year 2119, 


bilize. Units = one million persons, 


but results are shown only to the point at which values apf 


Table 1. The trends shown in Table 1 appear 
to be consistent with other estimates of a 
gradual transition to a stable population in 
the United States (Ehrlich & Ehrlich, 1972). 
The total population continued to increase 
for some 60 to 65 years after attainment of a 
replacement birthrate, with a final population 
about 30% larger than the base population. 
The proportion of children in the population 
decreased; the proportion of aged in the pop- 
ulation increased; and a series of waves 
or peaks moved through the age distribu- 
tion (Frejka, 1968). The first wave repre- 
Sented persons born in the baby-boom after 
World War II: the second wave represented 
their children, etc. As a consequence of these 
waves, the age distribution would not reach 
a steady state for a number of years after the 
population size stabilized. 

Some other demographic consequences of a 
replacement birthrate are presented in Fig- 
ure 1. Data points in this figure represented 
ratios of the number of people in a given 
category in a given year to the number of 
people in the same category in 1969. The 
birthrate stabilized in some 70 to 75 years, 
after considerable fluctuation because of vari- 
ation in the number of women in the child- 
bearing years. The relative increase in popu- 
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Fiover 1. Estimated demographic changes at five-year intervals assuming 
that a replacement birthrate was attained in 1969. 
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lation size was greater than the relative in- 
crease in births, largely because the use of 
the death rates of the white population for 
the death rates of the entire population of — 
the United States produced an increase in 
overall life expectancy. For the same reason, 
the relative increase in the population of 
blacks, or Chicanos, or American Indians, 
etc., was considerably greater than the rela- 
tive increase in the total population. These 
results reduce the plausibility of the charge 
of “genocide,” so long as the search for 
population stability was combined with a 
search for social justice. (However, rela- 
tively little change in the proportion of 
blacks in the total population occurs.) F 
nally, the labor force (i.e., persons in the age 
range 20-65) grew both more and faster in 
size than the total population. 3 
Changes in mental abilities. Estimates b 
Schaie and Strother (1968) and Schal 
(1965, 1970) of the longitudinal relationship l 
between age and T scores on Thurstone’s — 
(1958) SRA Primary Mental Abilities Tests 
were used for the present study. Changes 
between the ages of 21 and 70 were esti- 
mated from results of Schaie and Strother's 
youngest cohort. Changes after age 70 wei 
estimated by assuming that the percentage 


TABLE 2 


ESTIMATED RELATIONSHIP BETWEEN AGE 
AND INTELLECTUAL ABILITIES 


Intellectual ability 

Age = 7 4 5 
Verbal | Reason- Number | Word 

meaning| ing fluency 

| 

0-4 11.5 12.5 10.8 11.5 
5-9 23.8 25.7 22.4 23.7 
10-14 3722 40.2 34.9 37.1 
15-19 46.5 50.3 43.7 | 464 
20-24 49.9 54.4 45.9 50.5 
25-29 51.9 5525 49.8 50.8 
30-34 52.0 56.1 53.2 49.7 
35-39 51.1 56.0 55.1 46.0 
40-44 50.8 87 57.3 45.4 
45-49 49,7 53.4 58.0 44.3 
50-54 48.2 51.4 57.3 40.4 
55-59 46.5 49.3 553 35.7 
60-64 45.6 46.9 52.3 30.9 
65-69 44.7 45.7 SET 27.7 
70-74 43.8 44.5 50.9 24.8 
75-79 42.9 43.4 50.1 22.3 
80-84 42.1 42.3 49.4 20.0 
85-89 41.3 41.2 48.6 17.9 
90-94 40.4 40,2 47.9 16.0 
95-99 39.6 39.1 47.1 14.4 
100-104 38.9 38.1 46.4 12.9 


Note. Entries are based on T scores with X = 50 and SD = 10. 


change between adjacent age intervals are 
the same as the percentage of change between 
Schaie and Strother’s last two age intervals. 
Changes prior to age 21 were computed on 
the assumption that Bayley’s (1955) chart 
of intellectual development adequately de- 
scribes the development of all traits“ The 
final relationships between age and abilities 
produced by these assumptions are summa- 
rized in Table 2. (The Space test is omitted 
because Schaie and Strother’s estimates for 
this test appeared unstable.) These relation- 
ships conform to current psychological no- 
tions about the relationship between age and 
mental ability in that childhood and adoles- 
cence are characterized by steady growth; 
ability reaches a peak during early maturity; 
there is a relatively slow decline during the 
middle years, and a more rapid decline in old 


4Thurstone (1955), as cited by Bloom (1964), 
Proposes a slightly different course of early develop- 
ment for these abilities but not in a form con- 
_ venient for deriving estimates like those used in this 


study. 
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age. The trends shown in Table 


) appear 
generally consistent with Sc! (197% 
latest estimates of maturation nges in 
those abilities. 

The figures shown in Tables 2 Were 
combined to yield estimated a ability 
scores in various years for the popula- 
tion and for the labor force, 7: 3 sum- 
marizes this analysis. It is apparen it only 
minimal changes in average abi would 
occur solely from a change to a cement 
birthrate, and therefore no s n that 


any special problems for society 
from changes in average ability. 
evidence, for example, that the la 
tion of older people would prod 
able decline in average ability. 
An obvious question is the eff 
differences on these conclusio 


id result 
e was no 
* propor- 
a notice- 


of cohort 
Schaie’s | 


(1970) latest analyses showed in gen- 
eral, older cohorts have lowe: ability 
scores with some variation a onorts de 
cohort differences were take unt, 
one would expect slightly lower lation 
averages in the early years after ¿i inment 
of a replacement birthrate, but no i portant 


deviations from the pattern shown in Table 3. 
One would also expect values to converge 
quickly to the values shown in Tabte 3 inas- 
much as persons in the same age ranges as 
Schaie and Strother’s cohorts represent a 
rapidly decreasing proportion of the total 
population. (The mean year of birth for 
Schaie and Strother’s youngest cohort was 
1940. People in the same age ranges as their 
cohorts represented only about 55% of the 
population in 1969 and by the year 2009 will 
represent only about 10% of the population.) 
These expectations were confirmed by com- 
puting population averages on verbal mean- 
ing, taking cohort differences into account. It 
appears, therefore, that Table 3 provides a 
reasonable estimate of the combined effects 
of changes in the age distribution and matt- 
rational changes in these mental abilities. 
Changes in personality patterns. One of 
the few studies relating age to personality, 
which is relevant to this study, appeared to be 
Bendig’s (1960) investigation (based on mer ; 
in Pittsburgh) of age differences in the Guil- 
ford-Zimmerman Temperament Survey. AL 


PSYCHOSOCIAL CONSEQUENCES 


TABLE 3 


ESTIMATED MEAN INTELLECTUAL Auttatv 


Intellectual ability scores 


Population 


~~ aos 


or A REPLACEMENT BIRTHRATE 


Scores ror POPULATION 1N Various YEARS 


Labor force 


Year 
: s j Í A 
Verbal Reasoning | Number Word Verbal | Reasoning | Number Word ` 
| meaning | fluency meaning | fluency 
1000 41.6 4 A 37.0 49.8 53.5 53.4 47 ae 
1974 42.6 45.6 | HS 38.0 49.8 53.6 53.2 HO 
1979 43.0 460 | 449 38.3 49.9 53.7 53.0 45.1 
1984 129 460 | 440 38.1 50.0 53.8 80 | 45.3 
1989. | 42.8 45.8 49 379 50.1 453 
1994 f 429. |, 45.9 45.3 37.9 50.1 45.1 8 
1999 | %% 10 45% 38.1 49.9 44.8 - 
2004 43.4 46.5 46.1 38.2 49.8 H4 ; 
2009 434 4088 | 462 38.1 49.7 442 
2014 43.3 46.3 461 37.7 49.6 44.0 
2019 43.2 461 | 460 374 49.6 | 9 A 
2024 43.2 | 461 | 461 133222 49.7 44.0 N 
2029 437 1462 402 372 49.8 43 i 
2034 43.3 40.2 463 373 49.8 44.4 
2039 33 462 46.2 ĩ 373 49.7 | 442 
2044 433 4456.2 46.2 37.3 | 49.7 | 44.0 3 
2049 Hosea 402, | AOI wanes}. 49.7 44.0 
2054 43.3 46.2 46.2 372, 409.7 4.1 
| E ae 
r 
though obvious questions arise about the ex- per 100,000 people) in various years after i 
tent to which these results were representa- attainment of a replacement birthrate. Table 
tive of the total population of men of the 5 summarizes the results. There was some in- 
United States, some data are better than none, crease in incidence of all three types of psy- — 
and accordingly, age group means obtained by chosis, as well as some differential change in 
Bendig (on the five scales showing signifi- S 
cant age differences) were combined with TABLE 4 - 
the age distributions shown in Table 1 to den A = J 
x : . „ STI MEAN GUILFORD-ZIMMERMAN SCORES 
y ield estimated population he) at 15-year For Mares Ace 20-59 N Vartovs YEARS 
intervals. Results are summarized in Table 4 — —ñññů 
As was apparent from Bendig’s means, again | Males 
there was virtually no change in average jee cs. — OR 
scores for the population as a consequence of m | 12 
a replacement birthrate. In other words, there r eo Se Geese eee 
pas no evidence that the typical personality straint | ance | bility | linity | rela- 
É pattern for the adult male population would | tions 
change very muchii. 5 1969 | 19.98 | 18.41 | 22.42 | 20.04 | 2242 
Another way to estimate changes in per- i984 | 19.89 | 18.58 | 22.50 | 2013 | 2245 
sonality patterns is through changes in the 1000 | 20.08 | 18.32 | 22.35 | 19.97 | 22.52 
incidence of various psychiatric disorders. 2014 | 20.04 | 1830 22.30 19.97 | 22.49 
Accordingly, Jaco’s (1960) estimates (based 9925 2882 18 1 a 1 5 22 0 
on Texans of both sexes combined) of the 2050 | 39.96 | 1831 2233 | 19.97 22449 
age specific incidence of functional, organic, 


and old age psychoses were used to estimate 
incidence in the total population (i.e., cases 


Note, Entries are estimated raw score means, Only scales 
showing significant age differences in Bendig's (1960) study are 
included. 


TABLE 5 
ESTIMATED TRENDS IN INCIDENCE OF FUNCTIONAL, 
ORGANIC, AND OLD AGE PSYCHOSES 


Incidence per 100,000 persons 
Year . 1 — 
Functional Organic Old age 
1969 49.2 9.3 14.7 
č 1974 51.4 9.7 14.9 
1979 53.3 9.9 15.3 
} 1984 54.5 10.1 ETIA 
1989 54.8 10.2 16.1 
1994 55.0 10.4 16.4 
1999 55.4 10.6 16.2 
* 2004 56.3 10.9 16.1 
2 2009 56.9 11.3 16.6 
Er 2014 56.7 11.4 17.9 
ay 2019 56.1 11.3 19.3 
70 2024 55.3 11.2 21.0 
yE 2029 54.6 11.2 22.4 
a 2034 54.8 11.2 22.3 
2039 55.4 11.3 21.2 
2044 55.7 11.3 20.6 
2049 88.5 11.3 20.8 
2054 55.2 11.2 | 214 
| | “i, 


chat the percentage of increase was somewhat 
greater for old age psychoses than for the 
other psychoses. Nevertheless, the overall 
pattern varied very little as a consequence of 
the changes in the age distributions. 

Thus, two different procedures for examin- 
= ing population personality change yielded 
results that were quite consistent with each 
other and with the results for intellectual 
ability presented earlier. A change to a re- 
placement birthrate should produce no strik- 
ing changes in population personality char- 
acteristics that would pose serious problems 
for society. 

Changes in life transitions. In recent years, 
largely through Erickson’s (1950) influence, 
psychological changes through the life span 
have increasingly been identified with im- 
portant life transitions. A corollary of this 
approach is that the overall psychological 
characteristics of a society could be deter- 
mined, in part, by the relative incidence of 
various life transitions in that society. Ac- 
_ cordingly, the last analysis involved changes 
in the incidence of three important life tran- 
sitions to which it appeared reasonable to 
assign a specific age. These were: (a) enter- 


James M. RicuHarps, JR. 


school and leave home each year 


pE 


ing school (age = 6), (6) l 
rental home (age = 18), an 
(age = 65). Changes in the incid of 
transitions for both sexes comt 
marized in Figure 2. 

Results in Figure 2 show (a 
of the waves or peaks that pass igh 
age distribution, (b) an overall 
the incidence of entering school ê ar 
home (the absolute number of pet 
through these transitions also d 
crease), and (c) an overall inc: in 
incidence of retiring. Similar tr 
pected for life transitions that 
lated but that cannot reasonab!; 
to specific ages. For example 
be a relative decrease in the j 
first marriages and à relative incr 
sons facing the death of a spouse. 

The overall effect of the tren 
Figure 2 was to bring the curves 
gether but not to change the ov: 
drastically. More individuals still 


that would retire, even after populat 


relative incidence of life transition: are i 
sufficient to produce drastic changes i 
overall psychological characteri 
United States society. Therefore, 
appears that no major social disioc 
would occur in the United States s 
consequence of a change to a rep 
birthrate. Indeed, some problems 
eliminated, in that society is no long 
with wide fluctuation over relatively brie 
periods in such transitions as entering school 


CONCLUSIONS 


replicati 
th better data about the relationship of 


While more comprehensive 


5 Birth data in Figure 1 involve total births ove 
five-year intervals, while entering school data ™ 
Figure 2 involve incidence in a single year that } 
not at the midpoint of its age interval. Any apparel 
discrepancies between these figures can be attribute 
to these procedural differences. 

et should be emphasized that for about tai 
next fifty years the fluctuations shown in Figure 
will occur whether or not a change to a replaceme” 
birthrate occurs. 
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Estimated incidence at five-year intervals of three important life 


transitions. 


to psychological characteristics are needed, 
the results of this study suggest that changes 
in such characteristics do not appear to pose 
any serious objection to a replacement birth- 
rate in the United States. This is not to say 
that possible psychosocial consequences can 
be entirely ignored in striving to attain and 
maintain a replacement birthrate. A stable 
population, especially if combined with a 
steady-state economy (Daly, 1971), could 
produce major changes in the structure of so- 
ciety, and these changes could involve great 
psychological changes. Specifically, a stable 
population and economy probably involve 
much more stability in the overall society. 
This might produce a much less positive atti- 
tude toward change and toward achievement. 
Other consequences might be a much less 
youth-oriented society, with much more em- 
phasis on age and experience as positive val- 
ues. All of these changes could produce large 
psychosocial changes in the United States 
population. These changes, however, would 
result from a different way of organizing so- 
ciety rather than being a necessary conse- 
quence of the change to a replacement birth- 
rate itself. Therefore, the possibility of such 


changes also does not appear to pose a serious 
objection to a replacement birthrate. 

Since population problems, both practical 
and scientific, confront all of mankind, not 
just the population of the United States, 
similar investigations in other nations could 
be helpful, especially in nations whose popu- 
lation age distributions are quite different 
from the United States distribution. 
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INFLUENCE OF SEX ROLE STEREOTYPES ON 
PERSONNEL DECISIONS 


BENSON ROSEN axo THOMAS H. JERDEE 
Graduate School of Business Administration, University oj North Carolina 


An in-basket exercise was wied to investigate the influence of sex role stereo 
types on the personne! decisions of 95 bank supervisors, The design consisted of 
four separate experiments (In-basket items) in which an employee's sex and 
other situational attributes were manipulated. Results confirmed the hypothesis 
that male administrators tend to discriminate against female employees in 
personnel decisions involving promotion, development, and supervision, Dis- 
crimination against male employees was found in personnel decisions involving 
competing role demands stemming from family circumstances. Features of the 
decision-making process which tend to heighten the influence of sex role 
stereotypes on personnel decisions are discussed 


It is frequently alleged that male adminis- 
trators view females as equipped to do the 
organizational housekeeping but as deficient 
in the toughness, stability, judgment, and 
dedication required for success in managerial 
and other traditionally male roles. To protect 
both the organization and the “vulnerable” 
female employee, male administrators alleg- 
edly resort to a pattern of exclusion in selec- 
tion, promotion, and development which bars 
women from the more challenging organiza- 
tional roles or places them at a disadvantage 
when they do achieve these roles. 

In some ways, however, the commonly al- 
leged sex role stereotypes may tend to work 
in favor of females and against males. Since 
men are expected to pursue their careers with 
single-minded dedication, any intrusion of 
family or other personal considerations may 
be viewed more unfavorably for them than 
for women. In cases involving time off or 
other similar indulgences for family and per- 
sonal reasons, management may deal more 
harshly with males than with females, and 
young male managerial aspirants may be 
reluctant to make such requests for fear of 
jeopardizing their careers. 

In spite of all the allegations and circum- 


1 Requests for reprints should be sent to Benson 
Rosen, Graduate School of Business Administration, 
University of North Carolina, Chapel Hill, North 
Carolina 27514. 
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stantial evidence, the actual behavioral effects ` 
of sex role stereotyping have only recently 
been subject to systematic investigation 
(see, for example, studies by Goldberg, 1968 
Shaw, 1972; Rosen & Jerdee, 1973), While 
some studies have supported the general no- A 
tion of the biasing effects of sex role stereo- 
types, they cover a rather narrow range of 
discriminatory behavior and they do not deal 
at all with those situations in which sex role 
stereotypes may work to the disadvantage of 
males. Accordingly, it is desirable to direct 
research at both types of effects, those forms 
of exclusion and favoritism favoring males 
and those forms favoring females. On the 
basis of the commonly alleged stereotypes for 
males and females, we hypothesized (a) that 
male administrators would tend to discrimi- — 
nate against females in important decisions 
involving promotion, development, and super- 
vision and (5) that they would tend to dis- 
criminate against males in decisions involving 
competing role demands stemming from fam- 
ily or other personal circumstances. These 
hypotheses were tested in an experiment in- 
volving a group of male managers from the 
banking industry. 


aa 
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METHOD 
Subjects 


Ciir ae a 


The subjects were 95 male bank supervisors at- 
tending a management institute at the University of 
North Carolina during the summer of 1972. The 
institute was open to both sexes, but since only 18 
women attended, it was necessary to restrict the X 
study to males. 8 
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male. In the other version, the choi between 

ere an older, unpromotable male and a y r, highly 

Experimental materials were embedded in an in- qualified female. We expected that Č le would 
basket exercise? Subjects were asked to assume the be chosen more frequently than the f 
role of the personnel director of a multibranch bank 3. Solution of a supervisory prot Sex of 
and to react to a series of items in memorandum Supervisor X Sex of Subordinate X N of the 
or letter form. Four of these in-basket items covered problem., This item consisted of a report 
different types of personnel problems and were from a bank supervisor describing a lem with 
written in two or more versions so as to manipulate one of his (her) subordinates and r ting the 
the sex of one or more of the characters and certain termination or at least the transfer oi subordi- 
other attributes of the characters and the situation. nate. We expected that the subjects we be more 

For each item, subjects indicated on fixed-response inclined to accept the supervisor’s judy ent and 
scales their decisions and the extent to which they proceed with the termination or transfer when the 
would find certain reactions to the case acceptable. supervisor was a male than when the su; sor was 
Each subject received only one version of each of a female. We also sought to examine | the 88 
the experimental items and was unaware that other of the subordinate and the nature of problima 
versions existed. From the subjects’ perspective, all (poor performance vs. offensive perso would 
were participating in an identical in-basket exercise affect the subjects’ willingness to termi r trans- 
dealing with personnel problems in the banking in- fer the subordinates. Accordingly, eig sions of 
dustry. The sex variable was completely unobtrusive. this case were used, forming a 2 X2 actorial 
Since each item dealt with a different type of design. a 
problem and involved different dependent variables ‘The major dependent varla or item wer 
(managerial decisions), the design can best be con- ist HN 


ceptualized as a series of four separate experiments 
with different experimental stimuli and different 
dependent variables in cach and with completely 
random assignment of treatments to subjects within 
each experiment. 

Three of the in-basket items involved administra- 
tive decisions in which sex role stereotypes were 
hypothesized to work to the disadvantage of females. 
These items dealt with promotion, development, and 
leadership style. These three items and the experi- 
mental variables of interest were as follows: 


1. Promotion to branch manager: Sex of candi- 
date and complexity of job. This item was in the 
form of a memorandum requesting a decision on the 
promotion of an employee to the position of branch 
manager. The memorandum was written in four ver- 
sions so as to manipulate the variables of sex of 
the candidate and the nature of the manager's job, 
which was described as either complex or routine. 
We expected a greater tendency to promote the 
male, especially when the job was more complex. 

The major dependent variables were (a) a recom- 
mendation either to promote the candidate or to hold 
the candidate’s file and interview additional candi- 
dates, (b) a rating of the candidate on potential for 
customer relations, and (c) a rating of the candidate 
on potential for employee relations. Subjects were 
asked to make their evaluations on 6-point scales 
ranging from extremely unfavorable (1) to extremely 
favorable (6), 

2. Development: Choice of a male or female candi- 
date to atiend a conference. This item was in the 
form of a memorandum asking the personnel director 
to decide which of two persons on his staff should 
attend a personnel conference, In one version of the 
memorandum the choice was between an older, un- 
promotable female and a younger, highly qualified 


2 See Fredricksen (4068) for an example of the use 
of the in-basket technique as a research task. 
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subjects’ ratings of the following ad 
actions: (a) arrange for a termination c 
ordinate; (b) arrange for transfer of 


nate; (c) suggest that the supervisor t gain to 
solve the problem; or (d) arrange for interview 
with both the supervisor and the subo E. 

4. Approval of request for a leave of «ence: Sex 
of employee. The final item involved an ministra- 
tive decision in which sex role stereo s were 
hypothesized to lead to more indulgent >- deferent 
behavior toward females than toward les. This 
item was in the form of a request from a» account- 
ant for time off to take care of children, In Ver- 
sion A, the request was made by a female sccountant — 


who claimed that it was impossible for he: husband 
to assume these responsibilities. Version 
the identical situation except that the acc 
was a male whose wife’s career obliga 
vented her from assuming the child care 
bilities. We expected that subjects would 
to grant the leave to the female account 
it would be more in keeping with the stereo 
type regarding the primacy of a woman’s family 
obligations. 

Subjects were asked to rate the appropriatentss 
of the request and the desirability of the following 
administrative actions: (a) give the employee 4 
leave of absence without salary; (b) do not allow 
the employee to take a leave of absence; and (0) 
allow the employee to take a leave of a 

l 
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with salary. 


Our first major hypothesis stated that male 
administrators would tend to discriminate 
against females in important decisions involv- 
ing promotion, development, and Supervisor) 
practice. Results for the in-basket items rele- 
vant to this hypothesis will now be considered: 


RESULTS | 


INFLUENCE or Sex ROLE Stexvotyres on PERSONNEL Decisions 


TABLE 1 


ADMINISTRATIVE Actions rok Fermate ano Mace CANDIDATES 


Proyonox Case 


4 oo T O 


| Female candidate | Male candidate 
lministrative acti | r 
a Simple pb | Complex job | | Both Simple job | Complex job | Both 
4 (n = 24) | (ms = 25) iota (n = 24) (n = 20) | jobs 
—.— : — — — — best. 
| 1 5 | 19. aot 2 i | 32 
Hold tiles | 10 20 30 f 3 9 |12 
f ial for customer relations“ 4.52 | 4.01 4.25 4.95 451 | 
39 | 439 419| 4.75 466 | 


for employee relations” | 


nber of subjects, 
: rating. 


motion to Branch Manager 
jects’ promotion decisions and mean 


of the candidates are shown in 
1 l. From Table 1 it can be seen that 
s ts were more willing to promote a male 
caudate than a female candidate. The male 


late was recommended for promotion by 

32 cut of 44 subjects, while the female candi- 
date was recommended for promotion by only 
19 out of 49 subjects (x? 6.53, dj =1, 
pb < 05). As might be expected, subjects were 
more willing to promote a candidate to a 
simple job than to a complex job (x? = 13.09, 
dj = 1, p < .01). The interaction of job com- 
plexity by sex of candidate was not signifi- 
cant. It appears that subjects discriminated 
against women in their promotion recom- 
mendations and that the bias against pro- 
moting women to managerial positions held 
across the two conditions of job complexity. 
Table 1 also shows mean ratings of each 
candidate’s potential in the experimental con- 
ditions. Ratings of potential for customer 
relations were influenced by both sex and job 
type. Mean rating for the male candidate was 
4.73 compared to a mean rating of 4.25 for 
ale candidate (F = 4.76, df = 1/80, 

35). The mean rating for candidates for 

the simple job was 4.72 compared to a mean 
of 4.24 for the complex job (F = 4.93, 


. dj = 1/80, p < .05). The sex of the candi- 


date by job complexity interaction was not 
significant. 

Finally, Table 1 shows the mean rating of 
the candidate’s potential for employee rela- 
tions in each of the experimental conditions. 


The sex of the candidate variable was again 


significant. The mean rating on potential for 
employee relations for the male candidate 
was 4.70 compared to 4.19 for the female 
candidate (F = 4.57, df= 1/80, p<.05). 
No significant effects were obtained for the 
job complexity variable or for the complex- 
ity by sex or candidate interaction. These 
results clearly demonstrate discrimination 
against females in promotion decisions. 


Choosing an Employee to Attend a 
Professional Training Conference 


Table 2 shows subjects’ choices of an 
employee to participate in the professional 
training conference in the two experimental 
conditions. ay 

When the younger, promotable employee | 
was male, 76% of the subjects selected him 
in preference to the older unpromotable fe- — 
male. On the other hand, when the younger, 
promotable employee was female, only 569% 
selected her in preference to the older, un- 73 


is strongly preferred to à female employee 
with less potential; however, a highly pro- 
motable female is preferred only slightly more 
frequently than an unpromotable male. N 


support to the hypothesis that women are a 
victims of discrimination in important per- 3 
sonnel decisions involving career progress. 


Solution of a Supervisory Problem 


Table 3 shows the mean rating for each of 
the administrative actions under each of the 4 
eight experimental conditions. Ratings of 


TABLE 2 nation of the subordinate w ed 
SELECTION oF EMPLOYEE TO ATTEND CONFERENCE (X = 2.9) and transfer high | 2). Om 
— 5 = speculative interpretation of th: dings 
Male | Female that subjects had greater conñ in th 
nat = ability of a male supervisor th fem 
No. subjects | Proportion | Jo, aubjects | Proportion Supervisor to appraise the serio s$ of 
of row total | of row total performance problem. Accordin; ubjeci 
= would be more willing to acce; > ma 
Version A: Qualified, younger male versus supervisor’s recommendation for te: ination 
a {. Siipomotable, older female For personality problems, the d eren 
37 5 F effects of sex are not as clear. en the 
37 16 12 | 25 8 f — 
ie A 1 requesting supervisor was male, nation 
Version B: Qualified younger female versus of the subordinate was rated l. = 28 
unpromotable, older male and transfer high (X = 41); y he 
115 —— questing supervisor was female inatio 
20 44 25 and transfer rated equally (& ). 
wits = differential effects of sex here jiff 
Total do interpret. 
l a7 — The results for the three in-b: ite 
57 0¹ 37 -39 reported thus far strongly suppor: ne hy- 
= — pothesis that sex role stereotypes le. to d 
oes ‘crimination in personnel decisions volvi 
termination and of transfer are of particular promotion, development, and si Vision, 
interest and will be considered together since We now turn to results relevant to second 
_ these are mutually exclusive actions with ypothesis, Here we predicted that s who 
negatively correlated ratings. deviated from the sex role stereot: s tha 
Main effects were not significant but there depicted men as dedicated to ih - work 
were significant interaction effects (Sex of above family and personal cons‘: ration 


2 


; 


Supervisor X Nature of Problem) for both 
termination (F = 5.30, df = 1/69, p < .05) 
and transfer (F = 4.75, df = 1/69, p < .05). 
For performance problems, the differential 
effects of the supervisor’s sex are quite clear, 
When the requesting supervisor was male, 
termination of the subordinate was rated high 
(X = 4.3) and transfer low (X = 2.9); when 
the requesting supervisor was female, termi- 


TABLE 3 


SUPERVISOR-SUBORDINATE CONFLICT: MEAN RATING OF ADMINIsTRATIVE ACTIONS 
FOR EACH EXPERIMENTAL 


Benson ROSEN AND THomas H. Jerper 


would be dealt with more harshly 
males in similar situations. The 
basket item is concerned with this hypo 


Approving a Leave of Absence 


Mean ratings for the leave request a 
alternative administrative actions are sho 
in Table 4. Subjects rated the request for 


CONDITION 


Male supervisor Female supervisor 
Mehagerial action Performance Personality a Performance Personality Row X 
2 Male | Female | Male | Female | Male | Female | Male | Female 
© ( = 8) | @ =6) | =11)! ( =7) | @ =10)| ( 10) ( = 9) | G@ = 8) 
Arrange a termination 4.25 4.42 3.38 2.62 3.00 3.00 400 | 3.10 3.42 
Arrange a transfer 3.00 2.85 3.45 4.90 4. 4. 2 4. 3.74 
Suggest 1 1 try ete ee ay oe 
to resolve problem 2.00 2.16 2.45 2.00 2.110 
Arrange for interview with Hay rink nee N 
both parties 2 5.40 4.42 4.47 4.62 5.18 5.00 4.41 5.60 4.87 


* 


INFLUENCE or Sex Run Stexnoryres on Parsoxnet Decisions — 


leave to care for small children as signifi- 
cantly less appropriate when it came from a 
male accountant than when it came from a 
emale accountant. In addition, subjects rated 
it significantly more appropriate for a female 
than a male to take a leave of absence without 
salary. Here we have a situation where the 
male sex role stereotype of complete dedica- 
tion to the job appears to be influencing sub- 
jects’ ratings, Subjects clearly perceive it more 
inappropriate for males than for females to 
take time off because of family obligations. 


Discussion 


Our findings support current allegations 
regarding discrimination against women in 
a variety of organizational situations. The 
pervasiveness of the discrimination was dem- 
onstrated by the finding of bias against 
women in personnel decisions related to 
promotion, development, and supervision. 

In view of the unobtrusiveness of the ma- 
nipulations in the experiment (subjects were 
unaware of other versions of each in-basket 
item), the results demonstrating the effect of 
sex role stereotypes seem impressive. Subjects’ 
responses in the simulated decision-making 
situation provided by the in-basket exercise 
were similar in a very important way to com- 
parable administrative decisions in real life; 
that is, each administrative decision was made 
for only one person (sometimes a male and 
sometimes a female). It might be reasonable 
to assume that when managers are confronted 
with both a male and a female employee for 
either a new position, promotion, advanced 
training, or supervisory assistance, they will 
be more likely to act in a fair and impartial 
way. In such instances, the potential for dis- 
crimination is quite salient and subjects might 
tend to guard against favoritism. On the other 
hand, in many organizational settings such 
administrative actions occur as single events, 
as was the case in our experimental situation. 
In these situations, where the possibility of 
discrimination is not an obvious factor in the 
decision process, the influence of sex role 
stereotypes seems to have a definite impact. 

Our data reveal a second feature of the 
decision situation which tends to foster biased 
decisions. Indications of discrimination were 


r 
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TABLE 4 


Kequtsr ror a Leave or Ansence: Mean Raru 
or AFPROPRIATENESS oF REQUEST AWD 
ADMINISTRATIVE Acpoxs 


| Male Female 
túis account- | account. 
ant | ant 


(n = 43) | ose) 


Appropriateness of 


request 1.95 2.64 
Give leave without i 

salary |. 288 | 406 
Do not allow account- | 

ant to take leave | l 

of absence | . 344 
Give leave with 

salary | 10 | 143 


found with regard to administrative actions 
that were not unanimously perceived as 
clearly appropriate or clearly inappropriate. 
For example, discrimination was quite stron; 
in the case where subjects were asked to judge 
the “potential for customer and employe 
relations” of a candidate for promotion to 
branch bank manager and also in the case 
where subjects were asked to determine the 
desirability of granting a leave of absence — 
without pay to a bank accountant. In situa- _ 
tions where the desirability or appropriateness — 
of a particular administrative action is am- 
biguous and only partial information is avail- | 
able to the decision maker, subjects appear 
to fall back on preconceived attitudes (sex — 
role stereotypes) to arrive at their ultimate 
decisions. In a sense, subjects may be pro- 
viding more information for themselves by 
filling in either their own prejudices or some 
widely held societal expectations, 4 
Our findings also demonstrate what may 
become an increasingly serious role conflict S 
for male employees: the conflict dense 
career and family responsibilities. While most 
of the discussion of sex role stereotypes 8 
centered on female role conflicts between 
career and family, it is likely that male 
employees may soon experience à similar 
dilemma. In spite of the obstacles depicted 
in the Results section, more women are 
moving into managerial and professional 
careers. Accordingly, it seems reasonable to 


assume that the husbands of career women 
will be expected to share many of the family 
responsibilities which previously had been 
considered the female’s province. This means 
that changes in attitudes may be necessary 
with regard to the appropriate roles for both 
males and females in professional and private 
life. 

One way to broaden managerial attitudes 
regarding male and female roles is to demon- 
strate to managers the restrictive effects of 
current stereotypes. The in-basket as used in 
this study offers some promise for this pur- 
pose. By collecting data early in the manage- 
ment training program, we were able to feed 
a our results to participants in a later ses- 
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tize to our subjects the many subtle way 
which their own decisions were influences 
sex role stereotypes. 
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The work of Likert and his associates at 
the Institute for Social Research represents a 
major contribution to modern organization 
theory. Likert's two books (1961, 1967) sum- 
marize much of the work of the institute and 
present his own theory of management. The 
theory postulates four systems of manage- 
ment: (a) authoritative, (b) benevolent au- 
thoritative, (c) consultative, and (d] partici- 
pative group. The key organizational char- 
acteristics of the different systems are de- 
fined in terms of leadership, motivation, com- 
munication, decision making, goal setting, and 
control processes. System 4 (participative 

group) is seen as doing the best job of utiliz- 
ing the organization’s human assets, thereby 
producing greater employee motivation and 
more favorable attitudes as well as more 
effective organizational performance. A con- 
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Requests for reprints should be sent to George 
F. Farris, Sloan School of Management, Massachu- 
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THE LIKERT ORGANIZATIONAL PROFILE: 


METHODOLOGICAL ANALYSIS AND TEST OF 
SYSTEM 4 THEORY IN BRAZIL ' 


A 20-item Likert Organizational Profile was administered twice to 256 em- 
ployees in 13 Braten development banks. Actual and ideal bank profiles were 
similar to those found in the United States and ehewhere: employees want 
participative group management methods but say that their organization uses 
autocratic or consultative methods, Factor analyses did not yield the siz dimen- 
sions predicted by LAkert theory; factors were only partially consistent over 
time and for different hierarchical levels. Retrospective scores were quite 
accurate and were equally so for the 6-, 
the original administration. Test-retest reliability of the Likert Organizational 
Profile as a whole was moderate, Bank Likert Organizational Profile scores 
were unrelated to objective measures of organizational effectiveness, but were 
positively related to employee satisfaction. The Likert Organizational Profile 
appears useful for current and retrospective organization studies, but the 
theory of management systems measured by it was only partially supported. 
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12, and 18-month time periods since 


been carried out on the theory (eg., 
1961, 1967; Marrow, Bowers, & Seas 
1967); however, most of the research 
done in organizations in the United States. 
More recently, the theory has been tested 
in Yugoslavia and Japan using various ver- 
sions of a “profile of organizational character- 
istics” (Kavčič, Rus, & Tannenbaum, 1971 
Likert, 1969). The profile is a questionnaire 
that describes an organization on the dimen- 
sions of Likert's theory and has come to be 
known as the Likert Organizational Profile. 
The Likert Organizational Profile has also 
been used in the United States, frequently in . 
studies of change (eig., Blumberg & Wiener, 
1971; Golembiewski & Carrigan, 1970, 
1970b; Marrow et al., 1967). a 
Despite this use of the Likert Organiza- 
tional Profile in organization studies, it has 
not been subject to much analysis as a mea- 
suring instrument. Little evidence has been 
presented as to the Likert Organizational Pi 
file’s reliability or factor structure. The pur- 
pose of the present paper is to present 
methodological analysis of the Likert Org 
zational Profile and to examine the suitabilit 
of using the Likert Organizational Profile to 
study organizations in a country outside the 
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United States—Brazil in the present case. We 
shall also test support for Likert’s theory in 
Brazil. 

To meet these objectives we shall (a) pre- 
sent profiles of Brazilian organizations as 
seen by their members in terms of Likert’s 
four management systems; (b) discuss factor 
analyses of the Likert Organizational Pro- 
file; (c) present a longitudinal analysis of the 
profile; and (d) examine relationships be- 
tween Likert Organizational Profile scores and 


organizational effectiveness and employee 
satisfaction. 

METHOD 
Instrument 


The Likert Organizational Profile used in the 
present study was Form S, distributed by the 
Foundation for Research on Human Behavior 
(1967). Two items (Number 2, subordinates’ confi- 
dence in superiors, and Number 12, subordinates’ 
knowledge of organizational policies) were added 
at the request of the Brazilians, making a total of 
20 items on the profile. Questions were translated 
from English into Portuguese by bilingual Brazilians. 
These translations were then checked and revised 
by the authors in collaboration with the translators 
until all were satisfied that the translations were as 
accurate as possible. An independent retranslation 
into English indicated that the Portuguese and Eng- 
lish versions were essentially identical. 

Likert Organizational Profile questions are an- 
swered on a 20-point scale divided into four sec- 
tions, each section representing one of the four man- 
agement systems described by Likert’s theory. Each 
question has a different set of four descriptive al- 
ternatives corresponding to the characteristics of the 
four management systems. For example, Question 1 
asks, “How much confidence is shown in subordinates 
by superiors?” The four alternatives spread out on 
the 20-point scale are very little, some, substantial, 
and very much. Responses more toward the right 
on the scale receive a higher numerical value and 
indicate use of a management system more like the 
participative group System 4 seen by the theory as 
most effective. According to Likert (1967), the fact 
that all the scales are in the same direction does not 
influence the results. 

The items in Form S are grouped into the six 
theoretical dimensions of organizational processes 
mentioned in the introductory paragraph. Items 1-4 
measure leadership; 5-7, motivation; 8-12, com- 
munication; 13-15, decision making; 16-17, goal 
setting; and 18-20, control. Both individual item 
scores (Golembiewski & Carrigan, 1970a, 1970b; 
Kavčič et al., 1971) and index scores (Blumberg & 
Wiener, 1971; Marrow et al., 1967) on the basic 
dimensions have been used in previous research. 


Procedure 

As part of a larger study (Butt k Farris, 
1971) the Likert Organizational Prof s admin- 
istered in group sessions during visi Brazilian 
development finance institutions. On irst_ad- 
ministration, instructions were to ans\ ch ques- 
tion twice, once to describe the org m as it 
was actually and a second time as ould be 
ideally. On the second administration, 6 months 
later, instructions were to describe the anization 
as it was actually and as it had been pi usly (at 
the time of the first administration, igh the 
first administration was not referred to ifically). 

Complete data are available from rganiza- 
tions. One was privately owned and tt rs were 
either development banks or develop finance 
companies owned by state or regional ernments, 
For 2 banks, there was a 6-month tim between 
administrations; for 8 banks, there v -month 
time lag; and for 3 banks, there wa i8-month — 
time lag after the original administr 1 of the 
Likert Organizational Profile. These ences in 
time were designed to look at accurag of retro- 
spection as a function of the passage 1 It was 
the authors’ impression that the partici; in this 
study were quite interested in the task © npleting 
the questionnaire. The sessions in whic ze ques- 
tionnaires were administered seemed to as well 
as they usually do in the United States similar 
respondents and similar instruments. 
Respondents 

Respondents were professional employess ranging 
from nonsupervisors to bank presidents. ty-seven 
per cent were between the ages of 25 anc 34; 760% 


had more than 15 years of schooling; 53% had been 
with their organizations 4 years or less; and 80% 
were either lawyers, economists, engineers, or at- 
countants. 

Total individual ns for the Time 1 and Time 3 
administrations were 211 and 256, respectively. They 
represent a nonrandom sample of about 85% of the 
professional staff involved with a loan program for 
small and medium-sized industrial firms. The ns with- 
in banks varied from 7 to 41, the size of the sample 
generally reflecting the size of the bank. Size did rot 
influence the results. 


Analysis 


Except for the factor analyses and hierarchical 
analyses, data were analyzed using bank mean scores 
The bank was selected as the unit of analysis since 
the Likert Organizational Profile is designed to mea- 
sure organizational characteristics, and our primar) 
concern is with the management systems of organ 
zations. 


RESULTS 
Profiles 


What management system are these B 
zilian development banks actually using, and 
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MANAGEMENT SYSTEM 

1 2 3 


o 
in subordinates 
in superiors 
Subordinates feel free to talk to 
inates' ideas sought and 
= 
otivation used 
W ponsibility is felt 
i teamwork 
of information flow r 
of downward 1 
ion y 
of upward — & 
ation Fi & 
Si now subordinates’ £ = 
= 
. 2 E 
Subordinates know organizational 
polices 7 r 
Se / f 
WI ecisions are made s] 
Su tes involved in work- ~ \ 4 
related decisions y 
Decision making contributes to l * 
motivation 7 i 
— ̃ ͤ — ah T 7 ier 
How organizational goals are set 4 
A 1 N “<a 
Covert resistance to organizational 4 
goals f 
Distribution of internal control 7 
N s = 
Informal organization resists N F. 
mal organization 4 
ternal control data \ tz 
Actual -——-ldeal 
Ficure 1. Bank mean ideal and actual Likert Organizational Profiles, Time 1 (= 13 banks). AN 


what system would they like to be using istration. It is evident that these 13 financial 
ideally? Figure 1 presents the average actual institutions were actually using a System 3 


~ and ideal profiles from the 13 banks, based on consultative style of management with Syster 
tte first Likert Organizational Profile admin- 2 attributes in the area of decision making 


TABLE 1 


FACTOR ANALYSIS OP IDEAL LIKERT 
ORGANIZATIONAL PROFILE 


Item Loading 


Factor 1: Leadership (5.69,* 280 


1) Confidence in subordinates 
2 Confidence in superiors | 
3) Subs feel free to talk to sups 71 
J) Subs ideas sought, used 


Factor 2: Resistance (1.85,* 38>) 


17) Covert resistance org. goals 82 
19) Informal org. resists formal 76 
Factor 3; Guidance (149% . 45 
20) Use of internal control data 74 
9) Acceptance communication down 71 
Factor 4: Informed decision 
making (1.23, 5 15) 

0 Subs involved decisions related to work | .67 
11) Sups know subs’ problems .66 
13) Where decisions made 63 

15) DM contribute motivation 58 
12) Subs know org’l policies 50 
Factor 5: Dispersion of goal setting 
and control (1.04, . 560) 
s 18) Distribution of internal control .66 
16) How org’l goals set 62 


Factor 6: Motivation and communication 
(08) 61>) 


8) Direction of information flow 76 
6) Where responsibility felt 69 
10) Accuracy of communication up 61 


å Eigenvalue. 
b Cumulative proportion of total variance. 
© Item loads on more than one Factor over .40. 


and goal setting. The ideal style was seen as 
System 4. Analyses of variance indicated 
essentially no differences among banks on 
their ideal profiles and significant differences 
on their actual profiles. Similarly, analysis of 
variance among three hierarchical levels in the 
banks (nonsupervisory professionals, super- 
visors, and top management) showed no dif- 


D. ANTHONY BUTTERFIELD AND GEORGE F. Farris 


ference on ideal profiles, but fre differ 
ences on the actual profiles. Hig ganiz 
tional levels tended to see their ization: 
as being more toward System 4 tha i lov 
levels. Thus, there was substantial emen 
among banks and organizational is th 
System 4—participative group be th 
ideal management style, yet the il sys 
tem was seen as being more toy con 
sultative or autocratic. 

The profiles in Figure 1 are simil: thos 
found in Likert’s studies. They sug: «st that 
in terms of Likert’s theory, Brazilian develops 
ment banks may be using management $ 
tems not very different from those used b 


organizations in the United Stat 
Yugoslavia or Japan. 


Factor Analysis of the Likert Org 
Profile 


The short form of the Likert Orga: 
Profile was designed to measure bas 
organizational processes: leadership 
tion, communication, decision mak»: 
setting, and control. Does it, in fact 
Factor analyses of the Likert Organiza 
Profile were carried out to examine its 
structure and to see if there was emp 
support for the proposed six dimensious. 

Table 1 presents results of ihe factot 
analysis of the ideal responses from individual 
respondents at all organizational levels.“ 
principal-components analysis was used, 
varimax rotation for six factors. The first 
factor is clearly leadership, characterized 
mutual confidence and free communicati 
between superiors and subordinates and bj 
utilization of subordinates’ ideas. We ha 
given tentative labels to the remaining f 
tors, which are at best mixtures of the dimet 
sions proposed by the theory. Resistance M 
corporates covert resistance to organizatio 
goals and the presence of an informal org 
ization resisting the formal one. Guidance 
fers to the open acceptance of communicatio 
from above and the use of internal contra 
data for problem solving rather than puni 


3 The specific program used was Number BM D0. 
from the University of California, Los Ange 
Health Sciences Computing Facility (See Dis 
1967). 
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ment. Informed decision making unites dele- 
gation and involvement in decision making 
with knowledge of subordinates’ problems and 
Í the organization's policies. Dispersion of 
goal setting and control measures whether the 
goal setting and control functions are located 
at the top or spaced throughout the organiza- 
tion. Motivation and communication combines 
accurate upward communication and hori- 
zontal as well as vertical information flow 
with widespread felt responsibility for achiev- 
ing the organization’s goals. 

In additional analyses it became more diffi- 
cult to label factors. Note that the analysis 
of the ideal Likert Organizational Profile 
responses shown in Table 1 should have 
stopped with five factors since the eigenvalue 
for the sixth factor is less than 1.00. A careful 
examination of Table 1 also indicates that 
neither Item 5 (type of motivation used) nor 
Item 7 (amount of teamwork) find a place in 
any of the six factors. 

Additional factor analyses were carried out 
for the actual responses from the two time 
periods. However, since results were much 
less interpretable, they will not be presented 
here. Controls for organizational level tended 
to complicate rather than clarify results. The 
most consistent factors were leadership and 
resistance, but they emerged in a different 
order and with a different makeup in each 
analysis. Usually there were only three or 
four factors with eigenvalues greater than 
1.00. Items 7 and 5 frequently did not appear. 

The results in Table 1 represent in a sense 
the best of what the factor analyses pro- 
duced, that is, somewhat interpretable factors 
with at least some relationship to the theory. 
Repeated analyses produced only fair factor 
consistency over time, fair overlap between 
ideal and actual factors, and fair- to- poor 
comparability across organizational levels. 
This lack of consistency in factors across dif- 
ferent analyses was the characteristic out- 
come, not the emergence of a large and possi- 
bly halo-induced general factor. Other studies 
(Likert, 1967; Marrow et al., 1967) have 
alluded to such a halo factor, but none was 
found in the present case. Since factors were 
not clear cut nor consistent across the differ- 
ent analyses, the creation of factor scores did 


VS se 


not seem justified. Subsequent analyses ‘ 
based on individual items as well as on a 
total Likert Organizational Profile score (0 
all average across the 20 items), The la 
seemed justified in view of high interco 

tions among the twenty Likert Organizi 
Profile items and of Likert’s (1967) asse 
that the Likert Organizational Profile m 
sures relatively consistent management 

tems. 


Longitudinal Analysis of the Likert 
Organizational Profile * 


How reliable are Likert Organizational Pro- . 
file scores over time? In particular, how 
curate are Likert Organizational Profile scores 
when used retrospectively? The accuracy is- 
sue should be of general interest since investi- 
gators frequently ask respondents on que: 
tionnaires to remember how things were at © 
some point in the past, without knowing in 
fact how they really had been. In the — 4 
study we knew. 9 

Accuracy of retrospection. We might expect 
that accuracy of retrospection decreases as 
time increases due to forgetting. Thus, in the 
present study, the banks in which the Time 2 
administration followed the Time 1 adminis- 
tration by 6 months should be the most accu- 


rate; retrospective scores from the 18. mond 
time-lag banks should be the least accurate; 


and the 12-month group of banks should fall 
in between. However, if retrospection were 
perfect, the actual and retrospective profiles 
within each time grouping should be the same. 
Preliminary visual inspection showed remark- 
able overlap between the actual (first admin- 
istration) and retrospective (second adminis- 
tration) profiles within each of the three time 
groups. The ¢ tests, which were done on an 
item-by-item basis separately for each time 
group, confirmed that there were few signifi- | 
cant differences between actual and retrospec- 

tive item scores for any time group. These re- A 
sults are summarized, using mean total Likert 
Organizational Profile scores, in Table 2. An 
F test on the average difference score for the 


tare 


4 All longitudinal analyses were carried out sepa- $ 
rately for each organizational level. Results wens 
essentially identical. Data presented in the longi- 
tudinal section are from respondents at all organiza k 
tional levels combined. 


CABLE 2 


SUMMARY; EFFECT OF TIME ON RETROSPECTION 


Mean total Likert 


| | 
Fe Organizational | 
a Profile | | 
group | E 
in j | | 
months Time! | Retro- | | | 
actual | spective | | 
6 | 991 | 1048 | 82| 1 | ns 
12 [11.84 11.46 | —.23| 7 ns 
1249 8 2 | ons 


18 | 12.32 


three groupings confirmed that there was 
no difference in accuracy among the three 
retrospective periods (F = 0.27, df = 2/10, 
p= ns). 
Test-retest reliability. To look at “real 
change“ as a function of time, similar longi- 
tudinal analyses were carried out comparing 
Time 1 actual Likert Organizational Profile 
Scores with Time 2 actual scores. Results (not 
shown) were the same as for retrospective 
scores: time had no effect. Since time differ- 
ences produced no differences in either retro- 
_ Spective accuracy or change, simple rank- 
order correlations were calculated using all 
13 bank means on the total Likert Organiza- 
_ tional Profile scores. Time 1 actual scores cor- 
related p=.63 with Time 2 retrospective 
Scores and p = .52 with Time 2 actual scores. 
With an n of 13, both correlations are sig- 
nificant beyond the .05 probability level using 
a one-tailed test. The first p (.63) might be 
considered the retrospective reliability of the 
Likert Organizational Profile. The second p 
(82) is the test-retest reliability. Though 
statistically significant, neither is especially 
high. Of course, the moderate test-retest reli- 
ability over the 6-18-month periods is prob- 
ably due to actual changes in management 


systems as well as to characteristics of the 
measuring instrument. 


at Pe fe 


ae 7. 


r 
* T 


eel 


k a sii Naas 


Likert Organizational Profile Scores and 
Effectiveness and Satisfaction 


Likert’s theory predicts positive relation- 
ships between Likert Organizational Profile 
Scores and such dependent variables as orga- 
nizational effectiveness and employee satis- 
faction. That is, the more an organization 


we 
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TABLE 3 
CORRELATIONS BETWEEN BANK TOTA 'RGANI 
ZATIONAL PROFILE SCORES AND Erri ESS 
AND SATISFACTION 
cert 
Dependent variable a 
onal 
ofile 
Effectiveness 
1. Number of development loans, 1970 o1 
2. Ratio of development loans to total 
assets, 1969 07 
3. Rankings of overall effectiveness by 
National Bank experts, 1970 28 
4. Bank self-rating of effectiveness 78* 
Satisfaction 
5. Satisfaction with immediate 
supervisor 24 
6. Employee satisfaction with bank 77* 
7. Evaluation of bank as place to work 83* 
Likert 
file scores and que: n non- 
sionals only, Ns range 11 ding 2 
dinal analysis. 
% < Ol, one-tailed test, 
uses a participative group managen sys- 
tem, the better off it will be. To t ex- 
tent is the theory supported in deve ment 
banking organizations in Brazil? 
Table 3 presents correlations tween 
Time 2 actual bank Likert Organi: «tional 
Profile scores and effectiveness and s+=tisfac- 


tion.’ The first three effectiveness variables 
are independent “objective” measures oi bank 
performance. The number of development 
loans, 1970, is a measure of the development 
productivity of the banks. It represents the 
number of small and medium industrial loan 
projects approved for disbursement in 1970. 
The ratio of development loans to total assets, 


1969,° indicates the relative emphasis the 


banks place on development. The greater the 
proportion of total assets taken up by devel- 


5 Time 2 data were selected to maximize V. Miss- 
ing data and the fact that Time 1 Likert Organiza- 
tional Profile data were collected at different times 
for different banks prevented the use of path analy- 
sis or of other procedures which might have better 
indicated the direction of causality. 

This ratio includes development loans of all types: 
not just small and medium industrial projects. The 


rank ordering of banks on financial data was 50 


highly correlated from year to year in prior years 
that data on this ratio were not obtained for 1970. 


the more the bank is fulfilling 
ssion. Rankings of overall effec- 
National Bank experts, 1970, 
combined ranks of four judges 
who in ently rank ordered the overall 
perfor of the banks they knew best. 
(hi ions among the judges’ rankings 
s stically significant, ranging from 
In general, the objective mea- 
sures were reliable (year-to-year correlations 
i 7Os and .80s) but only moderately 
| with one another, 
ir measures in Table 3 are items 
another re adminis- 
anks at the time of the second 
zational Profile administration. 
I employees’ own perceptions of 
he their bank is, how satisfied they 
i own supervisor, the extent to 
ank’s employees are satisfied 
! issatisfied, and their evaluation 
ol as a place to work compared 
rganizations they know. Since 
organi level had a significant effect on 
th (higher levels reporting more 
fav ponses), the results in Table 3 
are based on Likert Organizational Profile 
and questionnaire data from nonsupervisory 
professional only. 
f Tab! shows that bank scores on the 
Likert rational Profile were not related 
to objective measures of bank effectiveness, 
but that they were related in the predicted 
manner to three of the four questionnaire 
measures of effectiveness and satisfaction. 
Contrary to theory, System 4 methods of 
management were not associated with better 
-~ organizational performance, but they were 
associated with employees’ feelings about per- 
formance and with employee satisfaction. The 
theory ìs thus only partially supported. 
nature of the one satisfaction item 
d in Ta related to the Likert Organiza- 
tional ie ie. Apparently, employees who 
described their organization as more towards 
System 4 were not more satisfied with their 
own supervisor (nor were they more dis- 
- satisfied). One's immediate supervisor may 
not be more satisfying in a System 4 orga- 
nization, yet employees do feel that the entire 
organization is more effective, is a better 
a 
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place to work, and has more satisfied 
ployees, It appears that the Likert On 
nizational Profile can distinguish betwe 
more general organization-wide and 
immediate work-group characteristics, 


Discussion 


Development banks are probably ame 
the more modern of Brazilian organiza’ 
They are relatively new and hire a high 
centage of young and well-educated 
ployees. Under these conditions, the 
Organizational Profile appears to be a us 
instrument for organization studies in B 
Questionnaire administration went 
smoothly, and profiles were remarkably 
lar to those obtained in the United S 
Common stereotypes of Latin American 
ture would predict that in the more 


we might find both actual and ideal profiles 
toward the authoritative end of the e 
These stereotypes merit empirical testing. 
The factor structure of the Likert O 
nizational Profile does not correspond 
to the six hypothesized organizational 
cesses. This is not a serious drawback, 
it does suggest that index scores on the sub- 
scales should be used with great caution. 
Likert himself recognizes the difficulty in ob. 
taining factors corresponding exactly to the 
proposed dimensions (Likert, 1967). He 
argues that either ideal scores or actual scores 
from within a single organization will yiel 
cleaner factors. In the present study, we did 
not have sufficient data to justify factor 
analysis from within a single organization, 
Factor scores aside, much can still be done 
with individual items and especially with a 


program for organizational development ha 
been useful in both Brazil (Butterfield, 1972) 
and the United States (Blumberg & Wiener. 
1971). 

The reliability of the Likert Organizational 
Profile over an extended period of time is a 
complex question. On the one hand, it seems 


-clear that organizational members do a 


k 
* 

£ 
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fairly accurate job in remembering the state 
of their organization 6, 12, or 18 months into 
the past. Furthermore, these differences in 
time have no effect on the accuracy of retro- 
spection. Using the Likert Organizational 
Profile retrospectively thus appears to be quite 
legitimate, at least up to a period of 18 
months. On the other hand, total scores on 
the Likert Organizational Profile are only 
moderately stable over time. The test-retest 
reliability (.52) obtained in the present study, 
though statistically significant, is certainly 
not high by traditional testing standards. 
However, given the small number of orga- 
nizations (13) and the rather dynamic state 
of the development finance field in Brazil, 
perhaps we ought not to have expected the 
test-retest reliability to have been much 
higher. 
Results of the correlations between the 


Variables are of particular theoretical interest 
As predicted, Brazilian development bankers 
Were more satisfied with participative group 
management systems (although such satisfac- 
tion was not dependent on satisfaction with 
_ the immediate supervisor) and said such sys- 
tems would be the ideal. Contrary to theory, 

organizations tending to use such systems were 
not more effective in terms of objective 
performance. 

System 4 theory would explain these dif- 
ferent results by invoking a time-lag argu- 
ment. The benefits of using System 4 methods 
are already apparent in the greater satisfac- 
tion of organization members. Such satis- 

faction usually precedes changes in organiza- 
tional end-product variables. Improved per- 
formance is on the way; it is only a matter 
of time. Unfortunately, although we do have 
longitudinal data in the present study, we 
_ do not have sufficient data over a sufficient 
time period to test adequately the time-lag 

hypothesis. 

An alternative explanation is the “man- 
Power competence performance feedback loop 
hypothesis“ (Farris, 1969; Farris & Butter- 
field, 1971). This theory holds that the effec- 
tiveness of participative methods depends on 
having competent, experienced employees, 
When such a manpower base does not exist, 
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Likert Organizational Profile and dependent 


as is the likely case in developit unn 
participative methods are not y toy 
more effective. Greater relative compete 
resides at the supervisory level, s it clo 
methods of supervision may be m ppropri 
ate as well as more accepted ‘arris | 
Butterfield, 1971). In the present study, fo 
example, satisfaction with imme: en 
visor was not significantly related to being 
a more participative organization f 
These are alternative rather than -omp 
hypotheses. Results of the present «tudy sW 
gest that the Likert Organizat l Profil 


may be a useful instrument for ex; ng 
further, even if System 4 theory is 


partially | — in Brazil. 
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EFFECTS OF A MERGER OF A SMALL AND A LARG] 
ORGANIZATION ON MEMBERS’ BEHAVIORS AND 
EXPERIENCES * 


ALLAN W. WICKER °? AND CLAUDIA E. KAUM 


Claremont Graduate School University of Wisconsin-Mi 


After the merger of two local churches consisting of 419 and 961 member 
118 members formerly from the smaller church and 115 members former! 
from the larger church reported their support for activities in both the prio 
and merged churches, and their feelings about participating in the merge 
church, Church support prior to the merger did not differ for the two group 
Results indicated that both groups declined in support after the merger, bu 
the decline was greater for members from the small church. The small churcl 
sample felt less close to the merged church, less obligated and less welcom 


to participate in its activities. 


A number of recent studies have suggested 
that participation in voluntary activities of 
an organization is related to the organiza- 
tion’s size: members of small schools and 
churches have been found to assume more 
positions of responsibility, to enter more dif- 
ferent kinds of activities, and in general to 
show higher levels of support for the activi- 
ties than members of large schools and 

churches (Baird, 1969; Barker & Gump, 
1964; Wicker, 1969a, 1969b; Wicker & 
Mehler, 1971). These behavioral differences 
have been accompanied by differences in sub- 
jective experiences and social norms; mem- 
bers of the small organizations have reported 
greater feelings of responsibility, involvement, 
and obligation to participate, and they have 
shown greater approval of high levels of par- 
ticipation (Barker & Gump, 1964; Wicker, 
1968, 1969b; Wicker & Mehler, 1971). 

The research cited necessarily employed 
correlational designs in which investigators 
selected organizations on the basis of mem- 


1 This report is based on a master’s degree thesis 
submitted to the Department of Psychology, Uni- 
versity of Wisconsin-Milwaukee, by the second 
author, The first author supervised the research. 
The authors are indebted to Alan Baron, Anthony 
Fazio, and J. Richard Hackman for helpful com- 
ments at various stages of the present research, and 
to Martin Thomas and Galen Luebke and members 
of their congregation for cooperation in the study. 

Requests for reprints should be sent to Allan W. 
Wicker, Graduate Faculty in Psychology, Benezet 
Psychology Building, Claremont Graduate School, 
Claremont, California 91711. 


bership size, and then obtained measures of 
participation and experiences fron: a sample 
of members in each organization. s proce- 
dure, particularly for the stu yi churches, 
leaves open the question of direction of effect. 
It seems plausible that chu nembers 


ize that 
willing- 


might actively select a church 
makes demands compatible with i 


ness to expend effort. The prest vestiga- 
tion deals with this problem by ing ad- 
vantage of a natural experiment: merger 


of two churches. Thus it was 
compare the behavior and experien 
same individuals when they were 
of organizations differing in size. 

The two merging churches were located 
within six blocks of one another in a working- 
class area of Milwaukee, Wisconsin. The 
denomination of the smaller church was 
Evangelical United Brethren; it had 419 
members. The larger church was Methodist, 
and had 961 members. These two local con- 
gregations voted to merge after the national 
organizations of these denominations merge 
to form the United Methodist Church. The 
building occupied by the smaller church wa 
abandoned after the merger. Both pastors “ 
the former churches shared in conducting 
worship services in the merged church. 1 
committees were organized to integrate 
members into the various activities of t 
merged church. In the first year, the E l 
of the various organizations in the merge 
church included 98 from the small church 
and 200 from the large church. 


members 
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Unfortunately, the selectivity problem was 
not wholly eliminated in the present study. 
Seventy-one of the members from the smaller 
church did not continue membership in the 
merged church. Of these, approximately one 
fourth stated they did not wish to affiliate 
with the Methodist denomination, one fourth 
had been inactive members, and one half 
lived in outlying areas and saw the transition 
as an opportunity to join churches closer to 
their homes. This attrition of members from 
the smaller church was not judged a serious 
problem, however, since it was clearly 
not the case that the most active members 
of the smaller church were discontinuing 
membership. 

A number of specific expectations for the 
present study were derived from the assump- 
tion that organization size determines, in a 
causal sense, members’ levels of participation 
in organization activities and their feelings of 
identification with the organization, its activi- 
ties, and its members. These expectations 
were as follows: (a) Prior to the merger, 
members of the small church would have 
shown greater levels of support for their 
church than members of the large church, 
while after the merger, the two groups should 
not differ in their levels of support, nor in 
their feelings regarding support (e.g., feeling 
obligated, welcome to participate). (b) The 
degree to which subjects were acquainted 
with members from the other church would 
be the same for both groups. (c) While 
members from both groups would show a 
decline in church support as a result of the 
merger, this decline should be greater for 
the members of the small church since they 
experienced a greater increase in church size. 
(d) The latter group would report, after the 
merger, a greater decline in feelings of pres- 
sure to participate and in feelings of close- 
ness to the church than would members from 
the large church. 


METHOD 
Subjects 


Names of subjects contacted were drawn from the 
membership list of the merged church. All married 
couples formerly from the small church (n = 65) 
and an equal number of randomly selected couples 
formerly from the large church were sent a letter 


$ 
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from the church pastors requesting their participa- 
tion in the study. The letter introduced the investi- ~ 
gators, stressed the importance of the study to the 
church, and stated that members would be contacted — 
for a telephone interview within the next few weeks. 

‘An attempt was made to interview both members — 
of every couple. Repeated attempts were made io 
contact subjects not initially available. Of 130 former 
members (Le., 68 couples) of the small church, 118 
(91%) eventually provided data and 12 (9%) re- 
fused to participate. Of 130 former members (also 
65 couples) of the large church, 115 (889) provided 
data, 2 (2%) could not be reached, and 13 (1096) 
refused to participate. The sex distribution of the 
participating subjects from each church did not differ 
(50% males and females in each sample). 


Telephone Interview 


Telephone interviews, conducted 11 months alter 
the merger, dealt with members’ participation in and 
feelings about church activities. (Complete copies 
of the interview schedules are provided in Kauma, 
1972.) The interviewer’s introductory statement as- 
sured subjects that their responses would be confi- — 
dential, and that although a report of the study 
would be made to church officials, it would contain i 
only group data. “i 

It had been determined that there were 13 activi- — 
ties common to the small, large, and merged 
churches; these included religious study groups, 
administrative board meetings, church picnics, church 
school classes, choir rehearsals, and several other 
organized activities. Subjects were asked if they had 
attended each activity either during the year before ; 
the merger or during the year following the merger. 
If a subject had attended an activity at cither time, 
he was asked if he had had any job or responsibil- 
ity, such as serving as an officer or helping out in 
any other way; on the basis of this information, his 
level of participation was categorized as (@) member 
(attending but having no particular responsibility), 
(b) worker (helping but not having a major respon- ~ 
sibility), or (c) leader (taking a position Which 
involved some control and authority in the activity). 

Subjects were also asked to indicate, for both 
before and after the merger: (a) the number of A 
hours per month they spent in all church activities, 
excluding Sunday worship service, and (b) the 
number of times per month they attended Sunday 
morning worship service. Other questions were (a) 
whether subjects felt obligated to participate in 
each of six activities of the merged church (includ- 
ing choir, men’s or women’s organizations, couples 
club, and others), (b) whether they felt welcome to 
participate in the same six activities, (c) whether 
they felt under more, less, or about the same amount 
of pressure to participate in church activities as a 
result of the merger, (d) whether they felt closer to i 
the merged church, not so close, or about the same 
as they felt toward the prior (small or large) 
church, and (e) whether they were acquainted with 
five persons named by the interviewer. The persons 
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named were former officers of the small or large In another type of analysis support 
church; 8 from a sy 1 * pis sie data, composite indices of sup; were de- 
e è 0 gi urch a 2 ° 
ey ee pubes ey rived and ¢ tests were applie ee com. 
parisons of interest: (a) in the church 
Data from Church Records condition, support by member he small 
Data on amount of money contributed were avail- church versus support by of the 
able for couples in the small, large, and merged large church (expectation: large), 
churches. At the beginning of each fiscal year, (5) in the merged church con support 
members are asked to pledge the amount of money b Bore hrc he small 
they are willing to give during the year. Contribu- y members trom the small 1 ves 
tions are placed in envelopes with an identifying support by members from the church 


number, and records are kept for each donor unit 
(family). Pledge data were available for the merged 
church only. Worship service attendance data on 
individuals were available only for the merged 
church. These data were based on signup sheets 
| which were passed along the pews at each service. 
Data on the number of persons attending Sunday 
worship service per Sunday (based on head counts) 
were available for the small, large, and merged 
churches. This last measure was thus based on all 
members of the churches, and not merely the present 
sample. 


RESULTS 
Support jor Church Activities 


A 2 (church condition, prior or merged) 
X 2 (church size, large or small) repeated 
measures analysis of variance was applied to 
each of the five support measures in Table 1. 

Tt was expected that, for each measure, there 
= would be a significant main effect due to 

church condition (lower support in the 

merged church than in the prior churches) 

and a significant interaction (a greater differ- 

ence between prior and merged conditions 
for members from the small church than for 
members from the large church). The main 
effect of size of church from which members 
came was not expected to be significant, since 
it represented the mean across the two church 
conditions, prior and merged. 

The data in Table 1 show that the present 
predictions were partially confirmed: for four 
of the five measures, the level of support 
was significantly greater in the prior church 
than in the merged church, and for two of 
the five measures, there were significant 
interactions. The plots of these interactions 
(Figure 1) show the expected greater decline 
in support by members from the small 
church. None of the measures showed signifi- 
cant main effects due to the size of the church 
from which subjects came, 


the dif- 
ior and 
he small 
r mem: 
ectation; 


(expectation: small = large), a: 
ference between support in 
merged churches by members 
church versus the same differe 
bers from the large church 
small > large). 

Before the composite scores were derived) 
the four support measures ayailable on indi- 


vidual subjects in both church condi ‘ions (i.e 
the first four measures in Table 1) exe inter- 
correlated, Data for members from the small 
and large churches were pooled ar< separate 
matrices were computed for the por church 
and the merged church. For each urch con- 
dition, all coefficients were positie and rela: 
tively high: (a) In the prior church condi 


tion, the range was from .40 to .94, with @ 
median of .67. (b) In the me 
condition, the range was from 
with a median of .63. It was there 
legitimate to combine the individual measures 
into an overall index of support. 

Composite indices of support were genei 
ated in the following manner: for each of the 
four measures, subjects’ scores were converted 
into standard scores; the four standard scores 
were then summed to yield a support com- 
posite for each subject. Three different 
composite scores were calculated for each Sub. 
ject, one for each comparison of interest: The 
first was based on the distributions of SUP) 
port measures in the prior churches, 
second on the distribution of support mea 
sures in the merged church, and the thir 
on the distributions of difference scores ob: 
tained by subtracting each support measure” 
in the merged church from the corresponding 
measure in the prior church. 

The expectation that for the prior church 
condition, members of the small church wou a 
show higher levels of support for chu! 


TABLE 1 
Means any F Ranos roe Meastens or CNN Surroer 


Measure of support 


Na, adtivities attended 22 


o, activities in which subjects were workers 

ot leaders 18 
No, hours per month spent in activities SN 
No, times per month subjects attended 

Sunday worship service 29 | 
Amount of money contributed per couple 8151! 


Note, For all analyses a = 118 for small and e = 115 for large, except for amount of money contributed pet couple for 


~ @ for small amd æ = $5 for large, 
“9 <o. 


activities than members of the large church 
was not supported (small church composite 
index X = .16, large church £ = —.17; 
t = .74, df = 231, ns). Also, the expectation 
that the two groups of subjects would not 
differ in their levels of support in the merged 
church was disconfirmed at a borderline level 
of significance; members from the small 
church tended to show less support (& = 
38) than members from the large church 
(K = 39; £=1.77, df = 231, p < -10). As 
expected, the decline in support after the 
merger was significantly greater for members 
from the small church (X =.58) than for 
members from the large church (X = —.60; 
3.15, df = 231, p < .01). 

An additional measure of support in the 
small, large, and merged churches was wor- 
ship service attendance based on head counts 
taken at the services. Across the four quar- 
ters which immediately preceded and fol- 
lowed the merger, the percentage attendance * 
(Wicker, McGrath, & Armstrong, 1972) was 

2 Some problems in using the statistic, number of 
persons attending/number of persons on membership 
rolls, as an index of members’ support have been 
noted by Wicker, McGrath, and Armstrong (1972): 

It is relatively unstable for organizations having 

small memberships, and visitors are generally 

counted along with members in the attendance 

tally. Although the instability problem is probably 
not severe for churches as large as those in the 
present study, the data on percentage of attendance 
Should be interpreted and evaluated in the context 
of the other support measures reported. 
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Lance One: 


Merged church | 
P 
Small | Large 

19 9 | 13 | 
14 S 4 8 
66 | 27 | 47 | 
27 | 23 | 23 | 
! 
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greater in the small church than in the large 
church (38% versus 29%; ¢ = 8.60, df = 3, 
p< .01) or in the merged church 05 
versus 30%; f 16.20, dj=3, p< 01). 
However, the percentages of members who 
attended services in the large church and in 
the merged church were nearly identical 
(29% versus 30%). * 
Two measures of church support were 
available only for the merged church: (a) 
amount of money pledged to the church, and 
(b) frequency of attendance at Sunday wor- 
ship service based on the sign-up sheets. As 
expected, members from the small and large 
churches did not differ significantly on either 
measure. À 


Feelings about Supporting Church Activities 


Responses to the question regarding feel- 
ings of closeness toward the merged church 
as compared to the prior church showed the 
expected pattern. A greater percentage of 
members from the small church reported feel- 
ing less close than did members from the 
large church (50% versus 33%), while more 
members from the large church reported no 
difference in feelings of closeness (57% 
versus 45%) and feelings of increased close- 
ness (10% versus 5%). The chi-square for 
the 3 X 2 contingency table = 7.89, df = 2, 
p< .05. , 

Contrary to expectation, members from the 
small and large churches did not differ 
markedly in responses to the question asking 
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Ficure 1. Members’ support for church activities: Interactions between siz 
of church from which members came and prior-merged church conditions. 


them to compare the pressure to participate 
in church activities in the merged church 
with the pressure in the prior church: A 
greater percentage of the small church mem- 
bers reported less pressure (21% versus 
15%) and more pressure (7% versus 2%), 
while more members of the large church re- 
ported the same amount of pressure (83% 
versus 73%). The chi-square for the 3 X 2 
contingency table = 5.83, df= 2, p< .10). 

Although no differences were expected be- 
tween persons from the small and large 
churches in feelings of obligation or feelings 
of being welcome to participate in activities 
of the merged church, the two groups did 
differ. Members from the small church felt 
olbigated to participate in fewer of the six 
named activities of the merged church than 
did members from the large church (Xs were 
-6 and 1.1,respectively; t = —2.87, dj = 231, 
p < .01). The same pattern was obtained for 
feelings of being welcome to participate in 
the same six activities: small church & = 4.1; 
large church X = 4.8, (£ = —3.80, df = 231, 
p < 01). Moreover, significantly more mem- 
bers from the small church said they did not 
feel obligated to participate in any of the 
activities (69% versus 49%, x? = 10.24, df 
=1, P< 001). A large percentage of mem- 


bers from the small church also rted that 
they did not feel welcome narticipate 
in any of the settings (13% versus 3%, . 


x? = 8.67, df=1, p < 01). 


Acquaintanceship with Church i end 


Members from the two churches also dif- 
fered in their acquaintance with ‘ve former 
officers from the other church. The mean 


number of former officers of the church 
known by members of the small church (1.9) 
was greater than the number of former of- 
ficers of the small church known by members 
of the large church (1.3; t = 3.27, df = 234, 
p <.01). Also, a smaller percentage of mem 
bers from the small church reported that they 
did not know any of the former officers (280% 
versus 45%, x? = 7.41, dj = 1, p < 01). 


DISCUSSION 


The pattern of the present results, while 
suggesting that organization size does influ- 
ence members’ participation and experiences, 
also suggests that the additional factor of site 
chosen for the merged church may have 
played an important role. 

The strongest findings supporting the bY- 
pothesis that increases in organization S465 
produce reduced levels of participation and 
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lesser feelings of identification with the orga- 
nization were the following: (a) Support for 
the (smaller) prior churches was greater than 
for the (larger) merged church. (Y) Decre- 
ments in support following the merger gen- 
erally were greater for the members from the 
small church than for members from the 
large church. (c) Feelings of being less close 
to the merged church than to the prior church 
were more prevalent for members from the 
small church. However, members from the 
small church did not report a greater decline 
in pressure to participate as a result of the 
merger than did members from the large 
church, nor did the former group show gen- 
erally higher levels of church support in the 
year preceding the merger. 

A number of differences obtained between 
the two groups as members of the merged 
church suggest that organization size, by it- 
self, is an inadequate explanation for all of 
the present results. Although the size of the 
merged church was obviously the same for 
both groups, members from the small church 
tended to show lower levels of support, and 
felt welcome and obligated to participate in 
fewer activities than members from the large 
church, The apparent weaker identification 
with the merged church by members from the 
small church may be due to the fact that 
for them, the merger involved leaving a 
familiar building and migrating to a different 
site (that of the large church) which was 
already occupied and functioning. And al- 
though there were attempts to integrate the 
members from the small church into the 
merged church by holding joint social and 
organizational meetings and by electing a 
number of small church members to official 
positions in the merged church, many of them 
may have regarded themselves (and have 
been regarded) as new members of an already 
functioning church. 

Two kinds of additional evidence are con- 
sistent with the above interpretation: (a) 
During the telephone interviews, a number 
of members from the small church spontane- 
ously expressed feelings of alienation from 
the merged church and some members from 
the large church indicated they felt their 
group had not done enough to draw in the 


glance appear to be inconsistent with the 
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members from the small church, () Results 
from an earlier study (Wicker & Mehler, 
1971) that contrasted new and established + 
members in two other churches showed a 
pattern very similar to that obtained in the 
small-large comparisons of the present study. 
Wicker and Mehler found that new members 
showed little or no difference from established 
members in participation in the more open 
kinds of church activities, such as Sunday — 
worship service attendance and church con- 
tributions, both of which are available eyen 
to visitors. But the new members showed 
considerably lower rates of participation 
than established members in the various 


other church activities, such as business meet- 


ings, social gatherings, and so on. These 


differences precisely parallel the present dif- 2 


ferences between members from the small and 


large churches in the merged church (Table 


1). 85 
Finally, the following result may at first 
above discussion: members from the small 
church had a greater familiarity with former 
officers from the large church than did 
members from the large church with former 


officers from the small church. If the mem- 


bers from the small church identified less 
closely with the merged church, then why 
would they be more familiar with former 
officers of the large church? Possibly because 
the large church’s former officers were more 
active and visible in the merged church than 
were the former officers of the small church. 
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This study was conducted to test the hypothesis that the role motivation 
theory of managerial effectivences is applicable in highly structured organiza- 
tional contexts of the bureaucratic type, but not applicable in low structure 
contexts of the professional type. Simulated organizations with student mem- 
bers were used—a facsimile of a large business firm at Western Michigan 
University and a series of independent project teams at the University of 
South Florida. The results generally support the hypothesis, with managerial 
motivation being significantly related to promotion decisions in the high 
structure situation, but not in the low structure situation. 


One approach to the study of human mo- 
tivation in the organizational context involves 
an emphasis on the motivational requirements 
of tasks and of informal and formal role re- 
quirements, rather than on the specific needs 
and motives of individuals, which are the 
central focus of the various content theories 
of motivation. Such a role-based approach 
has as its major concern the sum total of 
motivational inputs needed to satisfy job re- 
quirements. What is important is that these 
motivational inputs make a difference in the 
performance of a particular kind of job 
rather than that they be of a particular type, 
such as the need for self-actualization or in- 
trinsic job satisfaction. 

To date this role-based approach has been 
developed most fully in the areas of entre- 
preneurship (McClelland & Winter, 1969) 
and the management of bureaucracies (Miner, 
1965a). However, as has been noted else- 
where (Miner & Dachler, 1973), the approach 
could be extended to cover a whole network 
of key jobs in varying organizational con- 
texts. There is thus the possibility of creating 
a highly parsimonious but comprehensive 
theory of motivation at work, which would 
provide a base for meaningful organizational 


1 Requests for reprints should be sent to John B. 
Miner, Department of Management, Georgia State 
University, 33 Gilmer Street, S.E., Atlanta, Georgia 
30303. 


31 


actions, long before the total com 
human motivation is understood. 

A basic problem in the development o 
a theory relates to the generality or b 
of job definitions. Research to date indicates 
that a theory originally developed to apply 
to managerial positions in the context of a 
large highly structured, hierarchic organiza- 
tion cannot be extended to less structured 
professional organizations where face-to-face 
relationships predominate (Miner, 1971). 
Thus the degree of structuring in the organi- 
rational environment where the job is per- 
formed appears to be an important variable 
in defining job categories for purposes of 
developing role-based theories of motivation 
at work. The present research attempts to 
shed further light on this matter by moving 
from the field situation typified by ongoing 
organizations to simulated situations where a 
greater degree of control of relevant variables 
is possible. 


Role-Motivation Theory of Managerial 
Effectiveness 


The theory which provided a stimulus to 
the studies reported in this article posits six 
types of role prescriptions for managerial 


jobs. These role prescriptions are assumed to 


be primarily of an informal nature in most 


organizations, although the fact that they are 
formally stated would not make the theory 
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less applicable. The role requirements are 
hypothesized to have a high probability of 
occurrence across a broad range of mana- 
gerial jobs and their presence is expected to 
be positively related to organizational goal 
attainment and thus to contribute to organi- 
zational effectiveness. The theory as originally 
stated was viewed as applying to “business 
firms organized in accordance with the scalar 
principle” (Miner, 1965a). 

The six role requirements and their ac- 
companying motivation may be described as 
follows: 


(a) Managers are typically expected to 
behave in ways which do not provoke nega- 
tive reactions from their superiors. In order 
to represent his group upward in the organi- 
zation and to obtain support for his actions 
a manager should maintain good relationships 

~ with those above him. It follows that he 
should have a generally positive attitude 
toward those holding positions of authority. 

(b) There is, at least insofar as peers are 

concerned, a strong competitive element built 
into managerial work. A manager must com- 
pete for the available rewards, both for him- 
self and for his group. If he does not, he may 
lose ground as he and his function are rele- 
~ gated to lower and lower status levels. Cer- 
= tainly without competitive behavior rapid 
5 promotion is very improbable. Thus, mana- 
gers must characteristically strive to win for 
themselves and their groups. In order to meet 
this role requirement a person should be 
favorably disposed toward engaging in compe- 
tition. 

(c) There is a marked parallel between the 
requirements of the managerial role and the 
more general demands of the masculine role 
as it is defined in our society. Both a mana- 
ger and a father are supposed to take charge, 

to make decisions, to take such disciplinary 

action as may be necessary, and to protect 
l others. Thus, one of the more common role 
requirements of the managerial job is that 
dhe incumbent behave in an assertive mascu- 
line manner. Even when women are appointed 
to managerial positions they are expected to 
follow an essentially masculine behavior pat- 
tern. It follows that a desire to meet the re- 


„ 


quirements of masculinity as it is typically 
defined will generally lead to success in meet- 
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ing certain role prescriptions of the 
gerial job as well k 
(d) This is the requirement that the m 
ger exercise power over subordinates; 
direct their behavior in a manner consi 
with organizational, and presumably his 0 
objectives. He must tell others what 
when this becomes necessary, and enfore 
words through appropriate use of po 
and negative sanctions. The individual} 
finds such behavior difficult and emotid 
disturbing would not be expected to 
this particular role prescription. k 
(e) The managerial job requires a pë 
to stand out from his group and a 
position of high visibility. He cannot 
actions of his subordinates as a guide for 
own behavior as a manager. Rather he 
deviate from the immediate group ani 
things that will inevitably invite at 
discussion, and perhaps criticism from 
reporting to him. When this prosp 


viewed as unattractive, when the ide 
standing out from the group, of behaving 
different manner, and of being highly vis 
elicits feelings of unpleasantness, then } 


havior appropriate to the role will occur mi 
less often than would otherwise be the € 

(f) The final role requirement refers 
process of getting the work out and keej 
on top of routine demands. The things ® 
have to be done must actually be done. 
are administrative requirements of this K 
in all managerial work, although the 
activities will vary somewhat from onê 
tion to another. To meet these prescrip 
manager must at least be responsible in 
ing with this type of routine, and ideal 
will gain some satisfaction from it ( 
1965a). 


In order to test this theory the Miner $ 
tence Completion Scale was developed: 
provided measures of motivation relev: 
the six role prescriptions. The Miner 
yields two total scores and seven sub 
scores (two in the area of competitivi 
These subscales have been designated 
ity figures, competitive games, com 
situations, masculine role, imposing 
standing out from group, and routine 
istrative functions. ; 

Tests of the theory using the MSC 


proluced positive results in the types of or- 
ganizations and under the conditions where 
was originally considered applicable 
Cante, Erickson, & Stephenson, 1971; 
Miner, 19682). Criteria used have included 
supervisory performance ratings, peer ratings, 
promotion rates, and managerial level. How- 
ever, the theory has not received support 
when extended to less structured organizations 
such as a management consulting firm which 
uses the professional form and three small 
school distincts where the administrative 
process is largely of a face-to-face nature 
(Miner, 1971). Also, the criterion must be 
one of managerial performance; in the re- 
search and development context when promo- 
tion is heavily weighted with creative and 
professional components negative correlations 
may in fact be obtained (Gantz, Erickson, & 
Stephenson, 1972; Miner, 1965b). 

In the hierarchically structured organiza- 
tion, which relies heavily on formalized pro- 
cedures, relationships and communication of 
a vertical nature are of major importance; 
the very nature of the bureaucratic form 
makes this so. Thus it does not seem surpris- 
ing that the Miner Sentence Completion 
Scale subscales dealing with role or 
in the areas of authority relationships, com 
petition with peers, and exercising power over 
subordinates have contributed the most con- 
sistently significant correlations with criteria, 
although all the other subscales have pro- 
duced significant results on occasion. In small 
“face-to-face and professional organizations, 
on the other hand, horizontal relationships 
and communication tend to be emphasized to 
a much greater extent, relative to the verti- 
cal; the Miner Sentence Completion Scale 
variables should make little difference under 
such circumstances. 

Furthermore, effectively functioning or- 
ganizations should place high value on and 
bestow rewards for behavior that is in ac- 
cordance with the role requirements of their 
particular organizational forms. Thus, both 
role performance evaluations and role assign- 
ments (managerial level) should be predicta- 
ble from the variables of role-motivation 
theory. In addition, to the extent individuals 
are aware of the requirements of managerial 
Work in hierarchic organizations and possess 
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the motives required to perform in aoa 
with them, these individuals would be 
pected to seek out managerial positions in 
order to satisfy their strong motives. There 
is evidence that this does in fact occur (Miner 
& Smith, 1969). 

Given these lines of reasoning and the re 
sults obtained from research on actual organi- 
zations, the role motivation theory of ma 
gerial effectiveness should prove applicable 
any reasonably large organization of 
administrative or bureaucratic type, and 
the very fact of structuring an organiza 
tion in this manner should tend to elicit 
a positive value toward the role prescrip 
tions of the theory. One method of inves- 
tigating this hypothesis is to test the 
theory in temporary simulated organiza- 7 
tions having varying degrees of verti- 
cal structure. Where structuring is greater = 
one would expect the Miner Sentence Com- 
pletion Scale measures to correlate more” 
highly with criterion variables. The approach 
involved here in moving from the field to the 
simulated situation is essentially the same as 
that used by Ghiselli and Lodahl (1958) in 
establishing the construct validity of the 
Ghiselli Self-Description Inventory (Ghiselli, 
1971). 


MeTHOD 
Low-Structure Situation 


The low-siructure simulation used student team 
that worked on various case projects throughout the 
spring quarter of 1972 at the University of South 
Florida. Each project team worked 
of the others on a separate and distinct problem} 
the teams were drawn from three different manage- — 
ment courses. The groups formed on a voluntary 
basis, rather than with an assigned membership, and 
in some cases they formed well in advance of the 
start of the course. The amount of interaction Ws 
sizable with some teams spending as much as 0 
hours per week working together. At least 50% of 
the course grade derived from this group effort. 
There was no organizational structure above be 
project team level and there was no specification o 
a division of labor on either a horizontal or vertical 
basis within the teams. To the extent such internal 
structuring appeared it was entirely emergent. pe: 

In all there were 19 teams studied with member- 
ships ranging from 4 to 7. The total number of in- 
dividuals in the teams was 89. Of these 72 completed 
the Miner Sentence Completion Scale during the 
spring quarter. These 72 constitute the sample for 
purposes of study. 
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A At the end of the quarter when all projects were 
© compkted the team members filled out a brief 
questionnaire based on their experiences in working 
quite closely together over a two-month period. 
The questionnaire was designed to get at the nature 
and extent of emergent leadership; it was worded 
as follows: 


. 


Assume you are a manager of the people listed 
above. You have received a request from your 
immediate superior to recommend one of them 
for a promotion as a manager of another group, 
ie., he is not your replacement. What would your 
recommendation be in terms of who you think 
would be a good leader? 


The assumption was that if role motivation theory 
applies at all, in such low structure situations, it 
+ should differentiate between those nominated for 
leadership positions as a result of their behavior in 
| the project team context and those viewed as lacking 
leadership potential. 
Emergent leadership potential scores were calcu- 
lated by giving 2 points for a first choice and 1 
point for a second choice. To correct for differences 
in group size and because only 72 of the 89 group 
members completed the questionnaire, the score for 
each person was computed as a percentage of the 
total number of points that could have been ob- 
tained. Values of 60 or higher were taken as indica- 
tive of managerial potential. Using this criterion 7 
groups had a single leader and 12 had dual leader- 
ship. Of these, 4 individuals did not complete the 
iner Sentence Completion Scale; the total number 
of emergent leaders was thus 27. Their mean score 
“Was 80. 
The dual leadership phenomenon appeared to be a 
function of a certain amount of clique formation 
Within the groups. As would be expected the single 
leaders had a higher leadership score, 94 versus 76. 
However, the slight trend toward higher total Miner 
Sentence Completion Scale scores for the single lead- 
rs did not approximate significance and thus com- 
ining of the single and dual leaders for pur- 
poses of the analysis seemed justified. There was no 
" significant difference between the mean total Miner 
Sentence Completion Scale scores of members of 
one-leader and of two-leader groups, 
In addition to comparing the Miner Sentence Com- 
pletion Scale scores of the 27 leaders with those of 
the 45 nonleaders, a further differentiation was made 
within the latter group. Those with leadership scores 
in the 20 to 59 range were designated as possible 
leaders; the mean score for these 20 individuals was 
42. Those with the lowest scores 


= 
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choice nomination. © + 


High-Structure Situation 


. The high-structure simulation study used a stu- 
= dent-managed simulated organization called the 
Management Development Corporation operated by 
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individuals enrolled in manageme: 
ern Michigan University. Managem: 
Corporation was a six-level divisio: 
tion established for training purp 
archical form was intended to provid 
and subordinate role experience. 
The students worked on actual cu 
Often the output was in the form oi 
ered to the management department 


puter printouts, data on several cla sociated 
with grade records, analyses of depart studies, 
etc.). It could also be in the í f final 
plans (activities, costs, scheduling of es and 
the like toward the accomplishment of objec- 
tive to which the plan addressed itseli rmation 


to relev: publics about Western“ 

versity, the v 

agement Deve 

paper articles nev letters, etc.) ; nel and 


a r agement 
Development Corporation itself, Operating »rocedures 


| 
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tended to vary somewhat from semester t^ semester 
depending on the problems dealt w ind the 
particular management group. 

The organization was headed by a ; ent who 
reported to a three-man faculty boa > board 
reviewed and approved progress toy djectives 
and received final oral reports at the of the 
semester. Performance appraisals were n by the 
students themselves without any direct ity in- 
volvement. These appraisals were in t orm. of 


some type of superior rating and were s 
converted into course grades. 

Subordinate to the president were varyin 
of vice presidents, general managers, 
managers, project managers, and staff ansi: 
pending on course enrollments. The structure was 
in the form of a pyramid with decreasing numbers 
at each higher level. Leadership was appointive with 
promotion decisions having been made by superiors. 
The organization could and did, promote at any 
time, but larger changeovers occurred at the begin- 
ning of each semester. Each top management group 
selected its own successors. 

All those who subsequently became managers in 
the Management Development Corporation started 
out at the lowest level as staff analysts who worked 
on project teams just as in the Jow-structure situ- 
ation. After an individual had worked as a staff 
analyst for some time he became a candidate for 
selection as a project manager. Based on observed 
performance and anticipated managerial role per- 
formance some were promoted to project manager 
and many were not. Thus, in both the low- and 
high-structure situations what was involved was 
picking a good leader based on observation of prior 
Performance in similar situations, In both cases the 
choices were made by other students; but in one 
instance they were made in a manner typical of 
hierarchically structured Organizations and in an- 
other in a manner typical of organizations with 
minimal structure. This structural variation was the 
experimental differentiation, 
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Al a point subsequent lo the first move into 
management as a project manager factors other than 

verved performance at a lower level assumed in 

sd significance in the promotion process 
city began to enter in the promotion proces 

a sizable degree; so did a personal decision as to 
voether continuing as a manager fn the Manage 
ment Development Corporation was preferable to 
other types of karning opportunities. Thus the 
promotion from staff analyst to project manager 
differed significantly from those to department mana 
ger, general manager, vice president, and president 

The staff analysts derived primarily from the 
introductory management course which was re- 
quired of all business students, Participation in the 
Management Development Corporation was volun- 
tary within the course; students could complete the 
course requirements in several different ways. How- 
ever, the grading system was established so as to 
provide an incentive to work within Management 
Development Corporation, and a relatively high 
proportion of the students did in fact include this 
alternative as one of their activities. 

The Miner Sentence Completion Scale was ad- 
ministered in early 1971 to students enrolled in the 
introductory course in management and to the other 
more advanced students who had continued as 
members of the simulated organization. The response 
rate for the Miner Sentence Completion Scale was 
over 90%. Three basic samples were established as 
follows: (a) leaders—73 individuals who held man- 
agement positions in the Management Development 
Corporation; (b) nonleaders—264 individuals who 
had elected to work in the Management Develop- 
ment Corporation as gj aiden to complete 
requirements for the in ory management 
course; (c) nonorganization—78 individuals who 
could have clected to work in the Management De- 
velopment Corporation as staff analysts, but chose 
not to do so, and completed the management course 
tequirements in other ways. 

It was hypothesized that the Miner Sentence 
Completion Scale scores in the leader sample would 
exceed those in both the nonleader and the non- 
organization groups. In addition it seemed probable 
that if any differences existed between the two non- 
management samples, those who chose the organiza- 
tion should score higher. Reasons for not electing 
membership in the Management Development Cor- 
poration were sufficiently obscure; therefore, this 
could not be considered a strong hypothesis, 

A further differentiation was made within the 
leader sample on the basis of managerial level. The 
higher level group consisted of the president, the 
vice presidents, the general managers, and the 
department managers—a total of 31 individuals. The 
lower level group contained 42 project managers, 
the first-line supervisors. Higher Miner Sentence 
Completion Scale scores were hypothesized for the 
higher level managerial sample than for the lower. 
In addition, comparisons were made between the 
two leader groups and the two nonleader groups to 
identify differences that would hold up over two 
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mpks of cach type. This eee provided 
approximation to replication. 


Rss 
Lone Structure Study 


Table 1 contains the mean Miner Sentence 
Completion Scale scores for the emergent 
leaders and the total nonleaders. None of 
differences attained significance at even 
$ < .10 level. On 5 of the 9 comparisons 
total nonleader mean was higher than 
for the leaders, in direct contradiction to t 
hypothesis. 

It remained possible that the definite non- 
leaders within the total nonleader group 
might have exhibited very low scores. Data on 
this point are also given in Table 1. However, 
an examination of the mean Miner sc 
scores for the three levels of emergent project 
leadership (leaders, possible leaders, and defi- 
nite nonleaders) indicated no support for the 
hypothesis, None of the comparisons among. 
the three samples attained significance at p 
< .10. On 4 of the 9 variables, including the 
two total score measures, the definite non- 
leaders had the highest score, in only one 
instance were they the lowest. The data pro- 
vided no support for role motivation theory 
in a context of low organizational structure. 


High-Structure Study 


Table 2 contains the comparisons involving 
the various groups. As indicated the major 
hypothesis was confirmed. The total leaders, 
who held managerial positions in the simu- — 
lated organization, had significantly higher 
means on both the total score measures than 
both nonleader groups. The subscale compari- 
sons indicated that the degree to which there 
was the potential for meeting managerial role 
requirements that involved competitiveness ~ 
and the performance of routine administrative 
functions distinguished the organizational 
leaders from those who did not hold mana- 
gerial positions, 5 

There was no evidence that those who E 
elected organizational membership at the non 
managerial level differed at all from those 
who did not elect membership. However, the 
low standing out from group score in the 
nonorganizational sample, as contrasted with 
that of the managers, did suggest a possible 
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TABLE 1 
Mean MSCS SCORES FOR EMERGENT Project LEADERS 
AND NONLEADERS, AND WITHIN THE NONLEADER 
GROUP FOR POSSIBLE LEADERS AND DEFINITE 
No: LEADERS (Low STRUCTURE Sruby) 


Possible | Definite Total Tender 
MSCS measure | leaders | nonleaders | nonleaders ( 27 
l(a = 20)| (a = 25) (a = 45) | 
Total score- | 
item -1.00 | — 80 —⏑⏑ | = 93 
Total score- | | 
rare 2.20 — 1.68 -1.91 2.19 
Authority | Í 
figures — 05 — 42 — oO | As 
Competitive } | | 
games 65 | .92 8S 
Competitive 
situations — 95 | -4.52 -1.27 | -1.70 
1 wej — 70 = 52 — — 48 
m ng 
Wishes 1 00 09 | 33 
Standing out | 
- from group 65: | 0 62 | 26 
Routine | 
functions — 80 | — 16 = 44 — 67 
Completion 


aides Abbreviation: MSCS = Miner Sentence 
je. 


reason for not joining the Management De- 
velopment Corporation. The Management 
Development Corporation, compared to other 
course options available to the students, 
required more interaction with other students, 
more group work, and the possibility of 
making informal and formal presentations to 
_ peers, superiors, and faculty. Students who 
Preferred to work independently in more tra- 
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TABLE 2 
_ Mean MSCS SCORES FOR APPOINTED ORGANIZATIONAL LEADERS, NONLEADERS, AND THOSE CHOOSING NOT TO JOIN 


ditional academic roles (e.g., reading, a 
ing lectures, taking exams) may have avoi 
the Management Development Corpora 
and its opportunity to stand out, as an op 0 

The comparison of managerial levels within 
the Management Development Corporat 
failed to yield any significant differences 
p < .10 or better, although the trend of the 
data favored the higher level group. Further- 
more, as indicated in Table 2, the hig 
level managers differed from both nonma 
gerial groups at  < .10 on 5 of the 9 M 
Sentence Completion Scale l V 
the lower level manager group y 
parable differences on only 2 v: 


gerial samples had been available, a 
but statistically significant difference 
tween the two levels would have emerged 0 


both total score and both competitiv 
measures, “ KA 


DISCUSSION 

One of the most striking findings from the 
high-structure study was the consistent dif 
ferentiation between managers and nonman- 
agers on the measure of motivation to per- 
form routine administrative duties. Experi- 
ence suggests that this measure reflected a 


THE ORGANIZATION AND WITHIN THE LEADER GROUP FOR HIGHER LEVEL LEADERS AND LOWER LEVEL LEADERS 


t values 
i | 
Higher | Lower =, Non- Higher level Lower level 
MSCS level level 9 8 BS em organi- leaders vs. leaders vs. Total leaders vs 
measure leaders | leaders (7 3) (1 =264) zation 
(n =31) | (m=42) 7 2 15 (x =78) ia 
Non- | Nom Nen: | Nom | Non- 
leaders dn leaders se pti leaders 

Total score- 2.14 98 1.47 —.76 1.122.087 2.7 | 1.738% 1.76* 3.01 

item 
Total score- 2 

rare 1.13 —1.52 —1.35 —2 2 2.122.305 2.06** — — 2.48 EU 
Authority 

figures fe 03, —.10 —.04 09 36 — 

iti 
pame 1.68 1.00 1.29 85 85 2.18 | 1.99% = — 1.83 
e — 48 —.98 Ske —1.36 —1.31 2,38** 2.08** — — 2.57 
e ~.33 —.33 —.33 —.64 —.7 
1 z 
e 55 55 55 55 49 
out. 

em n eel 60 60 1% % 9 1825 — 22 | 
b 09 24 18 =55 —.76 2.0% 2.50% | 2.939% | 3.23% | 340k 


Ne ole. t 10 (p K. 10) only. MSCS = Miner Sentence Completion Scale. 


*p <A 
2 7 She: 
* SO. 
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Rote Morivarios Trou oy MANAGERIAL Errecrivennss 


ty} nscientiousness, or perhaps a mild 
eness. In a student-run organization 
he Management Development Corpo- 


r his role requirement would appear 
t varticularly important. There was a 
5 need to get someone to participate 
a the work. Those who exhibited 
a re to perform the necessary routine 


t were thus most likely to be selected 
b existing managers for new managerial 
p ns. Each group of managers that was 
I ‘ted to higher levels perceived a need 
f sp in getting its work accomplished and 
© tapped a group that exhibited motiva- 
to fill the role requirement to replace the 
level managerial positions that were 
v rated. 
ne of the major problems with the Man- 
ment Development Corporation simulated 
-anization as a teaching device noted by 
management faculty at Western M 
ıs that a number of students exhibited only 
nimal involvement, It appears from the 
resent research that the hierarchic leader- 
ip selection process operates to provide a 
least at the level 


group work. Yet 

nization was staffed and run by students who 
move from junior to senior standing and then 
graduate, it departed from most actual orga- 
nizations in the rapidity with which turnover 
occurred and openings developed in the mana- 
gerial ranks, most significantly at the upper 
levels, The result was that each year a high 
proportion of the existing project managers 
had to move up at least one managerial grade 
level, a much higher proportion than one 
would find being promoted that fast in most 
real organizations; seniority per se was a 
major factor in promotion. As a consequence, 
the degree of selectivity on the basis of ob- 
served performance tended to be much less 


agement were contrasted with those who were 
not. It was presumably, at least in part, be- 
cause of the artificially inflated tumover rate 
at the top that the managerial level differ- — 
ences were so small within the simulated 
organization, k: 

Yet, in spite of this inherent difficulty with 
the particular simulation used, the evidence 
in support of role-motivation theory as ap- 
plied to highly structured administrative ot 
bureaucratic organizations seems quite strong. 


Completion Scale emerged consistently be- 
tween those appointed to management po 
tions and those not appointed, On the o 
hand, the research provided no evidence in 
support of the applicability of the theory in 
a context of minimal organizational structure, 
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SUPERVISORY BEHAVIOR RELATED TO JOB SUCCESS 
OF HARD-CORE UNEMPLOYED OVER 
A TWO-YEAR PERIOD 


RICHARD W. BEATTY? f 


Graduate School of Business Administration, University of Colorado 


This study examined the impact of two levels of supervision on the job 
success (job performance and earnings) of 21 hard-core unemployed black 
women. Over a two-year period supervisory consideration, structure, and insight 
to intervene in hard-core unemployed interpersonal difficulties were measured. 
Hard-core unemployed black women’s initial perceptions of supervisors were 
related to early job success and were predictive of future job success. Super- 
visory consideration was positively associated with job success while super- 
visory structure was negatively related. It was concluded that attempts to be 
successful with the hard-core unemployed should not only encourage super- 
visory styles that are initially supportive of hard-core unemployed job efforts, 
but should refrain from imposing structure on a culture that may be un- 
accustomed to external, rigid demands on behavior. 


Y 


After Bloom and Barry (1967) reported 

that “hygiene” factors were more important 
_ to blacks than whites in entry-level jobs, one 
hygiene factor, the importance of supervision 
and the climate supervisors create, has been 
repeatedly mentioned as critical to the 
job success of the hard-core unemployed ? 
(Ehrenberg, 1967; Goodman, 1969; Hodgson 
& Brenner, 1968: Lawrie, 1969; Morgan, 
Blonsky, & Rosen, 1970; National Indus- 
trial Conference Board, 1969). In fact, ef- 
fective supervision appears to be so impor- 
tant that the Report of the National Ad- 
visory Commission on Civil Disorders (1968) 
recommended that supervisors be given spe- 
cial training to deal with the hard-core un- 
employed as one of the basic strategies to 
reduce urban unemployment rates. This rec- 
ommendation was not surprising because 
supervision has long been mentioned as a 


* Requests for reprints should be sent to Richard 
W. Beatty, Graduate School of Business Adminis- 
tration, University of Colorado, Boulder, Colorado 
80302. 

In order to be classified as hard-core unem- 
ployed, a person must be a member of a poor 
family and be unemployed, underemployed, or 
hindered from seeking work and at least one of the 
following: (a) school dropout, (b) minority group 
members, (c) under 22 years of age, (d) 45 years 
of age or older, (e) handicapped, or (f) subject to 
special obstacles to employment (U.S. Department 
of Labor, Manpower Administration, Request for 
Proposal MA-3, 1968). 
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critical variable in the performance of st 
employees, regardless of their employmen or 
cultural histories. 

Despite the attention given to the surer- 
vision of the hard-core unemployed, the la- 
tionship between supervisory styles and Erd- 
core unemployed job success has not een 
investigated over different lengths of tim: on 
the job, nor have established instruments 
been available to assess supervision. To date, 
the literature that exists in this area can be 
summarized by noting three variables that 
continue to be discussed: (a) the support the 
hard-core unemployed receives from a super- 
visor, (6) the amount of close or rigid super- 
vision given the hard-core unemployed, and 
(c) the supervisor's ability to recognize the 
need to intervene in hard-core unemployed 
interpersonal difficulties. The need for a gen- 
erally supportive supervisory climate was 
demonstrated by Friedlander and Greenberg 
(1971) and by the National Industrial Con- 


ference Board (1970) which found that the 


hard-core unemployed wanted supervisory 
Support in terms of friendliness, encourage- 
ment, and courtesies. This description seems 
to be no different than the consideration di- 


mension developed for all types of super- i 
visory personnel by the Ohio State Leader- 


ship studies (Stogdill & Coons, 1957). Close 
supervision has been mentioned as generally 
dissatisfying to the hard-core unemployed 


a 1973) and described as a means 
gaining social control over hard-core unem- 
trainees (Krosney, 1966), or as an 
attempt to control the hard-core unemployed 
due to differing perceptions of job behavior 
norms (Triandis & Malpass 1971), Thus it 
appeared that close supervision of the hard- 
core unemployed was similar to the defni- 
tion of initiating structure, the second super- 
visory dimension developed by the Ohio State 
Leadership studies. The final variable men- 
tioned, defined as social insight, has been cited 
as the ability to use cooperative behavioral 
skills to reduce conflict without resorting to 
hostility (Triandis & Malpass, 1971) or as the 
ability to intervene and mediate the work 
environment of the hard-core unemployed, 
such as in instances of hazing or discrimina- 
tion (National Industrial Conference Board, 
1970). 
In addition to reports indicating that the 
hard-core unemployed were influenced by 
their immediate supervisors on the above fac- 
tors, the National Industrial Conferencee 
Board (1970) reported that supervisory atti- 
tudes beyond the first-level (immediate super- 
visors) were important in successful hard- 
core unemployed programs, and Nealey and 
Fiedler (1968) found that second-level super- 
vision (managers) appeared to influence per- 
formance more than first-level supervision. 
Therefore, it appeared as if the level of 
supervision beyond the hard-core unem- 
ployed’s immediate supervisor might be im- 
portant for the hard-core unemployed job 
success. The purpose of this study, then, was 
to investigate over different lengths of time 
hard-core unemployed job success as related 
to two levels of supervision measured on di- 
mensions of consideration, structure, and so- 
insight. More specifically, four sets of 
val were measured: (a) the supportive 
relationship a supervisor provides, (5) the 
closeness of supervision or the degree to 
which a supervisor defines programmed and 
controlled hard-core unemployed job behav- 
ior, (c) sensitivity of the supervisor to inter- 
vene in hard-core unemployed interpersonal 
difficulties, and (d) criteria of hard-core job 
success. These measures were available for all 
= variables after six months and again after 
two years. 
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METHOD 
Program and Participants 


The organization studied was the home office 
a large insurance company in the Midwest. 
company’s president was the metropolitan 
chairman of the National Alliance of Busine 
JOBS program which at least gave the organit 
hard-core unemployed effort visible support from ü 
top. The company had over 800 clerical p 
tions for which 41 black women who met hard 
core unemployed criteria were hired through funding 
from the manpower administration of the US, 
Department of Labor. Thirty-one of these wom 
were with the company after six months and 2 
remained two years later. 


To measure the consideration and structure d 
sions both the Supervisory Behavior Description 
the Leadership Opinion Questionnaire were 
The Supervisory Behavior Description (Fleishi 
1953, 1957a, 1972) and Leadership Opinion Ques 
naire (Fleishman, 1957b, 1960) were administered to 
obtain measures of how the hard-core unemplo} 
perceived the behavior of her supervisor (Supe 
visory Behavior Description) and the supervisor's 
attitudes toward supervision (Leadership Opinion 
Questionnaire). The use of both instruments fol- 
lowed the suggestion of Greenwood and McNamara 
(1969) who advocated the collection of sel- perceived 
and others-perceived data on supervisory style. 
Fleishman and Harris (1962) defined consideration — 


as: 


behavior indicating mutual trust, respect, and a 
certain warmth and rapport between the super- 
visor and his group. This does not mean that 
this dimension reflects a superficial “pat on the 
back,” “first mame calling,” kind of human 
relations behavior. This dimension appears to 
emphasize a deeper concern for group members’ 
needs and includes such behavior as allowing 
subordinates more participation in decision 
making and encouraging more two-way com- 
munication [p. 43]. 


Procedure 


Supervisors’ attempts to structure hard-core un: 
employed job efforts were measured by the initiating — 
structure scale of the Supervisory Behavior De- 
scription and Leadership Opinion Questionnaire 

The turnover rates for these hard-core un 
ployed clerical workers were less than that for oth 
clerical workers employed by the company. 

4 All instruments were reworded for administra: 
tion to the hard-core unemployed. The changes 
were made by a similar group of hard-core un 
ployed at the local Concentration Employment 
Program office. Care was taken to change only lan- 
guage and not intent of a phrase. For example, in 
some instances “criticizes” was changed to “hassles” 
and in other instances it was changed to “told you.’ 
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instruments 
Structure as: 


Fleishman and Harris (1962) defined 


behavior in which the supervisor organizes and 
defines group activities and his relation to the 
group. Thus, he defines the role he expects each 
member to assume, assigns tasks, plans ahead, 
establishes ways of getting things done, and 
pushes for production. This dimension seems to 
emphasize overt attempts to achieve organiza- 
tional goals [pp. 43-44]. 


Supervisory sensitivity to intervene in hard-core 

_ unemployed interpersonal difficulties, social insight, 

Was measured by Chapin’s (1942) Social Insight 

Scale, which was administered to supervisors. Social 

insight (Chapin, 1942) has been defined as the 
ability to recognize 


the need to adjust group conflicts or tensions 
such as known remarks to relax a dangerous 
intensity, a suggested compromise to attain 
temporary agreement, a face-saving remark to 
_ avoid embarrassment and to preserve status, or 
to discover the missing part to complete a 
thought [p. 214]. 
: 


_ Gough's (1965) study of the Social Insight Scale 
C indicates that high scorers were seen as perceptive 
and imaginative, quick to respond to nuances, and 
quick to sense what others think and feel; con- 
versely, low scorers were seen as inhibited, cautious, 
bound by convention, and lacking in verve and 
independence. 

The job success variables were measures of hard- 
core unemployed job performance and weekly earn- 
_ ings. The job performance measure was the com- 
‘Pany’s standard performance-review form used for 
all clerical employees, After six months and again 
After two years of supervising the hard-core unem- 
Ployed, supervisors recorded job-performance data 
in terms of specific job behavior pertaining to abil- 
ity and productivity. The company’s performance- 
evaluation form conformed to the recommendations 
of the American Psychological Association task force 
for criterion variables (Baxter, Bray, Carp, Dun- 
nette, Ebel, Messick, & Wesman, 1969) by avoiding 
the use of subjective measures of minority perform- 
ance (eg, cooperation) in favor of the actual be- 
havior necessary for successful job performance (e.g., 
filing and typing for clerical jobs). In addition to 
reflecting performance with relative objectivity, the 
form also demonstrated validity when correlated 
with performance rankings by the company’s train- 
ing coordinator (Kendall’s 7 = .57, p< 01). 

The weekly earnings measures were available both 
at the six-month and two-year intervals, but at six 
months all hard-core unemployed had the same 
earnings and thus earnings were not a meaningful 

criterion. However, after two years there was suf- 
ficient variance in earnings for it to be used as a 

criterion variable. In fact, after two years on the 
job, the weekly earnings measure became the more 
important criterion because the hard-core unem- 
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ployed were placed in different levels of jo! 
demanded different skills for which they 
different wages. The performance-review 

then less appropriate because it was prin de- 
signed to compare employees in the sa 
Thus, a new employee or a low pe 
high-skill job could receive a low 

performance-review form while an expe m- 
ployee in a low-skill job might recei h- 
performance score, although not earn 

the employee in the high-skill job. The pe 


review score was therefore a relative score ed 
to other employes at the same job level the 
weekly earnings measure reflected the wor the 
job to the company as well as additional ngs 


adjustments for performance variance within 
job level. The weekly earnings measure tl eby 
became the more important criterion for emy 
with two years of experience. Job-perior ce 
evaluations were then considered the criteri 
early hard-core unemployed job success 1 
earnings the criterion for later job success 


RESULTS 


No significant relationships were found be- 
tween hard-core unemployed job success and 
supervisory consideration, structure, or social 
insight when supervisors recorded their own 
scores. The failure of significant relationships 
to be found might be explained by the fact 
that the instruments (the Leadership Opinion 
Questionnaire and the Social Insight Scale) 
measured only supervisors’ self-perceptions 
and did not measure how the supervisors 
were actually perceived by the hard-core un- 
employed as did the Supervisory Behavior 
Description. Therefore, supervisors might per- 
ceive themselves as high in consideration, low 
in structure, or high in social insight but not 
be seen the same by their hard-core unem- 
ployed employee. Obviously how the hard- 
core unemployed perceives the attitudes and 
behavior of a supervisor may have far greater 
influence on hard-core unemployed job suc- 
cess than does a supervisor’s self-perception. 
Also, in the case of the social insight measure 
its failure to be related to hard-core unem- 
ployed job success not only might be attrib- 
uted to the hard-core unemployed not feeling 
that this variable was important, but also 
to the age of the instrument and/or the ex- 
treme difficulty of measuring such a variable, 
especially where cultural differences may be 
involved. 

Hard-core unemployed perceptions of su- 
pervisory behavior did, in fact, demonstrate 


TABLE 1 
ATIONS BETWEEN FiksT- AND Sxconp-Lever Supervisory Bexaviok Descerrmox Scones 
Axo Harp Cour U NEMPLOYED Jos Svecnss 


Hard-core unemployed job success 


———— 
First-level supervision | Second: Jee servis 


Consideration | Structure | cei | sin — 


6 upervision with 6-month performance 
31) 
6 supervision with 2-year performance 
21)* 
2 ipervision with 2-year performance (V = 21) 
0 Supervision with 2-year earnings (V = 21)* 
2 t be = with 2-year earnings (V = 21) 


——ͤ— —— 
) Insure = the reduced sample size had not influenced the significant levels of the six-month measures, the A · mont li te 
1 tested with the reduced sample sizes. All relationships remained significant 


< 01. 


ral significant relationships (Table 1). 
er six months on the job hard-core unem- 
ed perceptions of a first-level supervisor's 
sideration and structure were concurrently 
ited to hard-core unemployed job per- 
rmance, Consideration demonstrated a posi- 

e relationship with performance, but struc- 

ire was negatively related. 

Two-year hard-core unemployed job-per- 
mance measures were neither predicted by 
the six-month Supervisory Behavior Descrip- 
tion consideration and structure scores nor 
concurrently related to two-year measures of 
supervisory consideration or structure. How- 
ever, six-month measures of supervisory con- 
sideration and structure were highly predic- 
tive of the other job success variable, weekly 
earnings after two years on the job, Again 
consideration was positively correlated and 
structure negatively correlated, but the rela- 
tionships held for both first- and second-level 
supervisors. In addition there tended to be a 
negative concurrent relationship between the 
hard-core unemployed two-year perceptions 
of supervisory structure and two-year weekly 
earnings. 


Discussion 


The hard-core unemployed’s experience 
with supervisors the first six months on the 
job appears to be important for initial and 
future hard-care unemployed job success. The 
nature of the relationship seems to be that 
when the hard-core unemployed perceive a 
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supervisor as considerate or supportive the, 
tend to perform more successfully, whereas 
when a supervisor is perceived as highly de- 
fining or structuring work activities, he 
hard-core unemployed tend to perform less 
successfully. The positive relationship with — 
consideration seems to be consistent with the 
findings of Friedlander and Greenberg (1971) 
and the negative relationship with structure 
consistent with Goodale (1973). oa 

It was further indicated that the concur- 
rent relationship between supervisory con- 
sideration and hard-core unemployed job 
sucess does not remain after the hard-core 
unemployed’s early experience with the com- 
pany. This relationship breakdown may be 
due to the hard-core unemployed’s need for 
supervisory consideration early in their ex- 
perience with the organization; however, 
later on other sources of positive reinforce- 
ment, such as the work itself or peer support, 
may explain more of hard-core unemployed — 
job success. Supervisory structure did, how- 
ever, continue to have a tendency to be con- 
currently and negatively related to hard-core — 
unemployed job success which may indicate 
that even after two years on the job the 
hard-core unemployed may still not respond 
favorably to supervisory attempts to imposi 
structure on their work efforts. 

The finding that six-month consideration 
and structure scores were both predictive of 
two-year hard-core unemployed earnings fur- 
ther reinforced the indication that the initial 
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relationship with a supervisor is crucial to 
hard-core unemployed job success. What may 
be more important was the finding that not 
only first-level supervisory behavior but also 
second-level supervisory behavior was pre- 
dictive of later hard-core unemployed job 
success. This finding provides support for the 
importance of supervisory climate for the 
hard-core unemployed at all levels of an orga- 
nization and confirms the results of Morgan, 
Blonsky, and Rosen (1970). 

Thus, this study seems to indicate that 
programs attempting to be successful in the 
employment of the hard-core unemployed 
should, in the initial stages, opt for super- 
visory styles that are supportive of the hard- 
core unemployed job efforts, and perhaps per- 
manently refrain from imposing structure on 
a culture that may be unaccustomed to exter- 
nal, rigid demands on their behavior. The 
study also demonstrates that when longitudi- 
nally investigating the effects of supervision 
on employee success, it may be useful to (a) 
study levels of supervision beyond the im- 
mediate, (4) use employee perceptions of 
supervisory behavior rather than supervisors’ 
self-description, and (c) examine criterion 
alternatives. 
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A RELATIONSHIP BETWEEN LEADER INTELLIGENCE 
AND LEADER RATED EFFECTIVENESS 


LOUIS S. CSOKA 
United States Military Academy 


5 


The relationship between a leader's intelligence and his performance has been 
consistently low, However, these disappointing results can be explained by 
the contingency model (Fiedler, 1967, 1971) which postulates that the leader 
effectiveness depends on his motivational style (least preferred co-worker) 
and the favorableness of the leadership situation. The approach in this paper 
conceptualizes intelligence as the leader's ability to Integrate his experience 
8 and thus provide him with skills to deal effectively with the technical aspects 
of the tasks and his interpersonal relationships. Four successive studies showed 
that the relationship between leader intelligence and performance depended on 
the leader’s motivational style (least preferred co-worker), his experience, and 
the leader- member relations. 


8 


8 eff been t pically the leader is either an insignificant factor or, A 
i ermined eithe onal attributes as this si attempts to L it is 
alone or by some combination of the char- poorly used in 2 
= acteristics of the situation and personal at- Recent studies by * (1971) and 
tributes. Campbell, Dunnette, Lawler, and Csoka and Fiedler (1972) have shown. that 
Weick (1970) provi convincing evidence the leader’s experience and training can be 
of the limited snp le aed y the interpreted in light of the contingency N. 
leader’s personal att intelli- This theory postulates that the effectiveness 
gence. The relationship between the leader’s of the group depends on (a) the personality, 
intelligence and his group’s performance has or more specifically, the motivational system 
been generally low and inconsistent. Stogdill of the leader measured by his least preferred 
(1969), in a review of personal factors asso- co-worker score, and (5) the situational 
ciated with leadership, found a positive but ſavorableness or the degree to which the situ- 
rather small relationship (median r of .28) ation gives the leader power and influence. 
between intelligence and leadership. Holling- Leadership training can be interpreted as 
worth (1926) found that extreme discrepan- improving the favorableness of the situation 
cies between the intelligence of leaders and by increasing the power and influence of the 
their followers militate against the successful leader. The contingency model predicts that 3 
exercise of leadership. Mann (1959) reported the additional power and influence is bene- 
the typical relationship between intelligence ficial to certain leaders under some conditions 
and leadership to be between .10 and .25. and detrimental under other conditions. Spe- 
This finding suggested that the intelligence of cifically, the model predicts that task-moti- — 
vated leaders perform best in very favorable 
“This study was conducted under Contract nd unfavorable situations while relationship- 4 
1$00014-67-AO103-0012, Office of Naval Research, motivated leaders perform best in intermedi- 
Department of the Navy, and Contract NOO14-67- ate situations. It can be said that training 
A0103-0013, Advanced Research Projects Agency, jeaders for jobs which fall into the favorabl 
Shee a Research (Fred E. Fiedler, principal situation improves the performance of task: 
2 The 105 expressed in this paper are those of motivated leaders while it decreases th 
the author and do not reflect thinking of the performance of relationship-motivated lead 
Department of the Army or of the United States ers. It would not be surprising if the leaders 
Military Academy. intelligence plays a somewhat similar role in 


„Requests for reprints should be sent to CPT Louis chaning the utavorabléness of theres ituation 
S! Csoka, Office of Military Psychology and Leader- 5, 15 11 e e 
Ship, United States Military Academy, West Point, °Y ying xe 
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be related to leader-rated effectiveness only 
insofar as the leader’s motivational style 
(least preferred co-worker), his experience, 


into account. The complex interactions should 
reveal the following relationship between 
leader intelligence and rated performance: 


Group Least preferred 
atmosphere Experience co-worker Rated effectiveness 
High High High High intelligence—Lower 
1115 High Low High intelligence—Higher 
ee Low High High intelligence—Lower 
ig Low Low High intelligence—Higher 
507 me High High intelligence—Higher 
590 ig! Low High intelligence—Lower 
ow Low High High intelligence—Higher 
Low Low Low 


High intelligence—Lower 


1 


cognitive information that must be properly tively structured task (obj defined) 
encoded, structured, and categorized in order should seem initially unstruc r an in- 
to make its use possible. This encoding and experienced, low-intelligenc iual. It 
structuring requires intelligence. In effect, an appears that not only the obj 1aracter- 
interaction of intelligence and experience pro- istics of the task should be red but 
vides a form of on-the-job training. An in- also the subjective characterist he indi- 
telligent leader can make use of his experi- vidual, that is, experience an lligence. 
ence more effectively than a leader with low The base of knowledge with w e leader 
intelligence, For example, the remark has enters into the situation must t > degree 
often been made—sometimes facetiously— alter the nature of the task ierefore, 
that some people do not learn from their the favorableness of the situatio erefore, 
experience. An analysis relating leadership in terms of the contingency | intelli- 
experience to changes in performance for rela- gence is an asset for leaders i e situa- 
tively intelligent and less intelligent leaders tions but a liability in others ries o 
should permit us to infer whether the less studies have been conducted these 
intelligent leader is unable to integrate, and hypotheses. 
therefore, use his leadership experience. In Hypothesis 1. In this study ence is 
contrast, the intelligent leader might be able viewed as that which enables der to 
to make good use of his leadership experi- profit from his experience. Thus lligent 
ence and thereby increase his power and in- leaders with long experience Sho! ee the 
fluence. These analyses could explain the in- task as structured while relatively intelli- 
consistent results in past research relating gent leaders with experience shou > the 
intelligence and leadership effectiveness, task as less structured. Inexperien aders 
Intelligence should be viewed as an inter- should, of course, see the task as tively 
vening variable that modifies the leader’s ex- unstructured regardless of their ini- ence. 
“perience and hence the favorableness of the Therefore, in terms of the contingen: model 
Situation. The change in the situational fa- the following relations between les pre- 
vorability occurs through the modification of ferred co-worker and performance <n be 
the task structure dimension. Even a rela- predicted: 
Group Task structure Position Predicted 

Octant atmosphere Experience Intelligence Power relationship 

1 High High High High — 

III High Low High/Low High = 

V Low High High High ae 

VII Low Low High/Low Weak = 

Hypothesis 2. Leader intelligence should and the leader-member relations are taken 
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Leader member relations 


Good Í Fone 
Experience 
Leader — — — 
High Low i High | Low 
4 Intelligence 
Octant I | n [Ocan HE) w | Octane V | n | Octant VEIL 
X Vicld artillery section chiefs =H | 6| —s2 ji) s u] S8 
> Navy maintenance supervisors =52 10 —.SS* 18 b 60 | —23 
gı Company commanders = {u| —.70* |19 s „ —.S3° 
7 ompany mess sergeants — $9 | 9 | —.67** | 22 a7 10 f = 20 
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METHOD 
Suhjects 


‘Tho studies were conducted with subjects in 

various military situations. Speciñcally, they were: 

(a) 55 sergeants in charge of field artillery sections, 

(b) 55 navy petty officers in charge of navy avia- 

tion maintenance shops, (c) 58 unit commanders 

usualiy with the rank of captain, and (d) 52 ser- 

geants in charge of army mess units. A series of 

questionnaires was administered to obtain least 

preferred co-worker scores, the leader-member rela- 

tions (Group Atmosphere scale), leadership experi- 

ence, and relative intelligence. It might be noted 

that all leaders had relatively high position power. 

The methods and procedures in each of the studies 

were almost identical. The questionnaires were 

administered to the subjects in one session for each 

study. Even though the methodology was similar, 

the leadership situation for each group of subjects 

was diverse. While the leaders in three groups were 

ine supervisors, the company commanders 

econd-line or third-line supervisors. 

Preferred Co-worker (LPC) scale. Subjects 

were asked to think of all men with whom they had 

ever worked and then to describe the one person 

with whom they could work least well. These de- 

scriptions were made on a standard 22-item, bipolar 

© adjective scale. A high score indicated a basic 

motivation to relate to others while a low score 
indicated a task-motivation (Fiedler, 1967). 

Group Atmosphere scale (GA). This score was 

obtained by asking individuals to describe on a 10- 

item, bipolar scale, the atmosphere of their work 

= group. The score reflected the degree to which the 


Octant 1 is high intelligence; Octant III is high/low intelligence; Octant V is high intelligence; Octant viiki 
ce. 
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leader felt that the group was loyal and supporti 
of him (McNamara, 1968). The cutting score of 
high versus low was based on Posthuma’s (1970) 
finding for real-life groups which showed a ial 
group atmosphere score of .655. 

Intelligence. A shortened version of the H 
Nelson Mental Ability test was administered. The 
42-item measure contained both verbal and quanti 
tative scales and had a split-half reliability of .9 
and .76 for the verbal and quantitative items, re- 
spectively. This score bas been shown to be posi- 
tively correlated with the AGC test scores used by 
the services to measure intelligence. > 

Situational favorableness. Situational favorablet 
was typically measured on the basis of three 
mensions: (a) leader-member relations, (b) t 
structure, and (c) position power. Groups 
classified into the upper or lower half on each 
mension; each of these dimensions could have b 
scaled. The three-dimensional system resulted in 
ordered set of eight octants scaling the situatii 
favorableness from very favorable to unfavo 
The present studies predict that experience and 
telligence interact in affecting the leader's under. 
standing of and ability to handle the task (Csoka 
& Fielder, 1972). The intelligence and exp 2 
interaction modified the task-structure dimension. 

Performance measures. Two or three superio 
rated each leader’s performance with high inte! 
rater agreement among subjects. The rating s 
consisted of eleven items which were developed 
collaboration with the organizations to assess t 
relative effectiveness of the leaders on critic 
aspects of the job. 
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Ficure 1. Mean performance scores for high- and 
low. intelligence leaders in Field artillery crews. 
(Abbreviations: LPC = least preferred co-worker, 
EXP = experience, GA = group atmosphere score, 
HI= high, LO = low, INT = intelligence). 


RESULTS 


It was hypothesized (Hypothesis 1) that 
the relationship between least preferred co- 
worker and rated effectiveness should be de- 
pendent on the situational favorableness as 
defined by the leader-member relations, posi- 
tion-power experience, and intelligence. Table 
1 shows the results of the four studies. All 
orrelations between least preferred co-worker 
d performance were in the predicted direc- 
with 10 of the 16 correlations reaching 
nificance. In terms of Fiedler’s contingency 
el, the results showed that the degree to 
hich experience improved the favorableness 
the situation was contingent on the intelli- 
ence, and motivational TR (least preferred 


Lo HI Lo HI Lo Hi LO 
LO HE LO 
Hi LO 


Ficure 2, Mean performance scores for high- and 
low-intelligence leaders in Navy maintenance crews. 
(Abbreviations: LPC = least preferred co-worker, 
experience, GA= group atmosphere score, 
igh, LO = low, INT = intelligence). 
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FIGURE 3. Mean per 
low-intelligence leaders for Arm) hall stewards. 
(Abbreviations: LP( preferred co-worker, 
EXP = experience, voup atmosphere score, 
HI= high, LO= low intelligence). > 


had low intelliger key did not seem to” 
gain from their e ce. In fact, the less 
intelligent leaders conditions which 
could be predicte< n the contingency’ 
model, received performance ratings 
than did more inte leaders. 
Hypothesis 2 pre: the nature of 
relationship betwee. r intelligence and 
rated effectiveness in the operati 
framework of the c: ncy model. As F 
ures 1 through 4 show, the results do support 
the hypothesis. The relative rated perform- 
ance of high- and low-intelligence leaders 
changes depending on their least preferred” 
co-worker, experience, and leader-member re- 
lations. The data showed that high intelli- 
gence, in and of itself, did not in fact lead to 
higher-rated perfor e scores but rather 
depended upon the complex interaction of 
several variables. 4 


DISCUSSION 


It has been known for some time that 
leader intelligence does not uniformly in- 
crease leadership effectiveness. The present 
studies enabled one set of conditions under 
which intelligence is related to performance to 
be specified. Intelligence needs to be viewed 
as a variable that modifies one’s ability t 
deal with a specific task. This approach di 
not significantly alter the traditional objec 
tive measure of task structure as described i 
the contingency model. However, it did mod 
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VISURE 4. Mean performance scores for high- and 
-intelligence leaders for Army company com- 
iders. (Abbreviations: LPC = least preferred co- 

her, EXP = experience, GA = group atmosphere 

ore, HI = high, LO = low, INT = intelligence). 


fy the measure to the extent that each indi- 
vidual saw the task as more or less structured 
depending on his knowledge, which he gained 
from training and experience. Intelligence 
was conceptualized as the ability to integrate 
one’s experience. It was found that leaders 
with high intelligence were not necessarily 
che higher performers and that when they had 
very little experience, their intelligence was 
not beneficial. In effect, the less intelligent 
leaders could not use their experience and 
the inexperienced leaders could not use their 
intelligence in better directing the group. 

In accordance with the findings, the inter- 
actional effects of the leader’s motivational 
style (least preferred co-worker), experience 
and leader- member relations showed that high 
leader intelligence was beneficial to perform- 
ance only for low least preferred co-worker 
leaders in favorable situations (high group 
atmosphere, high experience, strong position 
power) and for high least preferred co-worker 
leaders in situations of intermediate favor- 
ableness (Low group atmosphere, high experi- 
ence, strong position power). The findings 
also showed that less intelligent leaders ac- 
tually performed better in some conditions 
than do more intelligent leaders (low group 
atmosphere, low experience, weak position 
power). 

The results seem to indicate that leader 
intelligence should be viewed as a moderating 
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variable which affects the usefulness of one's 

experience. This, in turn, should substantially 
influence the relative expertise that a leader 
has in any given situation. Such expertise is 
essential in the determination of the power 
and influence of a leader over his members. 
This power and influence was differentially ` 
related to leader effectiveness according to 
the contingency model. Within the concep- 
tual framework of the model, the four studies 
shed light on the complexity of any relation- 
ship between leader intelligence and per- 
formance. The generalizations that can be 
made from these findings may be somewhat 
limited due to the similarity of the military ` 
subjects in each set of data. At present, the 
results are meaningful only within the tight 

operational and theoretical framework of. 
Fiedler’s contingency model. Future research 
should focus on alternate approaches to this 
relationship in hopes of expanding the mean- 
ing of intelligence in leadership research. F 
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CONFLICT AND ATTITUDES TOWARD THE OPPON 


AN APPLICATION OF THE COLLINS AND HOYT 
ATTITUDE CHANGE THEORY TO GROUPS 
IN INTERORGANIZATIONAL CONFLICT ! 
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Purdue University 


Thirty-four four-man teams participated in an experimental simulation of ar 


internation conflict. Each team was subdivided into two decision mak: 
(central roles) and two information handlers (peripheral roles). In keepir 
with the attitude change theory of Collins and Hoyt (1971) 


that this task allocation, 
tion between subteams, 


„it was predicte< 


with a possible additional factor of physical separa 
would be a determinant of the players’ emergen 


attitudes toward an opponent team. The results were consistent with th 
prediction. Teams which engaged in actions with aversive consequences for < 
opponent held more negative attitudes toward that opponent than tean 
which (although equal in all other respects) were prevented from direcil: 


engaging in these actions, 


Man, as a social animal, spends much of 
his time in groups. These groups interact with 
other groups in a variety of relationships. 

Some of the relationships reflect cooperation, 
others competition, and still others neither or 
both. The research teported in this study 
deals with small groups in a competitive rela- 
tionship with one another, and with the 
differential effects of this competition on 
attitude formation in members of various 
subgroups. 

Research on intergroup conflict has been 
primarily concerned with the identification of 
the causes of conflict and with modes of its 
resolution (e.g., Deutsch & Krauss, 1960, 
1962; Sherif, Harvey, White, Hood, & Sherif, 
1961). Relatively little interest has been 
expressed in the consequences that intergroup 
conflict has for the group members them- 
selves. These consequences may take a vari- 
ety of forms. The present research focuses 
specifically on the attitudinal consequences of 


+ This research was supported by a contract be- 
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participation in conflict. Streufe:: Sand- 
ler (1971) have shown that the ‘udes of 
decision-making groups toward oppo- 
nents tend to be rather negative flecting 
the mirror image perceptions pro; sed by 
Bronfenbrenner (1961) and White 66). Tt 
is not known, however, how this negativity} 
toward one’s opponents is initially generated. 


Some relevant information may b 
by reference to studies on the forn 
change of attitudes as a function of 
or commitment to behavior, For i 
number of studies in the area of 

theory have demonstrated that, foll 
performance of a given behavior, an indi- 
vidual may alter his attitudes to achieve con- 
sistency with the behavior he has performed 
(e.g., Brehm & Cohen, 1962). Kiesler (1971), 
in a somewhat similar vein, has demonstrated 
the importance of commitment to behavior as 
a determinant of the amount and direction of 
change in task relevant attitudes. Weick 
(1966) offered evidence that acceptance of 
an assigned task often involves a kind of 
attitude change toward the task, which serves 
to make cognitions consistent with antici- 
pated behavioral requirements. These studies 
Suggest that attitudes may change consequent < 
to performance or commitment to perform- 

ance of a given task. If we translate this 

latter proposition into the area of intergroup 
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ct, it suggests that members of groups 
; can engage (and have engaged) in com- 
tive actions directed toward an opposing 
p should hold more negative, that is, task 

evant, attitudes toward their opponents 
an members of groups who are unable to 
wage (and have not directly engaged) in 
impetitive actions. 
Collins and Hoyt (1971) have recently 
fered a theory of attitude change that 
pears relevant to this proposition. These 
uthors suggest that an act must be performed 
ften obtained via a forced compliance ma- 
ipulation where the subject believes he can 
fuse the action, but usually does not) in 
e presence of perceived choice and aversive 
onsequences; without these conditions atti- 
tudes will not be modified. The decision- 
making groups of Streufert and Sandler 
(1971), for example, engaged in aggressive 
decision making toward their opponents. The 
attitudinal negativity of these groups’ views 
of their opponents may consequently be a 
reflection of the “attitude change” (here in a 
negative direction) that, according to the 
Collins and Hoyt theory might have been ex- 
pected. Streufert and Sandler’s subjects made 
aggressive decisions; they did have a choice; 
and they expected aversive consequences for 
their opponents. Thus, these actions meet the 
requirements of the theory. 

In many settings, for example, in organiza- 
tions, group tasks are subdivided among sev- 
eral hierarchical or lateral subgroups. Only 
some of these subgroups might be involved 
in actual competitive or aggressive activities. 
If the theory of Collins and Hoyt, and the 
derivations of other data (previously men- 
tioned), hold, then one may expect differen- 
tial degrees of negative attitudes toward an 
opponent among divergent (cooperating) sub- 
groups, depending on the activity they per- 
form, and the perceived choice and conse- 
quences. These attitudes may result in the 
communication and coordination problems 
found in organizations (cf. Walton, Dutton, 
& Cafferty, 1969). While decision-making 
groups that actually engage in competitive 
acts would be highly negative toward op- 
ponents, one might expect that other groups, 
for example, those involved in the gathering 
of information, to show much less negativity 
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because they feel less personal responsibil 
for the consequences of the group decision, 
The present research design specifically tests 
this hypothesis. 

The comparison between decision-making 
groups and lateral subgroups that are 
involved in the competitive or aggressive de- 
cision-making process, however, results in a 
potential confound. Even though environment 
and information flow to both kinds of groups 
may be held constant, instructions to the two 
kinds of subgroups are necessarily different. 
The very fact that decision-making groups 
can make decisions, and other lateral sub- 
groups are not able to make decisions, might 
produce differences in behavior that are due 
to instructional variables and not decision- 
making activity per se. To test for the Col- 
lins and Hoyt theory, control for instructio: 
is required. One way to achieve this control 
is to introduce group size as an additional 
variable. If all decision-making groups are 
specifically instructed to make decisions, and 
if all information handling groups are spe- 
cifically instructed to handle information (ie,, 
not to make decisions), but if half the infor- 
mation handling groups are de facto permitted 
to engage in decision making, then the design 
would be adequate. This result can be 
achieved by placing half the information i 
handling subgroups together with the decision- 
making subgroups (so that the information 
handlers can in effect make decisions, have 
access to decision-making materials, etc.), and 
by separating the other half of the informa- 
tion-handling subgroups from their decision- 
making lateral subgroups. Such a manipula- 
tion results in a 2 X 2 design (task allocation ` 
by group size). Divergent predictions woulda 
be made for group size effects, instruction 
effects, and the effects predicted via the Col- 4 
lins and Hoyt theory. If differences obtained 
in the data collected with such a design were d 
due to instructions (different for decision 
makers (DM) than for information handlers 
(IH) no matter whether separated (2 + 2) 
or working together (4)), then one woul 
expect that a measure of attitudinal negativ- 
ity toward the opponents would show that 4 


DM2 2 = DMA e IH2 A 2 IHA. 
S 


If any obtained differences were due to group 


n 
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size, one would expect that 
DM4 = IH4 DMZ + 2 = IH2 + 2. 


If the theory of Collins and Hoyt holds, then 
information handlers who are placed with the 
decision makers into a four-man group and 
consequently are able to participate in the 
‘decision-making activity should show more 
negative attitudes toward their opponents 
than information handlers who cannot share 
in the decision-making activity (i.e. those 
Who are placed in a pure two-man informa- 
 tion-handling group). Further, the negativity 
_ of information handlers who can share in the 
decision-making activity (in spite of their 
instructions) should be as great (or nearly as 
great) as the negativity found in decision- 

g groups. Finally, if there were no 


be equally negative toward their opponents, 
no matter whether they work in two-man 
(separated from the information 
andlers) or in four-man groups (together 
th the information handlers). In other 
words, derivation from the Collins and Hoyt 
theory and the finding of Streufert and 
Sandler would predict that negativity would 
differ as follows: 


ngs would be supported if groups in the 
+ 2 condition show less negativity than 
g s placed in the other three conditions. 
_ As stated previously, the Collins and Hoyt 
(1971) theory suggests that performance of 
an act that has aversive consequences for 
another would be reflected in attitude change. 
is not entirely clear whether this attitude 
change would be a gradual process, producing 
increasing negativity with repeated perform- 
ance of acts with aversive consequences, or 
whether the negative attitudes would be 
formed as the outcome of one or very few 
initial actions. (B. Collins, personal communi- 
cation, 1972, favors the latter interpretation.) 
Further, one might expect that the degree to 
which information handlers engage in decision- 
making activity should covary with the degree to 


which scores for IHA and DM4 are similar. This 
position is not tested in this study. 
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To check on the alternatives, rep: 

sures of attitudes after time interv 

length (during which subjects d 

engage in actions which have ave 0 
quences for the opponent) are i d 
this research. 

To test the applicability of the 
Hoyt (1971) theory, an experimen 
tion setting was chosen. The applic 
this research me o organizatio 
lems has been dis in detail in 
book chapter of Fromkin and Strev ert 


press). 
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One hundred and thirty-six undergra 
teers from a large midwestern university 
in the Tactical and Negotiations Game 
decision-making environment developed 
fert and others (Streufert, Castore, & Ki 
Streufert, Clardy, Driver, Karlins, Schrod 
feld, 1965; Streufert, Kliger, Castore, 
1967). In the Tactical and Negotiations © 
jects were given responsibility to make 
about military, economic, intelligence, a: 
matic affairs in an experimental simulai< 
scale internation conflict with some 
characteristics. Thirty-four four-man tea 
formed. The teams spent two hours reading 
ual about the game. The purpose of the 
was twofold: (a) to expose subjects to t! 
plexities of the conflict with which they l+ 
to deal, and (b) to provide two hours of con 
preexperimental exposure for all subjects 
attempt to somewhat equalize preexperimen!s 
periences. 

After the reading period, subjects in each team 
were told that they would be assigned in t a 
subgroups to one of two different tasks. One sub 
group was to function as a decision-making group; 
the other subgroup was to function as an informa 
tion-handling group. For 17 of the four-man teams 
this was the only task manipulation employed. These 
teams remained operative as four-man teams, eve 
though their assignments to subgroups varied. Typi- 
cally these teams paid relatively little attention to 
their assignments, and team members jointly filled 
information-handling and decision-making function 
For the other 17 teams an additional task manipu- 
lation was employed. The subgroups were sepa 
rated from each other, Each of the two-man su 
groups were placed in identical rooms. The rooms 1 
which information-handling subgroups were place 
however, did not contain the forms necessary fi 
decision making. Communication between subgrou 
was possible via telephone. They were permitted 
make as many calls as they wished. Calls wel 
allowed in both directions; information acquisitio 
groups were able to call their decision-making 
group and vice versa. 


ects believed that they were playing the 

a) and Negotiations Game agaimt another 

an team, In reality, information received by 

cam was preprogrammed. All information re- 

by a team was submitted to the team's is- 

ion-handling subgroup via a mail slot. This 

up was then responsible for forwarding the 

mation to the team’s decision-making sub- 

p. This procedure was accomplihed via the 

hone in the subgroups-apart condition. Teams 

written decisions that were transmitted to the 

cimenters via a mall slot. Subjects believed that 

outcome of the events in the Tactical and Nego- 

ons Game was greatly influenced by their deci- 

s (Subjects attributed more than $0% of the 

sality for the programmed events to decisions of 

ir own and the opposing teams rather than to 

iom events, characteristics of the environment, 
arbitrary decisions of the experimenters.) 

The experimental simulation was conducted in six 

minute playing periods, with short intermissons 

fier each period, During the playing periods, the 

ams received written messages from the experi- 

renters, These messages contained programmed in- 

ormation dealing with the outcomes of the teams" 

decisions and with various supposed moves by their 

fictitious opponents. All teams received the same 

number of messages over the course of the game, 

but the order in which the programmed mesages 
were submitted was randomized across teams, 


Measurement of Dependent Variables 


During the intermission following each playing 
period, the subjects individually filled out a report 
form consisting of several and scales. 
Among these was a series of ten semantic differen- 
tial scales on which the subjects were asked to rate 
their opponents. Three of these were evaluative 
scales that have been found to be highly intercorre- 
lated in previous research (Streufert, 1965, 1966). 


When averaged, these scales provide an of 
favorable evaluation or liking. The scales used were 
good-bad, bright-dark, and nice-awful. 


REsuLts 


The data were analyzed with a three-way 
mixed design analysis of variance technique 
containing the following factors: (a) group 
size (two levels, between), (ö) task assign- 
ment (two levels, between), and (c) periods 
(six levels, within). No significance was ob- 
tained for the periods main effect (F = 1.05) 
or any of its interactions (all Fs < 1.0). To 
check on the applicability of the Collins and 
Hoyt theory on group conflict, the prediction 
that IH2 + 2 < IH4 < DM4 = DM2 + 2 
was tested. Analysis of variance main effects, 
interactions, and error term for the Group 
Size x Task design summed over periods were 


obtain! to permit planned o 
among the four levels contained in 
Check for Confounding Efects 
A main effect for task representing 

effects for both 2 + 2 and four-man group 
would have indicated that an instruction 
confound was present in the data. TI 
main effect did not reach significance (F 
2.63, df 1/132, ms). A main effect 
group size that reflected changes in attit 
for both IH and DM groups would have 
gested a group's size confound that 
that larger groups form different attitude 
toward opponents than do smaller group 
This main effect was significant (F = 4.0) 
dj 1/132, p < OS). However, this dif i 
was reflected only in the IH groups compari- 
son (planned comparison F = 4.37, df 1/ 
p < .05). Potential confounds could cor 
quently be excluded from consideration, 


Test for Experimental Predictions 


Post hoc comparisons based on the analy- 
sis of variance error term for the Group Size 
X Task interaction indicated that c 
IH2 + 2 groups differed from the combined 
evaluative ratings obtained in the three other 
conditions (F = 6.99, dj 1/132, p< 01). 
Single planned comparisons among the se- 
quential elements of the predictions indicated 
that TH2 + 2 groups produced less attitudinal _ 
negativity toward the opponent team han 
IH4 groups (F = 4.37, df 1/132, p <.05), 
and that there were no significant differences 
among the three other groups. The poten 
difference allowed in the theoretical predic- 
tion for comparison of IIIA with DMA or 
DM2 +2 (F =.23, F of, both ms) was 
not obtained. The data are presented in 
graphic form in Figure 1. 
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Discussion 


The results obtained in this research 
cate that negativity toward an opponent in 
competitive decision-making task is di 
as previously demonstrated by Streufert a 
Sandler (1971) and as predicted by Brofe 
brenner (1961) and White (1966). The neg 
tivity in attitudes toward an opponent 
modified, however, by the activities in ich 
the group engages. The predictions of a 
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@---@ Decision Mokers 


more favorable 
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° 
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Meon Evoluotive Rotings of Opponents 
N 
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more untovoroble 
x 
> 
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Subgroups Subgroups 
4 Together Aport 
(4) (2+2) 


Ficure 1. Evaluative ratings of opponents by 
decision makers and information handlers in the 
Together vs. Separate Subgroups conditions, (Lower 
! scores indicate greater negativity.) 
and Hoyt were supported: greater attitude 
change (here negative attitudes toward op- 
_ ponents) is produced when one makes deci- 

sions that have direct aversive consequences 
for the opponents.“ Decision makers in this 
research can be placed into all conceptual 
categories that Collins and Hoyt propose are 
necessary for attitude change: (a) the deci- 


In the Tactical and Negotiations game attitudi- 
nal negativity is produced after the participant reads 
a manual that contains bias against the opponent, 
and after he participates in a series of economic 
and military battles with the opponent. The degree 
of resulting negativity (attitude change) toward 
the opponent can vary, however. 

One might argue that the lesser negativity of 
TH 2+2 groups could be a reflection of an aliena- 
tion phenomenon. Such alienation, however, would 
express itself as well in other ratings obtained from 
such groups. No such differences (eg. attitudes 

toward their own group or in ratings of the degree 
of success and failure either team had experienced at 
any point in time were obtained, Suggesting that the 
finding is due to differential attitudes, rather than 
alienation. However, similar results (p < .05) were 
obtained when subjects were asked to rate the 
competence of the opposing team on a 7-point rating 
scale. Again, IH 2+2 teams viewed the opposing 
team as more competent than any of the other three 
groups, Since competence ratings tend to be posi- 
tively correlated with evaluative attitudes (Cafferty 
& Streufert, 1971); this finding supports the inter- 
pretation of data in line with the predictions of 
Collins and Hoyt. 
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sion makers had a choice whether or to 


aggress against their opponents: | all 
groups did engage in aggressive ac‘ d 
(c) they viewed their acts to have p ul 
negative consequences for their oppor (a 
necessity in a conflict situation whe ly 
one can be the winner, i.e., a situatio: ch 
subjects view as a zero-sum game). n- 
tial confounds of group size and task int 
instructions were excluded. Further, th- lif- 
ferences were obtained after only fev i- 
sions with aversive consequences (ay ng 
1.43 decisions of this kind) had been e. 
Attitudes later remained stable and we: it 


further affected by later decision makin: 

It is noteworthy that no difference 
obtained for any comparisons between } 
groups and both kinds of decision-mak 
groups. It appears that the ability to ens 
in decision-making activity with the indica‘: 
choice and consequence effects was a sui 
ciently strong manipulation. Effects of ii 
structions by the experimenters did not d 
minish the effect of the placement of the II 
groups into a situation where they were abl 
to contribute to the decision-making activit 
of their groups. (A check on the requires 
signatures on decision forms indicates that IE 
persons in these groups actually made 41 % 
of the decisions. In contrast, checks on tele- 
phone conversations showed that decision 
suggestions by IH 2 + 2 groups occurred very 
rarely.) 

The results of this research are encourag- 
ing. The theoretical predictions from attitude 
theories have generally had rather limited 
implications for other areas of social and 
organizational psychology. More seriously, 
opposing theoretical predictions of various 
writers have been supported by parallel re- 
search designs using often similar variables at 
diverse levels (of., Collins & Hoyt, 1971). 
Contradictory findings of this kind often sug- 
gest that the phenomenon under study is 
multidimensional rather than unidimensional 
in nature (cf., Streufert & Frombin, 1972), 
and consequently a number of predicts: 
conditions may be necessary to produce a 
effect reliably. Such multiple predictors are 
contained in the Collins and Hoyt theory. The 
successful application of the theory to a com- 
plex (itself multidimensional) group conflict 


sett uggests that the view of Collins and 
Ho: iy prove robust and might be poten- 


tial ‘ful as an initial integrative concept 
fo use of attitudes in a number of re- 
80 ieas in social and organizational 
I zy. 
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EFFECT OF WORKER ORIENTATION DIFFERENCES 
ON JOB ATTITUDE CORRELATES 


DONALD A. WOOD : 


Indiana University 


Correlations between various worker attitudes and job motivation, perform- 
ance, and absenteeism were examined for 290 skilled and semiskilled paper- 
makers. The hypothesis that significant relationships occur more frequently 
for those employees least job involved was supported, inasmuch as they 
accounted for 84% of the significant correlations. Further analyses disclosed 
twice as many associations for skilled as for semiskilled employees, Thus, 
highly involved employees, more intrinsically oriented toward their job, did 
not manifest satisfactions commensurate with company evaluations of per- 
formance; they depended more on intrinsic rewards. Those employees more 
detached from the job itself were more extrinsic in orientation and experi- 
enced gratifications more in line with company performance assessments due 


to their greater dependence on extrinsic rewards, 


It has been surprisingly difficult to demon- 
strate appreciable relationships between work 
performances and the satisfactions supposedly 
inherent in them (Ronan, 1970). Despite 

frequent disappointments, research in this 
area continues. These efforts seem to be based 
on the unacceptable premise that although 
performance and attitude are often unlike, 
they should not be. Even when similarities 
are found, they are often barely significant 

(Vroom, 1964). 

This study examines the correlational issue 
further by directing attention to the job in- 
cumbent and his work orientation. The theo- 
retical emphasis on intrinsic—extrinsic factors 
of job satisfaction (Herzberg, Mausner, & 
Snyderman, 1959) and the proposition that 
intrinsic and extrinsic rewards serve as satis- 
faction—performance moderators (Lawler & 

Porter, 1967) have prompted much needed 
research on some vital issues. However, the 
conceptual confusion that results from the 
finding that job satsfaction can originate from 
either job-related (intrinsic) or situational 

(extrinsic) features (Burke, 1966) and the 

inability to operationalize intrinsic rewards 


+The author would like to express his apprecia- 
tion to Dennis W. Organ and H. J. Reitz for their 
insightful comments on an earlier version of this 
article. 

Requests for reprints should be sent to Donald 
A. Wood, Graduate School of Business, Room 650, 
Indiana University, Bloomington, Indiana 47401, 
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creates certain doubts as to whether such 
tinctions can and should be pursued furtt-r. 

It is important to focus on the assessn 
of the intrinsic-extrinsic orientation of 
employee himself. Therefore, this study 
amined the neglected question of the imp 
of worker orientation differences on satisfa 
tion-performance relationships. Such associa- 
tions may well be influenced by the extent i 
which the worker is psychologically com 
mitted to work in general, that is, either 
intrinsically oriented and highly identified 
with work (high job involvement) or extrinsi- 
cally oriented and not psychologically depen- 
dent on the work itself (low job involve- 
ment). The intrinsic and extrinsic nature of 
high- versus low-involved workers, respec- 
tively, has been described elsewhere (Reitz, 
Groff, Ruiz, & Terry, 1973). 

Saleh and Grygier (1969) described job 
orientation differences in detail. It was found 
that those intrinsically oriented toward their 
work were more independent, confident, flexi- 
ble, less dependent on guidance or reassur- 
ance, and likely to reject conventional stan- 
dards more than their extrinsically oriented 
counterparts. It would follow that intrinsi- 
cally oriented or highly involved workers 
would be more dependent on their own stan- 
dards to determine appropriate performance 
levels and less concerned with organizational 
prescriptions of what performance should be. 
Vroom (1964) would support this contention 


by 1 that “some rank and file workers 
may latively indifferent to information 
from rvisors or others regarding their 
job | rmance [pp. 146-147].” In addition, 
Vrot 1962) has concluded that some em- 
ploy nay derive job satisfaction from 
the mance of tasks independent of the 
exte mediated consequences of that 
perí nce, 

1 differences in job orientation or in- 
volv: -nt would be expected to promote dif- 
fere in sensitivity to performance in- 
for: mn, Intrinsically oriented or highly 
inv | incumbents should be most reliant 
on own performance standards to de- 
ter! their work gratifications while those 
les omitted to work per se, in the absence 
of ..y personalized standards, should be 
mo~ dependent on external or “alternative” 
crit’ ia as defined by the organization. 

hypothesized in this study that per- 
form-nce as evaluated by the organization 


ot correlate with individual satisfac- 
tion levels for highly involved workers. Since 
the self-concept is not as work related for 
subjects more job detached, job satisfaction 
shouid be more related to the firm’s appraisal 
standards, Thus, it is hypothesized that least 
involved employees demonstrate significant 
satisfaction—-performance relationships. 


doe 


METHOD 
Sample 


The initial sample consisted of 352 semiskilled and 
skilled male papermakers in a large unionized paper 
mill plant located in the southeastern United States. 
The average employee had been employed at the 
plant for 13 years and was 39 years old. To insure 
the greatest validity and stability in involvement, 
satisfaction, and performance assessments, employees 
who had not been on their present job at least 
three months and employed by the plant for at least 
six months were eliminated from the study. This 
elimination resulted in a loss of 62 employees and 
yielded 110 semiskilled and 180 skilled workers or 
a total of 290 in the final sample. Because each 
skill level was amply represented and differed so 
in hourly wages received, plant seniority (progres- 
sion lines to skilled jobs were prescribed contractu- 
ally on the basis of both merit and seniority) and 
the complexity of work performed, it was decided 
to examine the moderating effects of involvement 
separately for the two skill levels as well as the 
total sample. 


company time to all incumbents during a 
period, Twenty Likert-scaled items were 
with the construct of job involvement 
Kejner, 1965) and were designed to measure 

nature of job orientation by examining the ext n 
to which work in any way is seen to affect 
own sense of personal worth, Because of the 
established multidimensional nature of this © 
struct, as demonstrated by Lodahl and Kejner, & 
principal components analysis (Dixon, 1968) that 
employed orthogonal rotation and multiple ¢ 

erfo 


ultimately provide a series of involven 
sion scores to be used for subsequent 
subgrouping. 

The remaining 11 items assessed On a 5-p 
scale the degree to which incumbents were 
satisfied (+S) or dissatisfied (+1) with their p 
job. Specifically, items pertained to attitudes to 
co-workers, wages, job security, recognition, 
vision, company policy, advancement, fringe ben 
work itself, job safety, and overall job feelings (for 
factor-analytic derivations of these dimensions 
Wood & LeBold, 1970). 

The greatest anonymity possible was prese 
for each respondent by requiring only his clock & 
number on the questionnaire form and by assu 
each subject that forms would be mailed to. 
author immediately following each survey se 
under union steward supervision, This pro 
made it impossible for anyone within the plant 
examine any individual questionnaire or items and 
hopefully assured each respondent that complete 
honesty to the inventory questions would be possible. 


Behavioral Indexes x 


Three different job behaviors to be later correlated 
with job attitudes were selected on the basis of their 
performance diversity and the high weight give 
them by the company in evaluating personnel. 
These included supervisory ratings, motivation, and 3 
absentecism. 

Supervisory ratings. Paired comparison ratings 
were secured from first-, second-, and  third-line 
supervision. All possible employee pairs within 
occupation under a rater’s jurisdiction comprised 
evaluation group (unskilled subjects excluded). 
numbers did not permit, occupations judged to 
similar in duties and skills required were combini 
into one rating group. Each supervisor was in 
structed to consider which of the two employees ir 
a given pair was currently performing most satis- 
factorily on his present job and then to indicate his 
preference by placing a check after the name. This 
checking process continued until every employee 
had been compared and rated with every other 
worker in his group. 

The number of times each employee had been 
rated first by each rater was converted to a stan- E 
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dardized performance index using T scale values (for 
conversion table see Lawshe & Balma, 1966). The 
average of these indexes across the three supervisory 
lines provided the performance score which was 
correlated with the job satisfaction items, 
Motivation. Following the comparison ratings on 
job performance, supervisors were given a 5-point 
Motivation scale for each of their semiskilled or 
skilled subordinates. The scale values were (a) too 
uninterested in job and lacks motivation, (b) some- 
what indifferent toward job, (c) some job interest, 
but not overly motivated, (d) actively interested in 
job, and (e) very interested in job and highly moti- 
Vated. The rater was instructed not to consider 
“actual performance or ability to perform, but only 
the desire or willingness to perform. Following this 
letermination, the rater merely checked the most 
appropriate scale number for each subject. 
__ Absenteeism, The third measure of work behavior 
concerned the frequency of absences for each worker, 
‘not the total number of work days missed, during 
a period of 18 months just prior to this study. In 
other words, the consecutive absences were counted 
only as one absence, since the frequency of absence 
and not the duration was considered more important. 


‘Preliminary Analyses 


The 33 unmoderated correlation coefficients be- 
tween the 11 satisfaction items and the three behav- 
ioral indexes for the entire sample disclosed only one 
exceeding the .20 level (between satisfaction with 
ion and job motivation, r= 23, p< 01). 
median correlation of only .07 demonstrated the 
ck of any meaningful affect-behavior associations 
when the total sample was examined in a single 
analysis. The independence between job satisfaction 
and job involvement was supported by a median 
item intercorrelation across constructs of only .07 
(ns). This finding would support the factoral inde- 
pendence of these concepts reported by Lawler and 
Hall (1970). Median item intercorrelations for job 
satisfaction and involvement were 33 ($ < .001) 
and .16 ($< 01), respectively. 
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The components analysis of the involveme: 
correlation matrix was conducted to determi 
extent of subgrouping required. This anal: 
vealed five factors which represented all 20 { 
ment items. These included (a) work attr 
seeing work as having positive appeal and p x 
oneself before each shift begins (ò) failure s 1 


tion—being bothered by work mistakes e- 
pressed by job failure; (c) work comm 

choosing work and its related responsibilities re 
important than other activities; (d) job pree: ce 
—job provides the major source of grati n 
when compared to all other activities; ( k 
identification—perceiving the importance o 8 


identified with work and evaluating others we n 
that basis. 

Five component scores were computed fo 
employee by summing across the raw res 
(+5 for strongly agree to +1 for strongly disa 
on all items loading above 30 on each dimen 
Factor score correlations ranged from .17 to 
Each score range for each involvement dimensi 
was then divided at the median value for the to 
sample. This division produced a low-involvem: 
group (below median) and a high-involvem 
group (above median) on each of the five factc 
Next, correlation coefficients were estimated betwe 
the 11 satisfaction items and the three performan-° 
measures within each of the resulting 10 subgrou; 


RESULTS 


The dramatic effect of differential job in- 
volvement on satisfaction-performance rela- 
tionships is shown in Table 1. Sixty-four of 
the 76 significant moderated correlation co- 
efficients appeared for those with low factor 
scores and offered strong support for the con- 
tention that workers low in job involvement 
manifest many more associations between job 
satisfaction and company assessments of per- 


TABLE 1 
FREQUENCIES OF SIGNIFICANT ATtTITUDE-BEHAVIOR CORRELATIONS (p < .05) vor Five INvotveMENT DIMENSIONS 


Job involvement 


Low involvement 


High involvement 


dimensions Sian 
Motivation | Performance Absences Motivation | Performance | Absences 

Work attraction 5 (.20*) 1 (.16*) 4 (—19* T 155 

Failure sensitization 8 (.21*) 0 8 Z 15 5 z 18505 l (.18*) l 
oa ES R 6 (—.23"*) 2 (18) 0 0 
Job preeminence 7 (.21**) 3 (.16*) 3 (—.17*) 0 1 (227 0 
Work identification 5 (342) 2 (.26**) 7 (300 | 2 (10) 10170 $ 
Subtotals 30 6 28 9 3 8 


Note. Significant correlation overall range was from .16 (p < 05) to .35 (b < 001). bn my 


arentheses, 


TABLE 2 
ÅTTITUDE-BEHAVIOR CORRELATIONS FOR Low- vs HicH-INvotvement GROUPS BY INVOLVEMENT FACTOR AND SKILE LEVEL 


Work attraction 


skilled correla! skilled not, M ee - 
1 e ee eee . = production, A = absences. Low involvement on a factor means below median score; 
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formance than do those highly job involved. 
This trend held for all five dimensions 
examined. 

One of the more striking findings in 
Table 1 is that analyses in the low-involve- 
ment groups produced 28 significant correla- 
tions between satisfactions and absenteeism 
(out of 55), while none could be found for 
those highly involved. This result would cer- 
_ tainly suggest that where such relationships 
are minimal or nonexistent in a total sample 
Sense, to seek out only those subjects ex- 
pressing low affinity toward work may well 
provide a most fruitful means for their 
enhancement. 

The significant moderated correlations pre- 
sented in Table 2 by involvement factor and 
skill level ranged from .18 to .48, The advan- 
tage of subgrouping in such analyses can be 
better appreciated when it is recalled that only 
one nonmoderated correlation exceeded .20. 

The most prominent correlates on the atti- 
_ tude side concerned job recognition, super- 

vision, and overall satisfaction, with the 
skilled groups accounting for the vast major- 
_ ity of such associations (see Items 4, 5, and 
11 totals). Satisfactions with co-workers, 
“wages, advancement, and the work itself were 
_ particularly limited in relatability for both 
skilled and semiskilled subjects. 

Probably the most notable finding detailed 
by the table, however, was the marked supe- 
riority in significance frequencies for those 
Wich low- as opposed to high-factor scores. 
Lo job preeminence was especially produc- 
tive of such frequencies followed by low fail- 
ure sensitization status. It should also be 
noted that except for work attraction, fre- 
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TABLE 3 
FREQUENCY SUMMARY TABLE For TABLE 2 CORRELATIONS 


om 


quencies for skilled employees far exceede 


those for semiskilled within the low g 
Table 3 is offered as a summary aid fo 


data in Table 2. Seventy-ei 
significant relationships 
job attachment. Also, it i 
this study that satisfactio 


t percent of 
ed under I 
vell confirmed in 
were related t 


behavior much more often for skilled 

for semiskilled employe: 4% of thi 

correlations). f 
Likert (1961) has sugs that satisfi 

tion and performance sh % more posi 

tively related as skill level eases, a prem 

ise which Vroom’s (1964) research re 


could not convincingly verify. 
is offered by the present study, since the 
median significant correlation for semiskilled 
was actually higher than for skilled—from 
33 (p < 01) to .26 (p< .01). The fre 
quency of significant correlati j 
higher for skilled employees 
tudes were not. Since the 
tend to gravitate toward the 1 
jobs, this finding would seem congru 
Carlson’s (1969) that attitudes related 
performance most often more able 
incumbents. q 

The correlational prominence for those 
least involved and most skilled held for each 
of the three job behaviors examined (see 
Table 2). For job motivation, more than 
two thirds of the correlations came from 
skilled ranks and five out of six were in low- 
involvement groups. For production, 6 corr - 
lations in 10 were in skilled groups and 4 in 5 
appeared in low-attachment groups. For moti- 
vation, 2 out of 3 correlations were for 


re able 


: 
Š 
. 


Motivation Absences Production Grand Total a 

Employee $ 

LI | HI | Total | LI HI Total | LI | HI | Total | LI | HI | Total 

Skilled yA So 28 17 4 1 15 54 10 OL 
= Semiskilled | 10 6 | 16 pb eee) : 
. mis 1 9 6 5 11 24 2 30 
4 Total l 5 30 20 6 | 26 | 78 | 22 | 100 

y. Note. LI = low involvement; HI = high involvement. 
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skill: is and 3 in 4 came from the 
least | subgroups. 
Discussion 

It ve concluded from this study that 
emp! least involved with work exhibit 
muc similarity between their job feel- 
ings xe organization’s evaluation of their 
peri- e than is the case for those ex- 
press more intrinsic or involved orienta- 
tion he former instance, high job satis- 
fact often associated with high motiva- 
tion performance, and low absenteeism. 
For utter group, this result is rarely the 
case haviors and attitudes show little 
syst correspondence. 

Sı ngly, this correlational phenomenon 
held all five factors employed in the 
mod m, despite their distinctiveness. 
Th would seem that low sensitivity to 
wo erally, irrespective of any specific 
as} rves to accentuate the correlations. 

In nt of the findings of this study, it 
seen portant to separate perceived per- 
forns. ce as determined by self-appraisals 
from company assessments of acceptability. 
Strauss (1966) reported distinct differences 


between self-ratings of performance and as- 
sessments by peers and supervisors and dem- 
onstrated the importance of considering which 
performance measure is to be employed in 
correlational questions regarding attitudes. 

The whole issue of performance assessment 
is closely linked with the question of whether 
rewards are intrinsically (self) administered 
or extrinsically (company) administered. The 
problem has been one of operationally dis- 
tinguishing between workers more sensitive to 
either reward type. 

Worker involvement would almost by defi- 
nition provide a solution. One involved with 
or intrinsically oriented toward work would 
perceive a stronger and more direct relation- 
ship between his self-concept and the work 
Performed than one more extrinsically dis- 
Posed (Lawler & Hall, 1970). It would follow 
that highly involved incumbents placing more 
Worth on the job itself would be more capa- 
ble of rewarding themselves intrinsically. In 
turn, low-involvement employees who do not 


identify with the work as closely 
more sensitive to externally mediated 
from the organization. Thus, the di 
in job orientation could well be used 
classifying those who are capable of se 
administration of rewards from those J 
are not. f 
Ii those highly involved are more prone to 
self-mediated reinforcements, then perform- 
ance can be rewarded irrespective of its 
“goodness” or “badness” in any 
sense, Consequently, if an organizationally 
defined standard of job performance, as u 
in this study, is compared with their satis! 
tions, little relationship should be evidene 
because of the possibility of job satisfaction 
occurring at many levels of perfo de, 
high or low. 
Continuing the same reasoning, if 
involved subjects are more sensitized to ex- 
ternal rewards and if reasonable contingencies: 
exist between such rewards and company-de- 
fined performance standards, then these sub- 
jects should experience satisfaction through 
these rewards more in line with prescribed 
performance levels. Since they would ne 
likely define work standards for themselves, 
satisfactions would be more dependent on 
how the firm evaluates and approves their — 
work behavior. 
To link reward theory with varying sensi- 
tivities of workers due to differential job 
involvement seems to offer a reasonable theo- 
retical basis to explain the results obtained 
in this study. Consequently, it would appear 
that the Lawler and Porter (1967) model 
that depicts equitable rewards intervening — 
between performance and satisfaction could 
be qualified to accommodate differences in 
job orientation and performance assessment. 
For example, for low-involvement subjects, 
job performance evaluated by the organiza- — 
tion (JPorg) leads to extrinsic rewards (ER) 
which in turn leads to job satisfaction (JS) 
or 


JP. ER JS 


The more appropriate model for those highly — 
involved would seem to be a 


y9 
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4 The problem for the latter subgroup seems 
: to be that the organization’s appraisal of 
performance does not give rise to the intrinsic 
_ rewards on which this subgroup was most 
dependent for work gratifications. As a re- 
sult, performance so measured does not 
; relate to satisfactions. Further research is 
needed to determine if self-evaluated per- 
formance would provide the necessary in- 
trinsic rewards demanded by such subjects 
which in turn would be reflected in com- 
mensurate satisfaction levels. 
F In addition, longitudinal studies of how 
= workers become identified with work are 
required. It may well be that highly involved 
= subjects have a history of positive reinforce- 
ment procured through prior work pursuits. 
II so, they may be more resistant to orga- 
nisational rewarding strategies, since they 
are more capable of initiating and perpetu- 
aling their own reinforcement system. On the 
contrary, those least involved, who may lack 
Such a history, may be more receptive to 
| current organizational rewards, as this study 
Strongly suggests. Worker orientation differ- 
ences toward work may well determine 
whether the organization indeed has any 
power to reward and satisfy effectively. 
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Bchaviorally based rating scales, lor programmer/analysts and systems analysts, 
which exhibited adequate Guttman scaling properties, were developed. These 
scales also demonstrated reasonably good convergent and discriminant validity. 
An attempt was made to identify relatively poor raters using Guttman 
scaling procedures, However, only slight evidence showed that supervisors 
who rate poorly on one dimension also rate poorly on other dimensions, or 7 
that supervisors who rate one individual poorly also rate another individual 


poorly on the same dimension, 


I ping adequate measures of job per- 
forr has certainly been one of the more 
cha! ig tasks in contemporary industrial 
psy« y. This task becomes somewhat 
more cult when the jobs are of a technical 
an plex nature, as for example, com- 
pute vsonnel jobs. 

Tk: development of behaviorally based 
ratin; «cales based on the procedure of scaled 
expectations (Smith & Kendall, 1963) and 
variants of this method (Campbell, Dunnette, 
Lawler, & Weick, 1970; Folgi, Hulin, & 
Blood, 1971) have seemingly resulted in reli- 
able and valid job criteria that help to mini- 
mize rater-biasing factors. Briefly, behavior- 
ally based rating scales reflect the major rele- 
vant dimensions of job performance where 
specific behavioral incidents serve as anchor 
points along the rating scales which represent 
these dimensions (Smith & Kendall, 1963; 
Folgi et al., 1971; Campbell et al., 1970). 
The behavioral anchors are ordered or scaled 
so that the top level of the rating continuum 
is anchored by those incidents that represent 
highly effective behavior within that domain 
of job performance. Similarly, the low end 
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. cently, Blanz and Ghiselli (1972) have used» 
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of the continuum is anchored by incidents of 
highly ineffective job behavior. E 
Guttman scaling procedures (Ed 
1957) can be used to help evaluate b 
iorally based rating scales. This t jü 
can be employed to determine whether t 
behavioral incidents representing an 
points are properly scaled. For exa 
Schultz and Siegel (1961) presented d 
which indicated that checklists for eval > 
task performance in several related job spe- 
cialties met Guttman scalability requirements. 
Hellervik, Dunnette, and Arvey (1971) de- 
veloped a series of behaviorally based rating 
scales, which exhibited Guttman Scaling 
properties, for production foremen. More re- 


Guttman scaling techniques to evaluate the 
accuracy of supervisors’ ratings along 18 
performance dimensions. One objective? of 
this research was to develop a set of behav- 
iorally based rating scales, which would ex- 
hibit adequate Guttman scaling properties, 
for systems analysts and programmer/ana- 
lysts. oe 
A second objective of this research was tio 
investigate the possibility of using Guttman 
scaling procedures to detect relatively poo 
raters, that is, supervisors who do not do a 


3 These scales are presently being used as criteria 
for a synthetic validation research project. 3 


particularly good job of rating job perform- 
ance. It should be recalled that the coefficient 
of reproducibility in Guttman scaling is a 
function of the number of rating errors that 
have been made. An “error” occurs when a 
response falls outside of the limits of a par- 
ticular item (or incident) cutting point that 
has been established during the scaling proc- 
ess and located so as to minimize error (Ed- 
Wards, 1957). When a large number of errors 
occur, it is commonly assumed that the in- 
_ cidents comprising the scales were not prop- 
erly scaled, that is, in the context of a job 
performance scale, the items did not really 
reflect a continuum of incidents ranging from 
high effectiveness to high ineffectiveness. Two 
“a alternatives also exist, First, the items 
may not be representative of the domain 
content. The second alternative is that cer- 
tain raters may be inaccurate. Blanz and 
_ Ghiselli (1972) found a wide range in the 
accuracy of raters using Guttman techniques. 
‘Concerning this last alternative, it may be 
| that the rater did not know the ratee well 
enough to judge his performance. Similarly, 
the rater may not have been paying suf- 
ciently close attention to the context of the 
"performance items used and thus might have 
been responding inconsistently. Using a Gutt- 
man scaling procedure, it may be possible to 
examine the errors made over a number of 
ers and incidents, and eliminate poor inci- 
_ dents and/or poor raters from the perform- 
ance ratings. 
II the tendency to rate poorly generalizes 
_ across ratees and across job dimensions, it 
may be worthwhile to attempt to learn more 
about the correlates of poor raters. First, 
__ however, the stability of the rating character- 
istic should be investigated by determining 
whether raters who rate poorly on one dimen- 
Jon also rate poorly on other job dimensions. 
Further, whether raters who rate one indi- 
vidual poorly also rate other individuals 
= poorly should be determined. 


METHOD 


The approach used to develop the scales was es- 
_ sentially a variant of the critical incident methodol- 
“ogy as modified by Smith and Kendall (1963). 
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appropriate personnel in a series of workshops te 
consider the major performance components or di- 
mensions of the two jobs in question—systems ana- 
lysts and programmer/analysts. Further, individuals: 
involved in these workshops were asked to gen 
specific behavioral incidents to serve as anchor point 
along the rating scales which reflected these dimen- 
sions. The general procedure for developing 
rating scales has been previously described if 
detail (Campbell, Dunnette, Arvey, & 
press; Folgi et al., 1971), and the 

development of the present scales are j 
where (Hoyle & Arvey, 1972); there 
will not be discussed in this study. One point womi 
noting, however, is the fact that the basic job dle 
mensions of the two job areas were very much the 
same, Thus, it was decided to develop one set of 
rating scales to evaluate individuals in these two 


nominally different jobs, However, the basic taski 
dimensions of the two jobs were suspected to be the 
same, but what differentiated the two jobs would be 

the relative importance of the dimensions. This 


reasoning corresponds closely to the statement by 
Mayer and Stalnaker (1968). 

The results of these series of workshops was 
identification of the major dimensions 


supervision and leadership, (e) train 
documentation, (g) maintaining comm: 
assessing customer needs and provi 
dations, (i) job commitment and efor 
ging, (&) program and/or systems modification an 


rating scales were developed for each job dimension” 
with the exception of one dimension—conducting — 
presentations—due to the limited behavioral anchors © 
generated for this scale. The first type was simply & 
rating scale where the behavior incidents were 
ranged on a continuum. Ratings we 
asking supervisors to infer the leve 
that could be expected of the ratee’s ty 
formance, and to indicate that individual’s rating 
by writing “typical” next to that particular inci- 
dent.“ This type of scale was scored by assigning a 
value of 7 to the top (highly effective) incidi 
down to a value of 1 for the bottom (highly ineffe 
tive) incident. Henceforth these scales are referred 
to as the typical rating scales. 

A second type of scale was constructed for each 
dimension wherein the behavioral incidents were 
originally arranged in random order on one page, 
but then reordered to reflect a continuum from h 
to low effectiveness. Behavioral incidents other 
those used in the typical method were used. Rate 
were instructed to read each incident and decide 


Katers were also asked to make a rating of 
“best,” indicating the level of performance that 
could be expected when the rater was at his best. 
These data were not used in the analysis. 


whet! \ividual be rated was better than, 


the $ not as good as the individual de- 
xribe incidents, The scales were scored by 
asig: :merical value to each responie cate- 
gory er than, 2 = the same as, 1 = not as 
goo! summing these values over the inci- 
dents continuum, If the items are properly 
saku uttman sense (Le, no errors), the ro- 
spons i follow a general pattern—better than 
rating bottom of the scale, change to the 
same ng, and finally switch to a not as good 
as rat where near the top of the scale. Scal- 
ing p res allow for the identification of the 
incide i raters where this pattern is not fol- 
lowed ter rating technique will be referred to 
as the man rating method. 

Th phase in the research was to evaluate 
the ; ctrie properties of these rating scales. 
Sam; 

Tu dred systems analysts and programmer/ 
analy } programmer/analysts, 87 systems ana- 
lysts loyed by a large midwestern computer 
manu ing company were rated by their super- 
visor he above dimensions of job performance 
usin; ypical and Guttman rating methods, On 
cach on, these supervisors were asked first 
to m decision concerning whether the incum- 
bent lly should be evaluated, and if so, to 
com is rating on that dimension. If it did not 
apply » supervisor left the page blank and went 


next dimension. Raters were further 
to indicate an overall effectiveness rating 
on a int scale. They also rated how impor- 
tant t work dimensions were for the individual 
job incumbent by allocating 100 points to the dif- 
ferent dimensions according to their importance 
(higher importance = more points). One hundred and 
three supervisors rated the 200 incumbents. 


on to th 
instr 


Procedures 


The ñrst set of analyses involved comparing indi- 
viduals in the two jobs on the importance ratings 
and the actual ratings made by the supervisors on 
the dimensions. Thus, means and standard devia- 
tions were computed and £ tests run between the 
individuals in these two job areas on the impor- 
tance ratings, the typical, and the Guttman ratings. 
Internal consistency reliability values for the Gutt- 
man rating scales were computed using Cronbach's 
alpha (Guilford, 1954). It was not possible to esti- 
mate the reliabilities of the typical rating scales as 
the necessary data were not available. 

Second, a multidimensional-multimethod correla- 
tion matrix was computed in order to examine the 
convergent and discriminant validity of the rating 
scales (Campbell & Fiske, 1959). The Guttman 
Scales were subjected to scaling procedures whereby 
the coefficient of reproducibility for each dimension 
Was computed. The Biomedical Computer Program 
(BMD 055) was used for this purpose (using the 
Cornell method) and modified slightly in order to 
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supervisor who rated three or more E 
pair of ratees was selected randomly for this Lat 
part of the analysis, q 


Resvnts 


Table 1 presents the results of the analy- 
sis comparing individuals in the two nominal 


pects to note about these data. First, as was 

suspected, there were aggregate differences — 
between the two job areas with regard o 
what is considered important in the jobs. 
Statistically significant differences emerged on 
7 of the 12 dimensions. The dimensions con- 


tion, debugging, and program and/or Systems 
modification and/or development, while those 
considered more important for the systems 
analysts were maintaining customer relations, 
maintaining communications, assessing cus- 
tomer needs, and conducting presentations. 
It appears from these data that, on the 
average, systems analysts have more to do 
with personal contact activities, while pro- 
analysts deal perhaps at a more 
technical level. It is important to note, how- — 
ever, the wide variances within each nominal 
job category with regard to the importance 
of each job dimension. Thus, for some pro- 
grammer/analysts, it is important that the, 
maintain customer relations and communica- < 
tions. Similarly, even though statistically 
significant differences occurred between the 
two job areas on various dimensions, there is 
large overlapping between the two distribu- — 
tions as indicated by the overlap percentages 
(Tilton, 1937) presented in Table 1. 
Whereas statistically significant differences — 
appear between programmer / analyts and 3 
systems analysts with regard to the rated 
importance of the job dimensions, there were 
no differences between the two groups with 
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TABLE 1 
IMPORTANCE RATINGS For TWELVE DDIENSIONS OF Jos BEHAVIOR roe 
Systems ANALYSTS AND PROGRAMMER/ANALYSTS 
Ratings 
Programmer Systems 
Job behavior dimensions analysts ys Distribution 
(N = 112) (N = 87) 
| ï SD x ASD: d* t Overlap % 

Technical knowledge 15.42 | 9.25 14.52 7.12 90 76 97 
Planning, organizing, scheduling 9.57 | 491 | 9.57 | 441 00 00 100 
Maintaining customer relations 6.56 | 6.22 | 10.69 | 6.05 | —4.13 | —4.72* 66 
Providing supervision and leadership 2.70 | 6.24 3.34 | 5.07 4 — 80 95 
Training others 5.52 | 4.63 | 661 | 4.02 | —1.09 | —1.77 90 
Documentation 11.18 | 5.59 | 845 | 4.29 2.73 3.90° 80 
Maintaining communications 7.05 4.16 8.50 | 3.72 | —145 | —2.59* 85 
Assessing customer needs and providing | 

recommendations 4.30 | 445 5.94 | 4.40 | — 1.64 —2.59* 85 
Job commitment and effort | 11.61 | 4.74 | 10.46 | 4.50 1.15 1.75 90 
Debugging 12.13 | 6.33 | 8.84 | 5.90 3.29 3.78* 78 
Program and/or systems modification and/or | | | 

development | 10.95 | 7.31 | 8.62 | 7.20 2.33 2.25* 87 
Conducting presentations 3.26 2.83 4.62 | 348 | —1.26 | —2.74* 85 


Mean difference, 
*p <05. 


regard to their rated performance using 
either of the two rating methods. Therefore, 
the two groups were combined for further 
analyses. 

The 22 X 22 correlation matrix (2 rating 
methods, 11 dimensions) comprising the 
multidimensional-multimethod matrix is pre- 
sented in Table 2. Also shown are the cor- 
relations between the dimensions and the 
overall effectiveness measure and the internal 
consistency reliabilities. 

In terms of convergent and discriminant 
validity, Table 2 indicates that adequate 
convergent validity was achieved. Convergent 
validity is demonstrated when the correla- 
tions between the same dimension using differ- 
ent rating methods are significantly different 
from zero and reasonably high. The cor- 
relations which represent the convergent va- 
lidity coefficients are circled in Table 2. All 
of these are significantly greater than zero 
and thus suggest that both rating methods 
show agreement or convergence on the 11 
dimensions. 

Discriminant validity may be defined as 
the extent to which a dimension or trait is 


correlate with other measures that are con- 
ceptually different and from which they are 
intended to differ. Discriminant validity i 
demonstrated by several criteria. First, 4 
diagonal correlation value (validity coeffi 
cient) should be higher than the values lying 
in its corresponding columns and rows in the 
heterodimension-heteromethod triangles. The 
logic behind this is that a measure of a di 


with other variables having neither dimension 
nor method in common. 

The validity values in Table 2 are higher 
than in their respective columns and ro 
(in the heterodimension—heteromethod tri 
angles) on 204 out of 220 comparisons. Di 
mensions 5 and 8 accounted for 8 out of the 
16 occasions where the validity coefficien 
were equaled or exceeded by these 0 
values. 3 

Another criterion for demonstrating d 
criminant validity is that a measure of 
dimension should correlate higher with 
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independent effort to measure the same di- 
mension than with measures designed to get 
at different dimensions that happen to use 
the same method. For a given dimension, this 
involves comparing its value in the validity 
diagonal with its corresponding values in the 
heterodimension-monomethod triangles. In 
Table 3, the validity values exceed the values 
in the corresponding columns and rows in the 
typical rating method triangle 79 out of 110 
comparisons and 59 out of 110 comparisons 
in the Guttman monomethod triangle. 
Again, dimensions 5 and 8 do not stand 
up well under this criterion, exceeding the 
monomethod coefficients only 8 out of 40 
comparisons. 

With the exception of these latter two di- 
mensions, we can conclude that the rating 
scales exhibit reasonably good convergent and 
discriminant validity. It is interesting to note 
that the monomethod correlations for the 
Guttman rating method are generally higher 
than those in the typical monomethod block. 
The median intercorrelation between the 
eleven dimensions using only the Guttman 
rating method was .61, while the median 

correlation for the typical method was .55. 
The next step in the analysis involved 
computing the coefficients of reproducibility 
for the eleven Guttman rating scales. Table 3 


TABLE 3 


COEFFICIENT OF REPRODUCIBILITY AND MARGINAL 
REPRODUCIBILITIES FOR ELEVEN GUTTMAN 
RATING SCALES 


Reproducibility® 
Guttman rating 

sala Coefficient Minimum 
marginal 

1 86 80 

2 82 74 

3 90 81 

4 88 74 

5 83 75 

6 92 82 

7 89 75 

8 88 80 

9 90 81 

10 91 82 

11 84 2 


a Coefficient of reproducibility = 1 — # of errors /total num- 
ber of responses. 
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presents these coefficients along wiih the 
minimum marginal reproducibilities for the 
eleven dimensions. Guttman (1944) suggests” 
that reproducibilities over .85 indicate that 
the data have been sufficiently scalable. 
Guilford (1954) suggests that reproducibili- 
ties in the neighborhood of .80 represent 
quasi scales. Eight of the eleven reproducibil- 
ities are over .85, and all of them exceed .80. 


Another factor to consider when ey 
the scalability of a set of statements is 
whether the coefficients of reproducibility ex- 
ceed the minimum marginal reproducibilities. 
The minimum marginal reproducibility refers 
to the percentage of nonerror which occurs” 
if the modal response categories of the items 
were used as the predicted rating. All 11 di- 
mensions exhibit coefficients of reproducibil- 
ity which exceed their respective minimum 
marginal reproducibilities, indicating that the 
rating scales did, for the most part, meet 
this criterion for scalability. 

The number of rating errors for each rater 
per dimension was computed. The intercor- 
relations between the dimensions with regard 
to the number of rating errors made revealed 
that 30 out of 55 correlations were signifi- 
cantly different from O. However, the median 
correlation was .25 with a range between .41 
and —.07. These data inciate that there is a 
slight tendency for raters who make errors 
on one dimension to make errors on other 
dimensions. 

It was noted that the number of errors 
made from a given dimension was related to 
rated job performance. Visual inspection 
of the scatterplots did not reveal evidence of 
nonlinearity. The correlations between the 
number of errors and the job performance 
rating (typical rating format) in that 
dimension were as follows for the eleven 
dimensions: : 


iating 


=31, —.24, 28, —.27, —.33, —.27, 
= 11) 29 09, = 31l,and’ —.22: 


All but two of these correlations were sta- 
tistically significant. What these data indi- 
cate is that individuals who are rated as” 
doing poorly on a particular aspect of theit 
job are not rated as accurately by their 
supervisors. 


. A 
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In zating the reliability of super- 
visor scores when evaluating more 
than dividual, the resulting intraclass 
corre coefficients ranged between —.12 
and . th a median correlation of .16. The 
coeffi for dimensions 3, 9, and 10 were 
signi y different from O (McNemar, 
1962 nevertheless quite low (.35, .30, 
and spectively). In general, the evidence 
does oint to a strong tendency for raters 
who errors on one dimension when 
evalt a particular individual to make 
errors that dimension when rating another 
indiv ud. 

DISCUSSION 

Ti- data presented indicate that it is pos- 
sible develop behaviorally based rating 
scales which exhibit Guttman scaling prop- 
ertie irther, these scales (along with more 
tradiiicsal behaviorally based rating scales) 
also ibit reasonably good convergent and 
discris-inant validities. The finding of con- 
vergen! and discriminant validity is perhaps 
more table when it is recalled that the two 
ratine methods differed not only in their re- 


sponse format, but also in the behavioral inci- 
dents ihat were included in each method. 
With the exception of two performance di- 
mensions (5 and 8), these scales appear to be 
appropriate for evaluating performance in 
these jobs. However, because of the relatively 
lower intercorrelations among dimensions 
using the more traditional typical rating 
method when compared to the Guttman 
ratings, the typical rating scales might be 
used as the primary measuring instrument. 
The use of these Guttman rating scales to 
identify relatively poor raters does not seem 
promising. While there was some tendency 
for raters who made large numbers of errors 
on one dimension to make errors on other 
dimensions, the relationships were weak and 
practically meaningless. Similarly, there was 
only slight evidence that raters who made 
large numbers of errors on a dimension for 
a particular ratee also made errors when 
rating another individual on that same dimen- 
sion. In sum, rating poorly as defined by this 
technique does not appear to bea generalized 
trait characteristic of raters. In fact, rater 


errors are more strongly related to individual 
ratee job performance in a particular dimen- 
sion than anything else, Gordon (1970) pre- 
sented results showing a phenomenon where 
raters do not note incorrect performance as 
accurately as they do correct performance, © 
where accuracy referred to the relationship 
between judges’ perceptions of behavior and 
the actual behavior that took place. What 
might be occurring is that the poor performers 
may actually be inconsistent or variable in 
their job performance. In a slightly different 
vein, variable performance might also be 
judged as poor performance by supervisors. 

Finally, using ratings to gather information 
about the importance of each job dimension 
appears to be a useful technique in under- 
standing similarities and differences among 
jobs. Significant differences appeared between 
the two jobs with regard to the rated impor- 
tance of the various dimensions; however, 
wide overlap also existed between the two 
nominally different jobs. It was also noted 
that the individuals performing in these two 
jobs differed widely in the patterns of dimen- — 
sions that were important. Further research 
should perhaps focus on identifying clusters — 
of individuals who share similar job re- 
sponsibilities and determining whether these 
clusters moderate other relationships. 

The dimensions which were derived and the 
ratings of their importance have further im- 
plications. Conceivable use could be made of 
them for job evaluation, wage, and salary — 
purposes. Further, these scales might be used 
in training and development functions by 
helping to specify strengths and weaknesses 
for both the aggregate and individual cases. 
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DEVELOPMENT OF CRITERION MEASURES OF 
OVERSEAS MISSIONARY PERFORMANCE 


PATTYE WEAVER KENNEDY axo RALPH MASON DREGER * 


Louisiana State University 


To establish criterion measures of effectiveness in overseas missionary per- 
formance, a committee of experts selected criterion variables involving personal, 
social, and professional elements of missionary work and designed The Mis- 
sionary in Action—A Descriptive Check List (MINA) to measure behavioral 
attributes relevant to the missionary enterprise. A factor analysis of the MINA 
check list resulted in 11 factors, The major results of the study indicated that 
future efforts should be directed toward developing the MINA check list and 
other criterion measures derived from native cultures. 


Chi rissionary boards have been send- 
ing | numbers of missionaries abroad 
since zinning of the modern missionary 
move in the early part of the 19th cen- 
tury. ns of dollars are spent annually 
to re train, and maintain this body of 
overs ersonnel. Despite the involvement 
of a l vumber of people and the vast ex- 
pendi of money, however, little is known 
about quality of services being rendered, 
the eic: ents which make for effective over- 
seas Pe ‘ormance, and the personal character- 
istics «1d experiences needed to succeed in 
missions: y service (Cleveland, Mangone, & 
Adams, 960). This lack of knowledge re- 


garding the efficacy of the missionary enter- 
prise has prevented the establishment of cri- 
teria for measuring overseas missionary per- 
formance and, consequently, the selection of 
predictors of these criteria. 

While the investigation of missionary work, 
specifically, has been the focus of very few 
research endeavors, extensive efforts have 
been made to examine other areas of overseas 
work. A number of sources have suggested 
the usefulness of psychological examinations 
(Gunderson & Kapfer, 1966; Hanfmann, 
1966; Mischel, 1965), a knowledge of back- 
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ground information (Jones, 1967; Mabry, y 
1969), and an understanding of situational 
variables (Mabry, 1969) in reducing the 
occurrences of incompetence, maladjustment, 
and breakdown in overseas situations. 

However, from a practical standpoint the 
results of the research efforts have been dis- 
appointing. One of the major problems seems 
to be that investigators have concentrated 
primarily on evaluating selection procedures 
without sufficient information pertaining to 
criteria, so that the results of efforts to estab- 
lish selection procedures are difficult to in- 
terpret and tend to be insignificant (Men- 
ninger & English, 1965; Gunderson & Kapfer, 
1966). As far as is known, no research has 
focused specifically on the establishment of 
criteria for effective overseas missionary per- 
formance. From the literature it is also clear 
that criterion measures should be derived on 
the basis of information gathered in the native 
culture rather than in this culture. Ratings 
made in a home situation and ratings ob- 
tained from overseas respondents have been 
found to vary significantly (Jones, 1967). 

For several decades mission boards have 
recognized the limitations of their selection 
methods and have felt a need to evaluate 
their procedures. It was not until 1966, how- 
ever, that a group of missionary personnel 
secretaries and psychologists conferred about 
evaluative procedures. Subsequently, a com- 
mittee was established within the National 
Council of Churches of Christ for the purpose 
of investigating selection procedures (Dreger, 
1967). The present work was an outgrowth 
of the efforts of this committee. 
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The purposes of the total study were to 
establish criteria for effective overseas mis- 
sionary service and to examine predictor 
variables in relation to the criteria. However, 
this report primarily focuses on the develop- 
ment and use of one instrument, The Mis- 
sionary in Action—A Descriptive Check List 
(MINA) } for criterion purposes. A report of 
the total study can be found elsewhere (Ken- 
nedy, 1972). 


METHOD 
Subjects 


The subjects were 567 individuals, 137 of whom 
were missionaries constituting the primary targets 
and 430 of whom were peers serving as respondents 
re the missionaries. Besides missionaries and peers 10 
home office personnel secretaries rated the mission- 
aries, and six served as judges to obtain ideal mis- 
sionary profiles. The 137 missionaries, men and 
women, were associated with the United Methodist 
Church, United Church of Christ of the Philippines, 
American Baptist Foreign Mission Society, the Epis- 


~ copal Church, and the Missouri Synod Lutheran 


Church. The majority of the female missionaries 
were wives of male missionaries. These subjects were 
in service in the Philippines or on furlough as of 
July 1, 1970, or had completed their missionary 
term, had withdrawn from service, or had been on 
leave of absence since January 1, 1968. 


Description of the MINA Check List 


The Missionary in Action check list was de- 
signed for this study to measure behavioral 
attributes specifically relating to personal-social- 
work relationships of missionaries. Initially, 
items were derived from suggestions pertaining to 
qualities of missionary effectiveness made by con- 
sultants in the missionary field. These items were 
supplemented with items representing more general 
effectiveness of a person in social situations, The 
total pool of items was subjected to revision, dele- 
tion, and/or addition by experts in the missionary 
enterprise. Where it was felt the terminology would 
be confusing to the Filipino people the pool of 
items was further revised in the Philippines on the 
basis of a translation into Tagalog, a native lan- 
guage. The final revision was given a preliminary 
tryout in the Philippines prior to this study. 

The MINA check list consisted of 155 items on a 
5-point scale. A question mark was used in re- 
sponding if a subject felt he could not rate the 
missionary on an item. 


Administration of Instruments 


The MINA check list, along with the FIRO-B 
(Fundamental Interpersonal Relations Orientation- 


3A copy of the Missionary in Action—A Descrip- 
tive Check List may be obtained from either author. 
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Behavior), a standardized measure of interpc 
relationships (Schutz, 1967), was administer 


the Philippines by the experimenter, the nior 
author, to the missionaries and peer respondents who 
had worked closely with the missionaries. Sta:.card 
instructions were read at the beginning of cach 
session. 

Ratings of supervisors on the MINA che list 
were also obtained for 112 of the missionaries om 
members of the missionary personnel staff or 
the regional secretaries in the home offices. The 
remaining 25 missionaries were not well ugh 
known to permit ratings by the home office staff 
members. 

For purposes of comparison, regional and personnel 
secretaries from six of the various mission rds 
completed the MINA check list and the FIRO. in 
accordance with their conceptions of a hypotteti- 
cally effective, desirable missionary who might si and 


within the top 10% to 20% of missionaries. T 
median of the six profiles on each instrument ser 
as the standard or ideal profile for the MINA a 
the FIRO-B. 


Analyses of the Data 


Multivariate procedures were utilized in the major 
analyses of the data. Since a number of the sta- 
tistical processes involved results derived from 
previous analyses, the specific analytic procedures 
are detailed with the results. 


RESULTS 
Analysis of the MINA Check List 


When respondents had not observed the 
missionary in situations described by particu- 
lar items, the response mode used on the 
MINA check list resulted in missing data. 
The mean of a respondent’s total score was 
entered in place of the missing information. 
Then, the scores for the 567 respondents in 
the Philippines, missionaries and peers, on 
the 155 items of the MINA check list were 
intercorrelated, and the resulting matrix was 
subjected to a principal axes factor-analysis 
using squared multiple correlation coefficients 
as diagonal entries (Harman, 1967). Rota- 
tion of the derived factors was accomplished 
through the varimax method, followed by the 
promax method which yielded an oblique 
solution.* = 

Although the latent-root-of-one test and 
Cattell’s Scree Test would allow all 30 of the 
factors the program produced, only 11 of 


*Credit for the integration of these procedures 
should be given to Richard L. Gorsuch of the Ken- 
nedy Center, Vanderbilt University. 
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TABLE 1 


1 


re 


calm 4 


Drs ION OF 11 MISSIONARIES IN Action CHECK List FACTORS AND REPRESENTATIVE [TEMS 
= = : 
Factor Representative items 
A. Descril erson who is very understanding and | 77. People say of him, “He understands me and he 
acceptir seople and ideas. sympathizes with me. He knows how to put him- 
self in my position.” i 
106. Christians say they are grateful for his helping 
them to understand better what it means to love 
their fellowmen. 

B. Descri erson who is insensitive to events 78. From the remarks he makes you can tell that he 

arour has very few, if any, close friends, and does not see and appreciate the humor in 

has fe onal skills for coping with social and situations. 

profes situations. 130. He seems not to be well informed about impor- 
tant local and world developments. 

2. Is con | with an individual’s ability to organize 152. He handles details well without losing sight of 

his tin | energy in carrying out his professional the aims of the project in which he is engaged. 
respo ies. 24. Whatever he undertakes, he does well. 

D. Meas person’s openness to and acceptance of 121. People say that when social changes take place, 

chan; seople and social situations. he reacts positively to them. 

84. He helps others develop according to their 
natural abilities, without forcing them to become 
what he wants them to be. = 

E. Deals che individual’s philosophy of life and 86. People ask him for advice on both practical and 
the wa which this philosophy affects his personal intellectual problems. z 
and prefcssional activities. 142. People appear to enjoy being with him. 

F, Is concerned with an individual’s leadership abilities. | 53. When following rather than leading is called for, 
he fulfills a follower’s role in such a manner as to 
deserve favorable comment. 

31. When leadership is called for, he fulfills a leaders 
role in such a way as to deserve favorable d 
comment. 

G. tems which pertain to an individual’s 52. He is spoken of as one who is very deep spiritually. 

to Christ and his efforts and abilities 28. Because of his contact with them, some people 
his faith with others. who were without Christian faith have become 
committed to Christ. 

H. Taps the traits of humility and dedication. 148. It is clear that he is trying to understand and 
adjust to life in the Philippines. 

140. He gives and takes compliments gracefully. 

I. Describes a person who has the capacities to adjust 134. He has close friends among the people of the 
to cultural demands and to relate to people in Philippines. k Fa. 
cultures different from his own. 20. He is generally considered successful in his 

missionary work. 

J. Measures a person’s concern about people with 21. People feel he has made them more aware of 
special needs, such as the poor, the blind, and the those who have special needs, such as the blind 
physically handicapped. and the physically handicapped. aa 

42. He speaks and acts in such a way as to indicate 

he is aware of persons with special needs, such as 
the blind and the physically handicapped. 

K. Deals with the family and home relationships of the | 111. (as a married person) There seems to bea 7 
missionary. and pleasant atmosphere among the members of 2 

$ his family. 

103. (as a married person) The members of his 


family — 
manifest their affection for one another. 


these could be retained for interpretation, The 
Majority of the remaining 19 factors had 
factor loadings below .35, the criterion for 
acceptance, while several were singlets. Table 


1 gives a brief description of each of the 11 
factors along with representative items. 


3 


Reliability coefficients for the MINA check 


list were computed for the total test and each 


r 
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of the 11 factors from the intercorrelations of 
items (Nunnally, 1967), according to the 

procedure rationalized by Dreger (1973) 

for nonability tests. The results reflected a 

coefficient of .98 for the total test, with a 

range of 93.73 for the factors. Although 

authorities differ as to whether coefficient 

alpha and its special cases constitute “lower 

limits” (Lord & Novick, 1968) or “upper 

limits” (Nunnally, 1967) to reliability, from 
an empirical standpoint, it seems that the 
above coefficients must assuredly represent 
the latter. 

For validational purposes Pearson product- 
moment correlation coefficients were calcu- 
lated for each of the 11 factors on the MINA 
x check list between (a) median Z scores for 


x 


each missionary, taken from the scores of the 


| 
> 
p 
i 


missionary and peer respondents and (b) 
supervisory ratings, derived by having at least 
_ three administrators in the home offices 
designate the top 25% of their mission- 
_ aries, that is, a rating of 1 was given to 
each of the missionaries within the top 
group and a rating of 2 was given to the 
other missionaries. The coefficients ranged 
from .289 to .025, with a median of .153. 
These coefficients served as validity measures 
along with the 7, coefficients described below. 
The factor structure loadings of the 11 
factors were used as weights for devising a 
3 factor score system for the MINA check list. 
“= 


The resultant 11 factor scores served, thus, 

as criterion measures for further analyses. 

4 Analysis of MINA and FIRO-B Profiles 
For criterion purposes Cattell’s (1949) 
coefficient of pattern similarity (7,) was 
calculated for each missionary between his 
scores and the scores of the peer respondents 


3 

on the MINA check list and between his 
scores and the standard MINA profile. Similar 
FIRO-B. 


N 


r,s were calculated for the scores on the 


Table 2 presents the mean and median 5 
coefficients between self and colleague profiles 
and between self and standard or ideal pro- 
_ files on the MINA check list and the FIRO-B, 
_ These mean and median coefficients were 
based on the 7,s of 117 missionaries who com- 
pleted the MINA check list and 126 who 
completed the FIRO-B. The results show 
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that the self versus colleague and sel 
standard correlations were reversed ir 


mi- 
tude for the MINA check list the 
FIRO-B and that the highest rel hip 
existed between self and colleagues the 
MINA check list. The comparison da 
mean of .28 and a median of .40. 

Canonical Correlation Analyses 

Data from the MINA check list the 
FIRO-B were used as criteria alo ith 
other criterion measures. The final ses 
were computed on 25 criterion varia and 
46 predictor variables. Canonical co ion 
analysis was employed to determine la- 
tion between the predictors and ria 
(Cooley & Lohnes, 1962, 1971), vO 
analyses were performed, However, s he 
results of these analyses go beyond th ype 
of this report, they are not given his 
study but can be found in Kennedy 29 

Discussion 

One of the major purposes of this dy 
was to establish criterion measures of : fec- 
tive missionary performance. Previous re- 
search has pointed to the necessity of deriv- 
ing criterion measures of effective overseas 
performance on the basis of inform=tion 


gathered in a foreign culture. The present 
study attempted to achieve this purpose 
through extensive data collection the 
Philippines. 

The effort to develop criterion measures 
was only partially successful, but a promising 
aspect of the study relates to the develop- 
ment of the MINA check list. Its construc- 
tion included the combined efforts of for- 
eign and home office personnel, and it was 
given an extensive tryout in the Philippines. 
The factor analysis of the MINA check list 
produced 11 factors representative of attri- 
butes generally considered to be important in 
social and professional relationships in the 
missionary situation. The reliability is satis- 
factorily high. The 7p coefficients between 
self and colleagues indicated that for these 
two groups the MINA check list measures 
similar concepts. The reversal in magnitude of 
the self versus colleagues and self versus 
standard 7, correlations on the MINA check 
list and FIRO-B suggested that the concepts 


in 


eee 


TABLE 2 


COEFFICIENTS ON THE Miıssioxary by 
THE FUNDAMENTAL INTERPERSONAL 
TIONS ORIENTATION- BEHAVIOR 


Fp Coefficient 
€ gory — AED 
Median | Mean 
i and colleague 40 | 28 
and standard 16 AS 
clf and colleague 10 07 
elf and standard 21 27 


\ = Missionary in Action—A Descriptive Check 
= Fundamental Interpersonal Relations Orien- 


by the MINA check list are more 
the missionary enterprises and the 
ho used the MINA check list than 
ats measured by the FIRO-B, an 
developed for more general pur- 
ese results provide support for the 
of the MINA check list as a tool 
uring certain behavioral attributes 
turn, the findings support the im- 
of these behavioral attributes as 
n measures. 

is recognized that the results derived 
the use of the MINA check list are ten- 
and must be replicated in independent 
= studies in order to be interpreted with confi- 
= dence. It is also recognized that revisions of 
dhe instrument are needed. However, the de- 
velopment of any measurement instrument is 
a long and arduous task. When the process 
is begun with very little information to build 
on, as was true in the development of the 
MINA check list, the difficulties are accentu- 
"ated. Although the present efforts represent 
only the beginning phases of the construction 
Process, they do provide information on which 
to build and provide a rationale for continu- 

ing the process. 


REFERENCES 


_ Carrert, R. B, rp and other coefficients of pattern 
similarity. Psychometrika, 1949, 14, 279-298. 
-Creverann, H., Mancone, G. J., & AAN, J. C. 


LOPING CRITERION MEASURES OF OVERSEAS MISSIONARY — 


Wenn oe 


Cooney, W. W., X Lounes, P. R. Multivariate 
cedures jor the bekavioral sciences, New Yi 
Wiley, 1962. 

Coorzy, W. W., & Lonxes, P. R. Multivariate 
analysis. New York: Wiley, 1971. b 

Drecrr, R. M. Research design for evaluating t 
effectiveness of several predictors of acceptance 


An unpublished manuscript, 
versity, July, 1967. 

Drecer, R. M. A note on the use of absolute 4 
in estimating reliability from the inter-item cor- 
relations. Educational and Psychological Measure 
ment, 1973, 33, 3-5. 

Guworesox, E. K., K Karrer, E. I. The diei 
validity of clinical ratings for an extreme environ. 
ment. Brit’sh Journal of Psychiatry, 1966, 112 
(48S), 408-412. 4 

Haxrsass, E. Projective techniques in the 
ment program of the Office of Strategic Servic 
In A. Anastasi (Ed.), Testing problems in per- 
spective. Washington, D.C.: American Council o 
Education, 1966. 

Harman, H. H. Modern factor analysis, Minols: — 
University of Chicago Press, 1967. J 

R. R. The predictive validity of background 

in Peace Corps selection. 


Kraerpy, T. W. A pilot study to investigate 
and evaluation 


M. R. E 


Lorn, F. M., & Novick, 
Reading, Mass.: Addison- 


oj mental test scores. 
Wesley, 1968. P 
Masry, H. P. Career change and continuity: Some 
social and orientative correlates among active 8 
terminated Methodist missionaries, 1946-1967. An 
unpublished dissertation, Boston University, 1969. 
Mexsixcer, W. W., & ExcnsH, J. Psychiatric 
casualties from overseas Peace Corps service. 
Bulletin of the Menninger Clinic, 1965, 29(3), 
148-158. = 
Miscnet, W. Predicting the success of Peace Corps 
Volunteers in Nigeria. Journal of Personality and 
Social Psychology, 1965, 1, 510-517. - 
Nunwatty, J. C. Psychometric theory. New York: 
McGraw-Hill, 1967. t 
Schurz, W. C. FIRO-B (Fundamental interpersonal 
relations orientation-behavior). Palo Alto, Calif. 
Consulting Psychologists Press, Inc., 1967. 


(Received November 13, 1972) 


Journal of Applied . d 


p 


t 
] 

í 
r 
l 


1974, Vol. 59, No. 1, 74- 


INTRINSIC VERSUS EXTRINSIC CUES AS DETERMINANT 
OF PERCEIVED PRODUCT QUALITY 


GEORGE J. SZYBILLO anp JACOB JACOBY * 


Purdue University 


This study tested a portion of Olson’s cue model which hypothesizes that, 
other things equal, intrinsic cues (e.g, physical product differences) will be 
stronger determinants of perceived-quality judgments than will extrinsic cues 
(e.g, price or store image). Ninety female undergraduates were randomly 
assigned to the conditions of a 2X 3X3 (Price & Store Image X Hosiery 
Product Samples) analysis of variance with repeated measures on the in- 
trinsic factor. Results supported the hypothesis; omega-square values revealed 
that 73% of the total variance was due to intrinsic product sample differ- 


ences, Additionally, Shapiro’s hypothesis 


t perceived worth would be more 


strongly related to stated purchase intentions than would perceived quality 


was supported. 


Purchase behavior in contemporary Amer- 
ican society requires, among other things, that 
the consumer be able to make judgments and 
comparisons across products, brands, and 
services. Such judgments are arrived at by a 
process in which some or all of the various 
items of information (i.e., cues) associated 
with the product, brand, or service are identi- 


œ fied, evaluated, and integrated to form a 


composite judgment. 

One judgment usually of considerable impor- 
tance to the consumer is the product’s qual- 
ity, and more than a score of published 
studies have examined the effects of specific 
cues on the judgment of perceived quality 
(cf. Jacoby, Olson, & Haddock, 1971; Olson, 
1972). Little thought, however, has been given 
to developing a conceptual framework for ex- 
pressing how and when specific cues or types 
of cues operate to form the composite quality 
judgment. Until recently, the most notable 
exception has been Cox (1962), who devel- 
oped a cue-processing model based upon the 
notions of predictive and confidence value. 

Other investigators (Jacoby et al., 1971; 
Valenzi & Andrews, 1971) have distinguished 
between intrinsic and extrinsic cues. The 
former refer to cues which, if changed, would 
produce a resultant change in the physical 
product itself (e.g., flavor and aroma cues for 
beer), while the latter refer to such factors 


1 Requests for reprints should be sent to Jacob 
Jacoby, Department of Psychological Sciences, 
Purdue University, Lafayette, Indiana 47907. 


as price and store image. While ne er 
Valenzi and Andrews (1971) nor Jacoby « al. 
(1971) have tested any a priori hypo ùe- 
ses regarding cue-intrinsicness—extrinsic \ ss, 
the latter speculate, on a post hoc ‘is, 
that intrinsic cues might be more in or- 
tant determiners of perceived quality an 
extrinsic cues. 

The most explicit and detailed conceptu- 
alization of the quality cue selection and 
integration process has been . by 
Olson (1972; Olson & Jacoby, 1972). One 
of the numerous propositions contained in this 
model is that, other things equal, intrinsic 
cues will be more strongly related to quality 
perception than will extrinsic cues. The pri- 
mary purpose of the present investigation was 
to provide an a priori test of this hypothesis 
using a product possessing different cues than 
were involved in the earlier studies. The in- 
trinsic cues employed by Jacoby et al. (1971) 
and Valenzi and Andrews (1971) were pri- 
marily olfactory and gustatory in nature, 
while the intrinsic cues for the product used 
in this study (ladies nylon hosiery) were 
primarily tactile. 

Price and store-image information were the 
extrinsic cues utilized in the present investiga- 
tion. Only two prior studies examined the 
effects of price and store image in combina- 
tion as cues to product quality (Andrews & 
Valenzi, 1971; Stafford & Enis, 1969). How- 
ever, subjects in both studies were exposed 
to all experimental treatment conditions, 
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that biasing-demand char- 
2) may have been intro- 
repeated measures were 
treatment factor in the 
cover, neither of these 
1al product differences to 
ctor in the experimental 
Jacoby et al. (1971) sug- 
edures may oversimplify 


that such 


quality perce; ion process. 
Finally, Shape (1970) presented evidence 
o indicate that ;«:ceived worth (i.e, value 


stronger relationship to 
i of purchase than does 
inasmuch as unbranded 
re used in the current in- 
‘Jationships between per- 
‘ceived quality, pe:ceived worth, and likelihood 
of purchase were «Iso examined. The specific 
hypotheses of the study were as follows: 


_ Hypothesis . Intrinsic cues (ie., in the 
present study, composition differences in three 
“different sample, of hosiery) exert a greater 
effect on quality perception judgments than 
"extrinsic quality cues (ie. in the present 
Study, price and store image). 
“Hypothesis 2. Perceived worth, defined as 
Value for the money, is more strongly related 
to stated likelihood of purchase than is 


perceived quality. 


r the money) 
ved likeli 
ved quali 


METHOD 


Subjects were 90 female Purdue University under- 
graduates enrolled in an introductory psychology 
course during the fall of 1971. They were required to 
have lived in the Lafayette area for a minimum of 
six months in order to insure that the independent 
Variable of store image was meaningful for them. 


Design 

A 2X33 factorial consisting of two levels of 
price (i.e, present vs. absent), three levels of store 
age (ie, high-quality store image, Jow-quality 
ore image, and no information provided as to 
ore image), and three levels of product samples 
three physically different samples—different 
ands—of hosiery) was employed. Repeated mea- 
ures were taken across the latter factor (Winer, 
1971, p. 559). Fifteen subjects were used in each of 
e six between-subject conditions. 


Independent Variables 


Price information was manipulated by either pre 
senting or withholding the actual prices ($0.98, $1 a. 
and $2.49) of the three product samples, 

Store-image information was manipulated 
either presenting or withholding the names of 
local stores purportedly selling the three product” 
samples. Two stores were selected on the basis ofa 
pretest which indicated that they represented high- 
and low-quality store images. The difference between 
the store with the highest pretest mean quality rat. 
ing (Ñu = 7.95, on a 9-point scale) and the store 
with the lowest mean quality rating (n= 2.56) 
highly significant (f = 16,58, 4% = 22, p < 001). S 

Each product sample of hosiery was identified by 
cither I, J, or R, since a pretest showed no differ- 
ences in preference across these three letters. 
letters were randomly assigned to product samples 
(ic. hose) for each experimental condition. For pur- — 
poses of a simplified exposition, the letters I, J, and 
R will hereafter be used to designate the low- 
medium-, and high-quality product samples, respec- 
tively. 

The three samples of hosiery were pretested on the 
attributes of knit, texture, workmanship, cut, and 
overall quality, Results of an analysis of variance of 
overall perceived-quality ratings indicated a signif- 
cant main effect for samples (F = 24.97, df= 2/2 
p< 0001), and Newman-Keuls analysis indica! 
that all differences between pairs of means for the 
three samples (X= 34.28, X. = 54.64, Xu = $4.50) : 
were significant beyond the O! level. These differences 
were in the anticipated direction. >, 


Dependent Variables 

The major dependent variable for this study was 
overall perceived quality. ‘The minor dependent varia- — 
bles were perceived worth and stated likelihood of 
purchase. All were measured using the 100. point 
scale procedure described in Jacoby et al. (1971). 


Procedure 


Subjects in each of the six cells were run sepa- 
rately. The subjects were seated at a long table and 
separated by table-top partitions to restrict between- 
subject communication. A female introduced the ex- 
periment as a psychological study designed to de- 
velop reliable instruments for measuring hosiery — 
perceptions. She then gave the subjects a question- 
naire booklet and demonstrated how to use the 
rating scales. Subjects were told they would be evalu- 
ating three samples of hosiery available locally. 
Hosiery samples were presented in wrapped gift 
boxes. Price information and store-image information 
were given, as appropriate, by labels attached to the 
samples and communicated verbally before subjects 
began evaluating the hose. Samples were check 
after each experimental session for damage, replaced 
if necessary, and new labels assigned. : 
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ResuLTS 
Manipulation Check 


The manipulation check for store image 
indicated a signiñcant main effect for quality 
oí store merchandise (F = 53.02, df = 2/87, 
p < .001). A Newman-Keuls analysis indi- 
cated that all differences between pairs of 
means for the three store-image conditions 
(£i = 3.80, Ro = 5.73, Ku = 7.93) were sig- 
nificant beyond the .01 level. These differences 
were in the anticipated direction, with the 
store names selected on the basis of the pre- 
test to represent low store image and high 
Store image receiving the lowest and highest 
mean ratings, respectively. 


Model for Data Analysis of Dependent 
Measure 


The model for repeated measures analysis 
of variance is appropriate if the following 
assumptions are satisfied: (a) homogeneity of 
the variance-covariance matrices and (b) 


Symmetry of the pooled variance-covariance 
matrix (Winer, 1971, pp. 561-562). For the 
_ dependent measure, tests for homogeneity of 
variance for variation due to subjects within 
_ Sroups and due to the interaction between 


samples and subjects within groups were 


_ computed. The F max tests for homogeneity 
_ of variance yielded insignificant results for 


variation due to subjects within groups ($ > 


05) and due to interaction between samples 
and subjects within groups (p > .05). How- 


ever, since complete assumptions about the 


TABLE 1 


MEANS AND STANDARD DEVIATIONS OF OVERALL 
PERCEIVED-QUALITY RATINGS 


Sample 
ay Price present Price absent 
I J R I J R 

Loy image 

it. 24.80 | 36.46 | 85,13 | 19.80 | 49.86 82.46 

SD 20.35 | 21.64 | 11.33 | 13.34 12.87 | 14.21 
No image 

X 26.46 | 54.73 | 82.73 | 24.73 51.33 | 86.86 

SD 15,25 | 13,04 | 15.60 | 12.43 | 14.45 742 
High image 

Ag 30.00 | 57.33 | 87.06 | 34.40 | 56.86 86.66 

SD 18.51 | 15.33 | 9.98 | 23.74 | 17.38 13.37 


Note. n = Is per cell. 
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TABLE 2 


ANALYSIS OF VARIANCE oF OVERALL PE: iy 
Quarry RATINGS 


Source df 

Between subjects 

Price (P) 

Store image (SI) 47* 

P XSI 

Error between groups 
Within subjects 

Product samples (PS) 

F * PS 


| 
SI X PS | 
P XSI XPS 

Error within groups 


4(2) | 417.58 
168 (84) 196.48 


Conservative df used in Greenhouse and G pri 
cedure, 

2 <0. 

5 <.001. 


variance-covariance matrices were not 
the Greenhouse and Geisser (1959) proc-du 
was used for all tests involving the s2\..ol 
factor. This procedure is negatively bia i 
that it uses conservative degrees of fre: 
making rejection of false null hypothese le 
likely. 


Data Analysis of Major Dependent Variable 


The means and standard deviations of over. 
all perceived-quality ratings for each experi- 
mental cell are presented in Table 1. Resul 
of the analysis of variance based on th 
data are summarized in Table 2. A strong 
product samples main effect was obtained (F 
= 394.85, df=1/84, p<.0001), and a 
Newman-Keuls analysis indicated that all dif- 
ferences between the pairs of means for thi 
three samples (X; = 26.76, X; = 51.10, Xn 
= 85.15) were significant beyond the O! level. 
The analysis of variance also indicated a sig- 
nificant store image main effect (F = 5.46, dj 
= 2/84, p <.01). A Newman-Keuls analysis 
indicated that only the difference between the 
means for the low-store-image (XI. = 49.75) 
and high-store-image (In = 58.72) conditions 
was significant (p < .05). 

Intrinsic quality cues were predicted to ex- 
ert a greater effect on quality perception than 
extrinsic cues. To test this hypothesis, omega- 
Square values were computed for the effects 
of price, store image, and product samples 
(Winer, 1971, p. 428). In terms of these in- 
dices, 73% of the total variance was due to 
the main effect of samples, and approximately 
1% was due to the main effect of store image. 
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ition was relatively ineffective in 
iny percentage of the total vari- 
an finding confirms Hypothesis 1. 

I present, store-image-absent condi- 
tion is study was analogous to Shapiro's 
(Ae perimental setting. The correlations 
bet verceived quality and stated likeli- 
ho irchase for samples I, J, and R were 
07 and .49, respectively. The correla- 
tior veen perceived worth and stated like- 
lihe purchase for these samples were .72, 
56 64, respectively. Based on tests ſor 
difi cs between dependent correlations 
(B & Kintz, 1968, p. 193), perceived 


like l of purchase was more closely re- 
lat perceived worth than was perceived 
qui for samples I and J (tı = 2.68, ty = 
2.6 12, p < .05) but not for sample R 
(t 75). These results partially confirm 
Hy esis 2. 
Discussion 

( stent with earlier research (Valenzi & 
And vs, 1971; Jacoby et al., 1971), in- 
trin. cues had a greater effect upon quality 
pe- tion than did extrinsic cues. More spe- 
cific ‘iy, composition differences in hosiery 
sam; cs had a greater effect (accounting for 
73%. of the quality rating variance) than did 


eithe: price or store-image information. In 
contrast to previous studies, the amount of 
variance accounted for by product composition 
differences in this study is extremely large. 
This may be an effect of moving from prod- 
ucts for which between-brand differences, al- 
though discriminable, have a relatively small 
range (i.e., butter and beer) to a product for 
which brand differences appear to have a 
relatively large range (i.e., hosiery). 
Comparing the relative strength of price and 
store image, the results indicate that while the 
effect of price was not significant, the aware- 
ness of store image did influence quality per- 
ception. Apparently, knowing the samples 
came from a store which carries high-quality 
merchandise increases quality ratings relative 
to knowing that the samples came from a 
store which carries low-quality merchandise. 
Although the store image effect was statisti- 
cally significant, its practical importance is 
mitigated by the small amount of variance it 


explained. Once again, price was shown 
have an insignificant effect on judgments 
perceived quality when it was one of se 
cues in a multicue setting (cf. Jacoby et al 
1971). 

Shapiro's (1970) hypothesis that pe 
worth is more strongly related to stated 
hood of purchase than is perceived q 
received partial support, and this has subs 
tial implications for practitioners. More 
cifically, and given that stated purchase ii 
tentions have some positive relationship 
actual purchase behavior, these results sug- 
gest that products should not be designed to 
be of highest quality as much as they should 
strive to provide the best (perceived) value 
for the money. : 

This investigation represents but one small 
test of Olson’s (1972) quality perceptior 


remain to be examined. Hopefully, this model 

and investigations such as this will serve % 

stimulate the research and additional con- 
ceptualization necessary for arriving at a com- 
prehensive understanding of the quality per- 

ception process, E 

* 
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on demographic and socioeconomic variables, 


but such variables have not added much to 


the understanding ^f the psychology of 
sumer choice processes. There is no reason to 


expect demographic or general personality) 


trait measures to correlate well with behavior, 
as has been shown in many consumer studies 
(Kassarjian, 1971). Fishbein (1967) has 
pointed out the same type of reasoning in 
studying attitude-behavior linkages. General 
measures of attitude may be compatible with 


many different behavioral responses; hence, ! 


such a general measure should not be expected 
to correlate highly with any given response. 
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RE ONSHIP OF INFORMATION-PROCESSING ATTITUDE 
STRUCTURES TO PRIVATE BRAND 
PURCHASING BEHAVIOR ' 
JAMES R. BETTMAN * 
Graduate School of Management, University oj California, Los Angeles 
Variables related to consumer information-processing models and consumer 
attitude structures were used to discriminate between private and nonprivate 
brand purchasers. Discriminant analysis was used to derive discriminant func- 
tions and to classify subjects. In general, variables reflecting lower risk and 
greater information were associated with private brand choices. Approximately 
80% of the subjects in the three product classes studied were correctly classi- 
fied. A scrambled sample validation procedure showed that these results were 
not spurious. Arguments for the general usefulness of an information-processing 
and decision-structure-oriented approach were advanced. 
Ther :ave been several research studies 

that h attempted to characterize those 

consumo purchase private brands and to 

disting them from those who buy non- 

private nds. To date, the results have not 

proven y successful. For example, Frank 

and B (1965), defining private brands as 

those owed by either retailers or distributors 

and nouprivate brands as those owned by 

manufacturers, find virtually no predictive 

power for demographic and socioeconomic 

variables in attempting to predict the propor- 

tion of a household’s private brand purchases. 

They conclude that private brand and non- 


private brand consumers are essentially in- 
distinguishable with respect to such demo- 
graphic measures. Similarly, Myers (1967), 
in attempting to predict private brand atti- 
tude, found very little predictive power for 
general personality variables and socioeco- 
nomic factors. Finally, Burger and Schott 
(1972) again found that demographic varia- 
bles did not discriminate private brand from 
nonprivate brand buyers. However, they did 
find some degree of discrimination with atti- 
tudinal variables. 

These results stem perhaps from the lack 
of a proper analytic framework. It is cer- 
tainly easier to form market segments based 


+ This research was partially supported by the 
Division of Research of the Graduate School of 
anagement, University of California, Los Angeles. 
2 Requests for reprints should be sent to James 
R. Bettman, Graduate School of Management, Uni- 
versity of California, Los Angeles, California 90024. 
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In contrast, the more specific information- 
processing models have proven successful in 
modeling buyer choices in several areas (Bett- 


man, 1970; Haines, 1969). Such models uti- 


lize complex decision networks composed of 


product perceptions and situational variables. a 


Haines (1969) has noted the similarities be- 


tween this approach and the notion of com- 
plex attitude structures developed by 
Schroder, Driver, and Streufert (1967). They 


see attitude as consisting of two types of prop- — 


erties: 


An attitude is defined in terms of content properties i 


(magnitude and direction; for example, x degree of 


conservatism) and structural properties (the com- 


plexity of conceptual rules involved in processing 2 
information about stimuli relevant to the attitude 5 


area) [p. 129]. 


These structural properties of attitude seem 2 


to be what is depicted in the consumer deci- 
sion net information-processing models. As 
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Schroder et al. (1967) state, “ ‘structural 
variables’ provide a metric for measuring the 
way a person combines information . . . for 
adaptive purposes [p. 4].” Thus, consumer 
attitude structure properties represent the 
internal model used by the consumer to proc- 
ess data about stimuli relevant to product 
choice. 
1 This suggests that, in general, developing 
nan information-processing model of a choice 
__ Process can conceivably be considered as simi- 
lar to the problem of assessing attitude- 
behavior relationships, when attitude is repre- 
"sented by both content and specific attitude 
4 Structural properties. Fishbein (1972) has 
_ found that a fruitful approach to studying 
a ttitude-behavior relations involves using 
_ Situation-specific attitudes and measures of 
attitude- belief components, This suggests an 
approach to the problem of discriminating 
between private and nonprivate brand pur- 
chasers, Useful variables for such differentia- 
tion are likely to be both related to the con- 
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Ihe objective of this study is to reexamine 
F, the phenomenon of private brand purchasing 

from a different perspective than that of ex- 
. plaining behavior by using general traits and 
y descriptors. In particular, a situation-specific 
information processing approach is taken. 
This study tests the usefulness of this infor- 
mation processing viewpoint. Those variables 
_ previously found to be contained in the atti- 
tude structures active in processing informa- 
tion in the grocery brand purchase situation 
were used to attempt to discriminate between 
private and nonprivate brand choices. To the 
extent that this more micro and situation- 


specific approach is successful, the i 
tion-processing approach is supported 


METHOD 

Subjects 

The subjects for the present study were ! use 
wives selected from the Los Angeles area ub 
jects were mostly wives of students livi the 
married student housing complex at the | rsity 
of California, Los Angeles and mem! ia 
women's group. The subjects completed a ind- 


pencil instrument measuring the informa roc- 
essing variables and some behavioral var for 
each of nine types of grocery products. S; ally, 
the type of brand purchased and the attitu ruce 
ture variables were obtained as outlined be! 


Variables 


The subjects were asked to check which list 
of brands in a product class they usually pure wed 


This allowed determination of the dependent vay able 
of private brand or nonprivate brand choix here 
were large enough groups of private brand has- 


ers to analyze for three of the nine produc ses: 
paper towels, aspirin, and margarine. 


The 10 independent variables measured into 
four main types of variables which had pre- ously 
been found in consumer information-processir. atti- 
tude structures for grocery products: demos phic, 


perceived risk related, perceived product ality 
related, and information related (Bettman, 1°70). 
Only 1 demographic variable was used, num of 
children living at home. 

For the perceived risk variables, a measure oi the 
inherent riskiness in a product class was developed 
by using paired comparisons with a 10-point inten- 
sity scale. For each comparison, each subject rated 
which of a pair of product classes would be more 
risky to purchase in an imaginary store where all 
brand levels and prices were disguised. The subject 
then rated how much more risky the chosen class 
would be on the intensity scale (with 0 meaning 
equally risky and 9 meaning much more risky). 
These data for each subject were then process.d as 
follows to obtain a risk scale. Suppose product classes 
i and j were the two classes in the pair, and denote 
the numerical rating obtained by dij. The following 
sign convention was used: if i is more risky than j, 
dı; > 0; if i and j are equally risky, diy O; and if 
7 is less risky than j, ds; <0. Since this rating was 
obtained for all the distinct product class pairs only 
(ie, 9-8/2 = 36 pairs), set dje S di. Then scale 
values for each product class s; were obtained by 
setting sı = Xj ide Thus, the sı represent the 
relative degree of risk for that subject for each 
product class; that is, the s: for each subject summed 
to zero over the nine classes, In addition, the mea- 
sures of certainty and danger components of risk 


This is akin to Thurstonian scale values. For an 
example of the use of this technique, see Bechtel, 
Tucker, and Chang (1971, pp. 374-375). 
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sham (1967) were used, adapted to 
scale format, 
ariables related to perceived product 
ects were given a set of brand names 
t class (each subject had the same 
nber of brands given differed across 
. The subjects then placed these 
)-point quality scale, ranging from 
ery high quality, The subjects also 
vest level of quality for each product 
ld be acceptable to them. Given this 
ing measures were developed for 
lass. Mean quality was measured as 
the quality levels marked for the 
ads. Relative variance was obtained 
the sample variance in quality for each 
iding by the maximum variance in 
ch subject over all nine product classes. 
level was measured simply as the 
marked as the minimal acceptable 
percent acceptable was defined as the 
the sample of brands for cach class 
o or above the minimal acceptable 
ty by each subject. 
national variables were used. Mean 
obtained by having subjects rate each 
sroduct class on a 10-point familiarity 
verage rating across brands was then 
dence in information was measured on a 
ing scale for each product class. 


| Analysis and Hypotheses 


the dependent variable of private brand 
z was a binary variable, a two-group multi- 
inant analysis was performed using the 10 
{ lent. variables defined above. This analysis 
y yielded both discriminant functions and a classifica- 
= for each of the three product classes ana- 
As discussed above, the information-processing 
Variables measured should be related to consumer 
choice behavior, since these variables are contained 
in the attitude structures influencing consumer 
choice, and should also be related to the content 
Properties of the attitudes invoked in the choice 
Situation. In particular, private brand purchasing 
should be associated with a lower degree of risk and 
Uncertainty attributed to a product class, higher and 
less variable perceived quality for brands in the 

„and more information and familiarity with the 
Product class. These hypotheses follow from the gen- 
eral notion that private brand purchasing is more 
likely when the consumer is confident she can ob- 
tain satisfactory performance as well as save money. 

9, the greater the number of children at home, 
the greater the familiarity with the class and the 
Motivation to economize on purchases are likely to 
be. Frank and Boyd (1965) found a slight associa- 
tion between family size and private brand pur- 
5 These general arguments may be translated 
Into specific terms by hypothesizing expected signs 
for the variables in che discriminant functions. Pri- 
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m 


vate-brand-purchasing group membership 

associated with lower inherent takes 
greater certainty (+); less perceived danger (=); 
higher mean quality and percent acceptable (4); 
lower acceptable level and relative variance g 
greater mean familiarity and confidence (+); ai 
more children at home (+). 


ResuLTS 


The discriminant functions are 
Table 1 and the classiñcation results in Tab 
2. The effective sample sizes for each produc 
class were smaller than 123 because some su 
jects did not purchase in a product class 
purchased brands not listed in the sample 
brands used in the study. 

Generalized Mahalanobis distance was u 
to test the significance of the discrimi 
functions.* All three functions were sig y 
at $ < .001. Note that, in general, the varia 
ble coefficients are as hypothesized. One m 
be careful here, however, since many of th 
coefficients are not significantly different from: 
zero. Finally, several of the coefficients hay 
what appears to be the wrong sign becay 
they are acting as suppressor variables (D 
lington, 1968); that is, the variable actually” 
correlates in one direction with the dependent 
variable but has the opposite sign in the 
equation because it is correcting for error in 
another variable. This is true for percent 
acceptable for paper towels, acceptable level 
for aspirin and margarine, mean quality for 
margarine, mean familiarity for garine, 
and confidence for paper towels. The main 
counter-hypothesis finding seems to be for 
mean quality. By examining the magnitu 
of the standardized coefficients, it can be seen 
that, in general, certainty and percent ac 
ceptable are most important for discrimina- 
tion, although the results vary over product 
class. 

The fact that the discriminant functio 
are statistically significant does not mean that 
group members will be correctly classified. In 
the present study, however, the functio 
classify quite well, as can be seen from Ta 


ý 


4 Generalized Mahalanobis distance can be used 
test the hypothesis that the mean vectors for 
two groups are equal. Under this null hypothe 
the distribution of this quantity approaches that of 
chi-square. For more details, see Afifi and Azen 1972, 


especially pp. 235-250). 
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TABLE 1 
PRIVATE VERSUS NONPRIVATE BRAND DISCRIMINANT FUNCTIONS 
Coefficients for the product classes 
Variable x 7 = ie = y 
9 5 Paper towels Aspirin Margari 
(n = 115) (nm = 113) | ( = 11 
Constant 957 464 —.11³ 
Children at home 059 (148) 013 ( 030) 004 ( 0 
Inherent risk —.002 (—.007) —.008 (—.064) —.024 (—.1 
Certainty 056 ( 3355)" 025 (162) 030 ( 1 
Danger —.008 (—.043) —.034 (—.221)* —.033 (—.1° 
% acceptable —.177 (—.111) 388 ( .260)* 475 (37 
Relative variance —.178 (—.115) —.273 (—.189)** —.094 (—.0 
Acceptance level —.032 (—. 220) 007 (061) 035 ( 9 
Mean quality —.029 (. 090) —.050 (—.247)** 034. (—. 
Mean familiarity —.055 (—.229)** -018 ( 057) —.008 (—.03 
Confidence 020 (116) 018 (114) 921 (147 
Mahalanobis D? 39.991 90.363 45.682 * 


Note. Standarized coefficients are given in parentheses, 
à Variable acting as a suppressor variable. 
*p <.10. 
Dp <.05. 
ol. 


2. The percentages correctly classified are 
77.4% for paper towels, 82.3% for aspirin, 
and 80.5% for margarine. However, extreme 
care must be used in interpreting these results. 

First, since the groups of private and non- 
private brand purchasers are of unequal size, 
the appropriate chance classification model 
Must be determined as a standard of com- 
Parison. Morrison (1969) argues that the 
appropriate model is normally the proportional 
chance model. If p is the true proportion of 
private brand purchasers in the sample and a 
is the proportion actually classified as private 


TABLE 2 


CLASSIFICATION OF SUBJECTS USING THE 
DISCRIMINANT FUNCTIONS 


Product class 
Classification P 
Paper a Mar- 
towels Aspirin garine 
Actual private brand 
(classed private brand) 23 23 15 
Actual private brand 
(classed nonprivate brand) 5 7 7 
Actual nonprivate brand 
(classed nonprivate brand) 66 70 80 
Actual nonprivate brand 
(classed private brand) 21 13 16 
% correctly classified 77.4 82.3 80.8 
2 private brand 
correctly classified 82.1 16.7 68.2 


brand purchasers, then the probability of 


correct classification by chance and, 
the chance proportion correct is given b 


Dla a). 


For the three product classes of this study, 
the proportional chance figure is 56.0% for 
paper towels, 58.5% for aspirin, and 64.9% 
for margarine. Thus, the results were sub- 
stantially better than this chance criterion. 

The discriminant analysis of this study 
used all the data to estimate the discriminant 
coefficients because of interest in obtaining 
the most stable estimates of the coefficients 
possible, given the relatively small sample. As 
Frank, Massy, and Morrison (1965) point 
out, if the function based on all the data is 
then used to classify these same data, there is 
an upward bias in the percentage correctly 
classified. Since splitting the sample was un- 
desirable for coefficient estimation purposes, 
the modified V, validation procedure sug- 
gested by Frank et al. (1965, p. 255) was 
applied. The raw data were “scrambled” by 
assigning individuals to the private brand and 
nonprivate brand groups at random, while 
keeping the sizes of the two groups equal to 
their actual values. Discriminant analysis was 


P(correct) = p a+ (1 


Peek on M T | 


performed on this sample. This pro- 


ce ure was carried out 10 times for each prod- 
u lass. The average percentage correctly 
c l for each class was 63.5% for paper 
ti 1.9% for aspirin, and 63.5% for 
n Since the actual results are all 
si illy higher, the results do not seem 


S| because of bias. 

Morrison (1969, pp. 160-161) 
that the smaller group is usually 
» classify well, although it is often 
of interest. In that light, the results 
vivate brand group are particularly 
impressi, since 82.1%, 76.7%, and 68.2% 
ire class ed correctly for paper towels, as- 
in, and margarine, respectively. 


point 

difficu: 
the gri 
for the 


Discussion 


“Le results bear out the contention that the 

I ol information processing, with its 
er an the attitude structural properties 
t process information by the consumer 
the variables contained in these struc- 
is more suited to analyzing consumer 
vior than approaches based on ad hoc 
demographic or general personality trait ap- 
aches. That is, perceived risk, informa- 
i, and perceived-product-quality variables 
are seen as parts of the information-processing 
attitude structural properties that in conjunc- 
tion with attitude content influence behavior. 
The ability of the discriminant analysis to 
differentiate among the private and nonprivate 
brand groups bears out this line of argument. 
In general psychological terms, this type of 
viewpoint argues for closer attention to mod- 
els of choice situation perceptions and choice 
processes, In this sense, the perspective is 
similar to that of Fishbein (1967, 1972) in 
arguing that relationships with behavior will 
be found only if the actual information-proc- 
essing aspects of the choice situation are in- 
vestigated and variables related to the active 


INFORMATION-PROCESSING ATTITUDES AND PRIVATE BRAND PURCHASING 
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attitude structures in the situation are cons! 4 
ered. 
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SHORT NOTES 


PREDICTION 


OF ARTISTIC PERFORMANCE FROM 


BIOGRAPHICAL DATA 


LAWRENCE R. JAMES,? ROBERT L. ELLISON, DAVID G. FOX 


Institute for Behavioral Research in Creativity, Salt Lake City, Ulah 
CALVIN W. TAYLOR 
University of Utah 


A biographical inventory was constructed and administered to 312 nonart 
students and 501 art students in the areas of music, dance, theater, and the 
visual arts. The biographical inventory had significant average cross-validities 
against an art versus nonart criterion (.67, p < .01) and multiple art criteria 
(art sample only) both within and across artistic areas. A tentative conclusion 
was that prediction of art versus nonart potential and prediction of actual 
performance in the arts require different scoring procedures for the biograph- 


ical inventory. 


Relatively little research has been conducted 
on the prediction of different criteria of perform- 
ance in the arts (Anastasi, 1961). For those 
studies that have been conducted, however, re- 
sults have been promising (Anastasi & Schaefer, 
1969; Burkhart, 1962). Schaefer and Anastasi 
(1968), for example, demonstrated that bio- 
graphical information could predict a “creative 
artistic” versus a “control artistic” dichotomous 
criterion for adolescent boys. No studies were 
found on the prediction of several different 

criteria common to different art areas. 

The present study examines the concurrent 

_ Cross-validities of a biographical inventory devel- 
oped to predict an art versus nonart dichotomous 
criterion as well as several different performance 
criteria for art students in four art areas (music, 
dance, theater, and the visual arts). It was 
assumed that two different biographical inventory 
scoring procedures would be required, one for dif- 
ferentiating between art and nonart potential and 
one for predicting actual performance in the arts, 

This assumption was tested by comparing the 

cross-validities against art performance criteria 

for both of the above scoring procedures, 


= 
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= 
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METHOD 
Samples 


The biographical inventory was administered to 
835 art students and 363 nonart students. Return 


1 This research was supported by Grant OEG- 
8-9-540215-4004 (010), Office of Education, U.S. 
Department of Health, Education, and Welfare. 

2 Requests for reprints should be sent to Lawrence 
K. James, who is now at the Institute of Behavioral 
Research, Texas Christian University, Fort Worth, 
> Texas 76129. 


rates were 60% for the art sample and 86%- for 
the nonart sample. The art sample (n = 5 


tained was comprised of 106 subjects a 
private high school specializing in the ar 60 
subjects from a state summer school for p- 
tional art (or academic) high school student: nd 


130 and 105 subjects, respectively, from iwo 
urban high schools offering programs in the 
The nonart sample (n = 312) was comprised of 
33 subjects from the state summer school and 197 
and 182 subjects, respectively, from two urban 
high schools, 

No attempt was made to match the art and 
nonart samples on such biographical data as 
experience in the arts and sex, because the 
investigators were concerned with examining 
existing differences. However, the only practical 
difference between the two samples was a signifi- 
cantly higher proportion of females in the art 
sample (.61. vs. .41). 


Biographical Data 


By using either the original wording or wording 
revisions for art students of items developed in 
an earlier study concerned with predicting scien- 
tific and engineering performance (Taylor & 
Ellison, 1967), 249 biographical inventory 
items were made available for this study. 
The biographical inventory items were sepa- 
rated into the following two categories: (a) 
200 general biographical items (e.g., develop- 
mental history, parents and family life, extra- 
curricular activities, etc.) and (b) biographical 
items specific to the arts (e.g., number of hours 
devoted to daily practice, artistic accomplish- 
ments, special schools attended, etc.). The latter 
set of items was comprised of items that signifi- 
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cantly predi: eativity and other performance 
criteria in t} entific and engineering studies 
but that ha en rewritten to correspond to 
artistic ende Nonart students answered only 
the first 20 ns, while art students answered 
all 249 ite Data for academic grade point 
average, Sex and experience in the arts were 


also obtain 


Criteria 

Multiple ria were obtained for the art sub- 
jects for lowing four art areas: music, 
dance, the ind the visual arts. For each art 
subject, or faculty member completed sepa- 
rate single ratings (15-point scale with even 
alternative ed) on (a) sensitivity, (b) moti- 
vation, (c ression of self, and (d) potential; 
a three-it« reativity checklist (one summed 
score); a i as four specific art-area check- 
lists (corr: iding to the music, dance, theater, 
and visua! ireas, with each checklist providing 
a summed ve across six items), on which 
art facult mbers rated only subjects in their 
own art Ge., each subject was rated on 
one check only), In order to provide behav- 
ioral relev: to these criteria (i.e., wording the 
measures meaningful artistic terminology that 
Was amena o measurement), the rating and 
checklist ia were constructed with the 
assistance t faculty and students. An excep- 
tion was tte creativity checklist, which retained 
the form previously employed in research on 


engineers (Taylor & Ellison, 1967). 
: iterion data for the art subjects 
included self-reported number of awards and 
Major art-area grade point average. 


ANALYSES AND RESULTS 
Prediction of Art versus Nonart Criterion 


An item analysis was conducted to predict the 
art versus nonart dichotomous criterion on the 
Combined art and nonart samples (N = 813). 
Randomly selected halves of the art and nonart 
Samples were combined, and a double cross- 
Validation design was employed. Only the first 
199 general biographical inventory items were 
analyzed. The average of the two cross-validities 


against the art versus nonart criterion was 67 
(Fisher 2). 


Relations among Criteria 


The correlations between the four ratings, the 
creativity checklist, the four art-area checklists, 
and major art-area grade point average were all 
Significant and ranged from .33 to .83. Although 


the presence of some halo, or a lack of discrimi- 


Nores 8 


nant validity is indicated, the significant correla- 
tions between the four ratings (sensitivity, moti- 
vation, expression of self, and potential) and the 
separate art-area checklists provided evidence 
that the ratings were pertinent measures of 
artistic performance within each of the four art 
areas. Generally low to moderate correlations be- 
tween the number-of-awards criterion and the re- 
maining art-area criteria indicated the sampling of 
different criterion domains. The pattern of gen- 
erally low correlations between the art criteria 
and academic grade point average (rs = .11-.54, 
mean r= 20) implied that different subsets of 
abilities were required for successful performance 
in these two fields, Finally, the correlations be- 
tween the art criteria and the remaining control 
scores (e.g., age, sex, and experience in the arts) 
were generally low and not significant, An excep- 
tion was the number-of-awards criterion which 
correlated .36 (p <.01) with experience in the 
arts, Experience in the arts was partialed out of 
the number-of-awards criterion. 


Prediction of Separate Art Criteria Across 
Art Fields 


An item analysis was conducted on the bio- 
graphical inventory responses of the art sample 
to predict the different criteria of artistic per- 
formance. A double cross-validation design was 
employed on approximately equal subsamples of 
art subjects (us = 250 and 251). All 249 bio- 
graphical inventory items were analyzed; how- 
ever, average cross-validities were obtained sepa- 
rately for the first 200 biographical inventory 
general items, the 49 biographical inventory items 
specific to the arts, and all biographical inventory 
items combined. 

Significant average cross-validities (p <.05) 
were obtained for all art criteria. The average 
cross-validities provided by the 49 art items 
(highest average cross-validities per criterion 
ranged from .30 to .54) and all items combined 
(31 to .57) were generally higher than the 
average cross-validities provided by only the 200 
general items (.28 to .50). For example, the bio- 
graphical inventory scoring key constructed to 
predict the sensitivity rating from the 200 general 
items had average cross-validities of 31, 41, and 
.50, respectively, against the sensitivity rating, 
potential rating, and a composite (equally 
weighted) of the potential rating and the number 
of awards (with experience partialed out). The 
biographical inventory key specifically con- 
structed to predict the sensitivity rating on the 
49 art items had average cross-validities of .37, 
47, and .54, respectively, against these same 
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criteria, However, only 2 of a possible 12 com- 
parisons (for 12 criteria) between average cross- 
validities for the scoring keys provided by the 
general items versus the specific art items reached 
significance (two-tailed test for nonindependently 
distributed correlations, p < .05). Therefore, 
strong evidence was not provided that specific art 
items were more predictive of art performance 
than general items. 


Predicting Art Criteria with the Art versus 
Nonart Scoring Key 


Because the subsamples of art subjects were 


preserved throughout the analyses, it was pos- 
sible to determine the average cross-validities 
of the art versus nonart scoring key (constructed 
to predict the art versus nonart criterion) 
against the art criteria for the art sample. With 
one exception, the average cross-validities of the 
art versus nonart scoring key against the art 
criteria were significant (.14 to .53). However, 
comparisons of the validities for the art versus 
nonart scoring key and the scoring keys devel- 
oped on the 49 specific biographical inventory art 
items to predict the art criteria demonstrated 
that the latter scoring keys had significantly 
(p < .05) higher average cross-validities in 7 out 
of 12 possible comparisons. Further, the art 
versus nonart scoring key had significantly 
b < .05) lower validities in 3 out of 12 compari- 
sons with the scoring keys developed on the same 
biographical inventory general items to predict 


the art criteria. 


Discussion 
The present study indicated the feasibility of 
conducting studies of artistic performance using 
common criteria and predictors across different 


86 SHORT Noves 


art areas, such as music, dance, theate d the 
visual arts. This is not to imply that udies 
within separate art areas or comparisons tween 
areas would not yield higher validities ional 
criterion development and construct valid. ion of 
art criteria are needed. 

The average cross-validity received pre- 
dicting the art versus nonart criterio this 
study (.67) was approximately the san =s the 
results found in the Anastasi and Schae‘«» studies 


(1969; Schaefer & Anastasia, 1968). wever, 
the art versus nonart scoring key was den not 
as valid against actual performance in the arts 
as biographical correlates specifically determined 
by keying biographical inventory items to sredict 
artistic criteria. A tentative conclusion s that 
different biographical inventory scorin. keys 
should be employed to predict an ari versus 
nonart criterion and performance crit ‘n the 


arts; however, predictive rather than «oncur- 
rent validation studies are required more 
conclusive evidence. 
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METHOD 


Subjects were 470 U.S. Military Academy 
graduates from the classes of 1963 through 1966 
who responded to a mailed questionnaire con- 
cerning job satisfaction (response rate = 60%). 
The four classes were about evenly represented; 
and 331 of the respondents were still on active 


The questionnaire contained a list of 31 job- 


Journal of Af sychology 
1974, Vol. 59, ` 87-89 
U OF WEIGHTING AND DATA ON ALTERNATIVE CAREERS TO 
PREDICT CAREER RETENTION AND COMMITMENT * 
ALD W. McLAUGHLIN AND 
ia Polytechnical Institute 
This study investigated the desirability of collecting data on the importance 
of job characteristics and anticipated satisfaction with these characteristics in 
in alternative career to predict career commitment and retention. Subjects 
vere 470 graduates of the U.S. Military Academy. Six data bases were devel- 
oped and evaluated in terms of several validity measures. Results indicated 
that data on alternative careers can be meaningful, but data on the importance 
of job characteristics produced little practical improvement. 
The p e of this study was to evaluate two 
alterna trategies in data collection to pre- 
dict ca: commitment and retention. These 
strategi neerned the value of using the im- 
Portanc specific job characteristics as 
weights d the value of using data concerning 
anticipa satisfaction with the job character- 
istics i alternative career. duty in the Arm 
The ability of item weighting has obtained z 1 
both po. ‘ive and negative support. Studies by 


Glenne Owens, Smith and Albright (1960), 
Froehlich and Wolins (1960), and Youngberg, 
Hedbe: end Baxter (1962) led to the conclu- 
sion the: item importance was relevant to job 


On the other hand, work by Larsen 
(1965) and Mikes and Hulin (1968) 
ubstantiate these results. Ewen (1967) 
d results which led him to conclude 
might be of value, but should be 
y evaluated prior to utilization. 

The question of obtaining data on satisfaction 
with alternative jobs has received less attention, 
with the primary impetus coming from research 
done by Porter (1962). The adaptation of this 
Concept supports the belief that a persons job 
satisfaction is not a simple function of satisfac- 
tion with aspects in the current job, but is also 
related to the anticipated satisfaction with those 
aspects in an alternative job. It is thus a focus 
on relative satisfaction rather than absolute 
satisfaction. 


1 Any conclusions in this report are not to be 
construed as official U.S. Military Academy or De- 
Partment of the Army positions unless so designated 
by other authorized documents. 

2 Requests for reprints and a more complete ex- 
planation of methodology and results should be sent 
to Richard P. Butler, Office of Institutional Re- 
Search, U.S. Military Academy, West Point, New 
York 10996. 
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characteristics that were derived from prior 
studies by Maslow (1954) and Herzberg (1959). 
Respondents were asked to rate the importance 
of a job characteristic in selecting a career, their 
satisfaction with it in the military, and their 
actual or perceived satisfaction with it as a civil- 
ian. The response scale for the importance of a 
job characteristic in selecting a career ranged 
from O to 4 (not relevant to crucial). Satisfac- 
tion was obtained on a 5-point scale of —2.0 
to 2.0 (very dissatisfied to very satisfied). 

Two criteria measures were obtained. The 
first was a measure of retention, that is, those 
subjects still on active duty scored 1, and those 
resigned from the military scored 0. The second 
criterion was used only for those still on active 
duty, and consisted of a 7-point scale that mea- 
sured commitment to a military career. This 
scale has been shown to have reliability and pre- 
dictive ability (Bridges, 1967), with high com- 
mitment having a score of 7 and low commit- 
ment a score of 1. 

Three basic models were employed in the 
analysis of the data to develop initial data bases. 
The first model included information only on 
satisfaction with the 31 job characteristics in a 
military career. This data base is the one typi- 
cally collected in studies of this type and was 
therefore named the classical model. The second 
data base contained two responses to each item, 
one for military satisfaction and one for civilian 
satisfaction, and is viewed as an alternative 
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model, The final data base also contained infor- 
mation from both areas of satisfaction but used 
as the final measure the difference between mili- 
tary and civilian satisfaction on each item, It is 
therefore labeled the difference model. 

Multivariate procedures of factor and cluster 
analyses were used to reduce each data base to 
= set of scores. The weights for importance were 
used to form a second set of scores for each 
data base by multiplying each datum in a score 

by its importance. This methodology produced 

six different sets of scores for each respondent: 
_ Classical, alternative, difference, weighted classi- 
cal, weighted alternative, and weighted differ- 
ence. Twelve forward Stepwise regressions were 
run, using each of the six sets of scores with 
both of the criteria, Within each criteria, ¢ 
tests for correlations from correlated data were 
computed to compare the validities obtained 
from the various sets of scores. 


TABLE 1 
COMPARISON OF Six MODELS on THE 
Two CRITERIA 


Correlations 
Model 1 vs. Model 2 L 
Ne | zo rei, 
In vs. out 
Difference vs. weighted 
difference 52 | 51 | .99 1.87 
Alternative vs, weighted 
alternative 54 52 97 2.12“ 
Classical vs. weighted 
classical 47 45 93 1.46 
Classical vs. alternative 47 54 85 |3.08* 
Difference vs. alternative 52 | .54 | .91 1.22 
i Career commitment 
Er 
Difference vs. weighted 
_ difference 63 65 | .98 | 2.50* 
Classical vs. weighted 
classical 58 60 | .97 | 1.95 
Alternative vs. weighted 
alternative 63 64 93 20 
Classical vs. alternative 58 63 | .90 2.18 
Difference vs. alternative 68 (63 94 46 


Note. For N = 470 (for the in versus out criterion) and 33 

r 31 

_ (for career commitment), a £ value greater th i 
Significant at the .05 level. ae ee 


^ Fi, is the correlation between the first model in a gi 
_ row and the criterion, Been 


ee is the correlation between the second mod li i 
$ 8 el in a giv 
do and the criterion, such 


rue is the correlation between the two models in a given 
OW. 
*p <05. 
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RESULTS 

The multivariate procedures produced ster 
scores for both the classical and differe: lata 
bases, and 15 for the alternative da ase, 
The internal consistencies of the sets of ster 
scores, for each of the three models, rang rom 
moderate to high. Cronbach alphas vari rom 
34 to .85, with a median of .69 for the sical 
model; from .34 to .86, with a median of 
the alternative model; and from .55 to with 
a median of .73 for the difference moc The 
34 alphas are much below the next west 
alphas, which were .61 for classical and rna- 
tive. Item weighting produced lower mea s of 
internal consistency in 17 out of the 20 has 


which changed (z= 3.1, p < .05). This 
was significantly more than the 10.0 expe 
chance. 

The correlations between the models 
two criteria, and the tests of signific 
compare the validities of the models ar: 
in Table 1. It should be noted that for 
havioral “in versus out” of service criter! 
alternative model was a significantly bett: 
dictor than the weighted alternative and ci 
models. For the career commitment cr: 
the weighted difference and alternative r 
were significantly better predictors than 
difference and classical models, respectively. 


the 


Discussion 


In terms of the efficiency of the final regres- 
sion models, the validities for the weighted 
models were lower for the on active duty versus 
not on active duty criterion, and higher for the 
commitment criterion. The ¢ values showed the 
differences to be significant for two of the six 
comparisons. However, the high correlations be- 
tween the weighted and unweighted scores within 
a given data base enhanced the chance of find- 
ing small differences to be significant. It, there- 
fore, appears that the differences of .02 in the 
multiple correlations that occurred have very 
little practical significance. 

The use of the alternative data base produced 
significant improvement over the classical data 
base for the unweighted models for both criteria. 
However, the model for the alternative data did 
not make any significant improvement when 
compared to the difference model. Similar results 
were found for the parallel analyses with 
weighted data bases. 

In conclusion, the results of this study fail to 
Support the weighting of job satisfaction items 
by their importance when a behavioral, dichoto- 


mous Cr! 


ing a cri 
However, 
to be so 
added tin 
weights, 
need to 
both the 
decision ! 
sure for 
rately m 
affect 
equation 


Bricks, 
comm: 
Acade:. 

Ewen, 
factio 

6568-78. 

Fxornt, 


slight ben 


is used. There may be some 
applying weights when employ- 
that produces continuous data. 
enefits that can be derived appear 
as to be more than offset by the 
cost involved in collecting item 
study does, however, support the 
ı job characteristic satisfaction on 
ent and alternative job. While the 
lude this data as a difference mea- 
or to include each response sepa- 
have theoretical interest, it did not 
‘ficiency of the final regression 
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PREDICTORS OF MANAGERIAL CAREER 


MICHAEL L. MOORE * 


Michigan State University 


JOHN FOSSUM 


University of Wyoming 


Relationships among managers’ current managerial level, age, perceived posi- 
tion-learning time, level of formal education, and their career expectations 
were investigated. Results from a sample of 203 managers from five plants in 
one corporation indicated the current age of the manager was the most potent 


predictor of his career expectations. 


At any given point in a manager’s career, he 
is likely to have expectations regarding the level 
he will ultimately achieve in his company. This 
expectation is derived from such self-perceptions 
as his relative qualifications, opportunities to use 
his expertise, performance to date, and views of 
internal company politics. In addition, his expec- 
tations of promotability may be related to the 
length of time he perceives that it should take 
him to learn jobs enroute to his expected 
career endpoint (Glickman, Hahn, Fleishman, & 
Baxter, 1968, p. 25). 

Tsolating the social learning variable, Moore 
(in press) found support for Jennings’s (1967) be- 
lief that most managerial jobs can be learned 
in two years or less. Moore found a mean per- 
ceived position-learning time of about 19 months 
with a mode of one year. Position-learning 
time is defined as the amount of time reported 
as necessary to learn the work responsibilities 
and duties defined by the organization’s position 
description in terms of functions, hierarchical 
level, liaison, decision making and supervision. 

The present study examines the relationships 
between perceived position-learning time, age 
and educational variables, and a measure of career 
expectations, to determine whether shorter per- 
ceived position-learning time or other factors were 
most related to career-expectation perceptions. 


1 This research was carried out at the University 
of Michigan and was first presented in the senior 
author's doctoral dissertation, Managerial Learning 
and Socialization, University Microfilms, 1969. The 
author reiterates his earlier thanks to the managers 
who cooperated in this study. 

2 Requests for reprints should be sent to Michael 
L. Moore, School of Labor and Industrial Relations, 
South Kedzie Hall, Michigan State University, East 
Lansing, Michigan 48823. 
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EXPECTATIONS * 


EDWIN MILLER 
University of Michigan 


METHOD 


Moore’s sample provided data from 20 
agers above line-foreman levels represent 
functional areas in five plants in two d 
geographical regions of one manufactu 
consumer durables. 

The respondents to this study were all 
198 were married, 60.1% were less ti 
years of age and 83.7% were under 50 
(82.2%) had had education varying fron 
college to graduate work, that is, demog 
cally typical of managerial personnel in 
organizations. 

The perceived-learning time measure wës 
open-ended interval scale, Instructions wer: 


you began on the job)?” 

The manager’s estimate of the level of x 3 
agement expected to be achieved in his career 
was tapped by the item: “How high in the com- 
pany do you expect to be promoted in your 
career: (a) president, (b) vice-presidential level, 
(c) senior management, at the divisional man- 
agement level, (d) upper middle management, 
such as managers of large functional depart- i 
ments, (e) middle management, (f) lower middle 
management, such as supervisors of sections of 
functions, or (g) lower levels of management, 
including plant general foremen and supervisors 
of small functional units?” 

Using information on present position level, 
and the seyen levels of management given, man- 
agers were assigned to high- or low-career expec- 
tations groups. High-expectations group members 
(about 32%) were those at levels four or five 
who aspired to level three or higher; and those 
at levels six and seven who aspired to level four 
or higher. All others (about 68%) constituted 
the low expectation group. 

The criteria for setting up the high-career 


TABLE 1 


Ma AL Positions BY CAREER 
TATIONS [TEM LEVEL 
Career expectations 
| | Propor- 
tion of 
Level | salaried 
popu- 
lation 
Presider 1 N.A® 
Vice pre 2 N.A.* 
Senior r ement 3 .001 
Upper n management 4 008 
Middle zement 5 024 
Lower 1 management 6 04 
Lower | of management 7 15 
N. A. plicable. 
expectatic yup were derived from the firm’s 
manageria) Inning tables. Table 1 shows the 
approxima centage of salaried jobs at each 
final cares =xpectation level. Table 1 masks the 
fact that ich level two through five, there 
are opport es for three to four promotions, 
and thus „se levels do not represent equal 
career int 5 
Initial s sought to determine whether a 
difference existed in perceived position-learning 
time between high and low expectation groups. 


Then, a multiple regression analysis of education 
(four levels), age (six levels), and position- 
learning time was run against the dichotomous 
career expectations variable. Both age and edu- 
cation were included as a set since all study data 
were categorized for these variables. One cate- 
gory of each was not included in the regression 
equation to avoid linear dependency and, hence, 
this category becomes part of the constant term 
(Wonnacott & Wonnacott, 1970, p. 75). 


RESULTS 


Managers in the high expectations group 
(n=65) had a significantly shorter mean 
Perceived-position-learning time, 1.3 years, as 
compared to 1.72 years for the low expecta- 
tions group (n= 137; t= 2.83, p < 01). Since 
the expectation group assignment depends on the 
Manager’s present position and his expected 
future promotions, variables such as past per- 
formance may not be reflected in expectations. 
Further, it is possible that variables such as 
Present age and educational achievement level 


Snort Notes 


might be as highly related to career expectations 
as perceived position-learning time, 

If the manager sees his educational back- 
ground and current age as positive or negative 
factors in terms of his promotional possibilities, 
then these variables may predict career expecta- 
tions as well as perceived position-learning time. 
Further, if learning time, in a multiple regression 
equation did not turn out to be a significant 
predictor, then one could say that its variance 
was accounted for by other variables. These 
possibilities were treated in the analysis illus- 
trated in Table 2, which presents zero-order 
correlations between current managerial level, 
position-learning time, career expectations and 
various age categories and levels of educational 
achievement. Several trends and relationships can 
be observed. 

Education. As might be expected, managers 
with high career expectations tend to be better 
educated. For those respondents who had only 
completed high school or some college, significant 
negative relationships were found. For those who 
had a bachelor’s degree or had done some post- 
graduate work, the relationships were significant 
and positive. 

Education and learning time. Surprisingly, the 
respondent’s level of education accomplishment 
displayed an ambiguous relationship with respect 
to position-learning time. Only graduate educa- 
tion was inversely related to perceived learning 
time which could mean that effects on learning 
begin to be noted only when formal education 


TABLE 2 


CORRELATIONS BETWEEN BACKGROUND AND 
PERCEPTUAL AND ORGANIZATIONAL 


VARIABLES 
Correlations 
— 
Variable n Current Posi- | Career 
pe 1 ne z expec- 
geri; ea ni s 
level time tanos 
Current managerial level = 11 13 
Posttlon-learning time 11 —14 
Educational achievement 
High school education | 34 —.25** | —.02 = 2877 
Some college education | 61 01 08 52855 
Bachelor's degree 83 04 02 28 
Graduate education 25 21 —.157 Ze 
Age 
20-29 18 —.14* —.15* 125% 
3034 35 | —.05 10 Adee 
35-39 27 | —.02 —.08 —.07 
40-44 42 | —.13 —.05 —.08 
45-49 48 A 12 —.14* 
50-54 25 08 2308 —. 30 b 
Career expectations 33 —.14 
Note. n = 203, 
* <.05. 
wD <01. 
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reaches graduate level. Perhaps level of educa- 
_ tional accomplishment is too gross a variable 
when related to position-learning time, and, in 
l the future, more refined measures of education 
should be incorporated. 

Age. Age proved to be a meaningful variable 
Vith respect to career aspirations, For managers 
b ranging in age from 20 through 35, a relatively 
= strong positive relationship existed for the two 
~ age categories and career expectations. For man- 
agers 35 years or older, the relationship was 
increasingly negative. 

Thus, one can conclude that those individuals 
who are younger and college educated anticipate 
greater numbers of promotions and shorter 

periods of time spent in the steps up the 
* managerial ladder 


To examine these data more closely, a multiple 
regression was run with career expectations 
group membership as the dependent variable and 
position-learning time, age, and educational levels 
as shown in Table 2 as independent variables. 
The multiple correlation was .62 (F = 10.99, 
dj =11/191, $ < .01). However, only the two 
Jowest age levels had significant regression co- 
efficients. Age 20-29 was associated with high 
career expectations (t of beta = 3.07, p < .01) 
as well as age category 30-34 (t of beta = 3.96, 
b < 01). Thus it is likely that 

levels found in this study are — agh 
of age, and not position-learning time. 


Discussion 


pA fundamental avenue of inquiry centers on 
the determination of why younger, better edu- 
-cated managers have higher career expectations 
than their older colleagues. Was there something 
in the work environment, the experience of older 
managers, and the company reward system which 
_ tended to work in a socialization perspective to 
dampen the career expectations of those man- 
_ agers who were over 34 years old? Perhaps age 
35 represented that point in a manager's career 
when he “comes of age” and then more accu- 
rately assesses his career and the probabilities of 
accomplishing his goals. Or is it that after age 35, 
managers become cynical about life and their 
careers? Another possibility is that the orga- 
= nization’s policy in personnel decisions is to 
prefer younger men. 


Snort Nores 


In terms of path-goal motivation cx 
tions (Evans, 1970; Georgopoulous, M 
& Jones, 1957) it may be advantageou 
firm to develop means whereby performa: 
be shown to lead to faster or alternativ 
to promotions. Learning a broad range 
tions and responsibility areas may 
strengthened as a path to promotions. 
other hand, the older, less well-educated : 
who does not expect rapid promotions 
much more concerned with other types 
wards, It might be worthwhile for a firm 
out the perceived relationships betwee: per- 


formance and promotions for the manag: who 

is older and has not graduated from colle 
Societal and labor mar ctors m ilso 

have been factors in the expectat of 


younger managers in that 
World War II and experienced an afflu oci- 
ety and a strong demand for isi = n an- 
agerial talent throughout the 1 * 

question, considerable research using socia) on 
models (Brim & Wheeler, 1966; Brook & 
Gottlieb, 1964), must be launched into an iry 
of why managers’ career expectations char 
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DONALD J. SEARLS? 


Colorado State University 


A scale of specific work values, the 


chronically unemployed 
measureable way on W 
to assess work values. 
Industry experiences frustration when it at- 
ts to retain, in productive work, persons 
have been chronically unemployed. These 
rts might be enhanced by a better under- 
g of the work values of that group. A 
ew of literature indicates that studies of work 
falues are generally based on measurement of 
importance attached to jobs in general (e. g. 
1957, 1962); many of these investigations 
ed on the relationship of work values and 
e Protestant ethic (e.g, Blood, 1969; Wollack, 
dale, & Wijting, 1971). 
The primary purpose of this research was to 
velop a scale that would allow an examination 
Specific and commonly required on-the-job 
behaviors. The secondary objectives were to 
tudy the importance attached to those behaviors 
Supervisors, hourly wage employees, and eco- 
mically disadvantaged unemployed individuals 
, to examine sociocultural and personality 
acteristics of those three groups. 


METHOD 


Development of a scale of work values began 
interviews of supervisors and personnel 
agers of seven large companies representing 
ufacturing, retail, and service businesses. The 
his investigation was supported by the Us. 
ment of Labor (Contract No. 82-08-72-07) 
nd was conducted through the Colorado State 
University Organizations 
‘ aking sponsorship 
encouraged to exercise their own judgment 
: Therefore, points of view or opinions stated 
this article do not necessarily represent the offi- 
position or policy of the Department of Labor. 
Requests for reprints should be sent to Donald 
Searls, Colorado State University Experimental 
oratory, 50 West sth Avenue, Denver, Colorado 
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WORK VALUES OF THE CHRONICALLY UNEMPLOYED * 


R. W. MISKIMINS 


Colorado State University 


Work Requirements Rating Scale, was 
constructed and administered to supervisory, hourly wage, and chronically 
unemployed groups. The supervisory subjects were shown to differ from the 
other two groups in the high value they placed on basic work behaviors, The 
and worker groups were found not to differ in any 
ork values. These and other findings of the study 
suggest that the scale used in this study has considerable promise as a means 


93 


G. N. BRAUCHT 
University oj Denver 


interviews were directed so as to identify those 
specific work behaviors which were conducive to 
cither promotion or termination of hourly wage 
employees and resulted in an initial pool of 68 
such behaviors. This list was culled to minimize 
ambiguity and item overlap. The 47 items that 
were retained formed the content of the W. 
Requirements Rating Scale and were found to fit 
into five logical subscales: (a) Personal Habits 
(11 items)—for example, “The worker has a 
sloppy or unkempt appearance”; (b) Supervisor 
Relations (7 items)—for example, “The worker 
sometimes argues with his supervisor”; (¢) Em- 
ployee Relations (7 items)—for example, “The 
worker has been unable to make friends with 
co-workers”; (d) Company Relations (10 items) 
—for example, “The worker has occasionally 
taken home inexpensive company material”; and 
(e) Work Habits (12 items)—for example, Ihe 
worker has a higher rate of absenteeism than 
most employees.” 

Each item in the Work Requirements Rating 
Scale was in a five-point Likert-type format. The 
scale listed five subscale scores and a total sum 7 
of work values. Subjects were asked to rate the 
importance of each item in considering à line 
worker for promotion. In addition to this scale, 
all subjects were administered the Social Access 
Questionnaire, which measured components of 
the sociocultural environment and personality (in 
part from Jessor, Graves, Hanson, & Jessor, 
1968), previously shown to be predictive of vo- 
cational adjustment (Berry, Miskimins, & Keil, 
1970). Five aspects of the sociocultural environ- 
ment assessed by this instrument were: (a) op- 
portunity for deviance—the extent to which an 
individual has deviant alternatives to work avail- 
able; (b) exposure to deviant role models—the 


amount of exposure a person has had to non- 
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conforming persons and forces; (c) absence of 
sanction networks—the degree to which an indi- 
vidual lacks access to social agents capable of 
exerting pressures against deviant behavior; (d) 
religion the degree to which religion plays an 
important part in an individual's life; and (e) 
deviant behavior—the extent to which a person 
admits to engaging in deviant behavior. 

Five measures of personality assessed by the 
Social Access Questionnaire were: (a) delay of 
gratification—a person’s tendency toward im- 
mediate gratification; (b) alienation—the degree 
to which a person feels alienated from society’s 
mainstream: (c) internal-external control—the 
degree to which an individual feels that he is in 

control of his life; (d) time perspective—the 
“extent to which one plans for the future; and 
(e) attitudes toward deviance—an individual's 
attitudes toward many kinds of socially defined 
illegitimate behavior. 

Both the Work Requirements Rating Scale 
and the Social Access Questionnaire were ad- 
_ ministered to the following three subject groups: 
(a) first line supervisors (n = 23) employed at 
a large regional warehouse for over 10 years 
(presently with supervisory responsibility for 6 
or more employees); (b) hourly wage workers 
_ (n=52) employed at the same work setting for 
over two years; (c) chronically unemployed 
‘individuals (= 50) selected because of long 
histories of vocational failure and no recent 
“attempts to find employment. 


Le <<<“ = es 


RESULTS 


Each of the measures, previously described, 
_ were subjected to a one-way analysis of variance, 
with the supervisor (S), hourly wage worker 
(W), and unemployed (U) groups serving as the 
levels of the independent variable. When Fs 
reached statistical significance, ¢ tests were com- 
puted for all possible pairs (S-W, U-S, W-U). 
_  Supervisor-worker comparison. The supervisor 
ascribed considerably more importance to basic 
Work behaviors than did the line worker, and in 
fact, the average supervisor placed a numerically 
higher value on every one of the 47 items. The 
results for the five subscales and total were: 


Personal Habits (F = 4.61, p < .05; and S 
t= 4.06, p < .01); Supervisor Relations (F = 
6.22, p<.01; and S- t=4.57, p< 01); 
Employee Relations (F = 2.86, f = ns); Com- 
pany Relations (F = 4.76, p<.05 and S 
t= 4.03, p<.01); Work Habits (F = 6.66, 
p<.01 and SW #=4.72, p<.01); Total 
(F = 5.31, p < .01 and S-W t= 4.20, p < 01 
all Fs 2/120 df, and all ts 73 df). 


SHort Nores 


bai Gea taai 


The supervisory and hou 
not differ on any measur 
ground. 

Chronically unemployed-si 
son, The Work Requirem 
that supervisors placed a higher 
work requirements than chronically 
Statistically significant difference 
on four subscales and the total score 
previous section; 


Personal Habits U-S f = 3.24, p 
visor Relations U-S t= 4.00, p 
pany Relations (U-S t = 3.51, px 


C 

01); Wor 
Habits (U-S ¢ = 4.36, p < .01)y.Totah (U4 

t= 3.68, p < .01—d/ for all és a 
In regards to sociocultural background, h 
Supervisors were subject to more sanction ne 
works that prohibited deviant behaviors (F= 
8.92, 2/122 df, p < .001 and U-S t = 2.07, dj = 
71, p < .05) and also have less exposure to devi 
ant role models (F= 3.65, 2/122 df, p< 0 
and U-S t= 2.66, 71 df, p < .05). 
Chronically unemployed-worker comparison. A 
important finding of this study showed that tht 
chronically unemployed and successfully em 
ployed hourly wage subjects did not differ Of 
any factor of the Work Requirements Ratin 
Scale. They differed on one factor of the socio: 
cultural environment—sanction networks against 
deviant behaviors were more infrequent among 
the unemployed (see previous F; W-U t= 4.07 
df = 100, p <.01). 
In regards to personality variables, the mos 
striking differences were found between the un- 
employed and the other two groups. The unem- 
ployed felt more externally controlled (4 = 2.2 
df = 100, p Kos) and were less able to dela 
gratification (t= 4.02, df = 100, p<.01). 


015 
013 


Discussion 


The results of this study suggest that the scale 
constructed for the measurement of specific 
the-job work behaviors has considerable pron 
as a means to study work values. Although thi 
findings support the study’s validity and ini 
cate that future work with the Work Req 
ments Rating Scale is justified, further resea 
is necessary to clarify the influence of fact 
such as job tenure and increased responsibility 
on the development, maintenance, and cha 
of work values. 
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EFFECT OF SEX, RELIGION, AND ETHNICITY © 
OCCUPATIONAL STATUS PERCEPTION 


A. GEORGE GITTER, JULIE ALTAVELA, axo DAVID I. MOSTOFSSY 


Boston University 


A 2X3X3 design (subject’s sex and religion and job incumbent's ethni 
group) with 30 repeated measures (occupations) was used to investigate th 
perceived magnitude of (a) ascription (race, social class, and other accident 
of birth), (6) actual achievement (real performance), and (c) gilded achieve- 
ment (appearance of performance simulated by prevarication, “throwing the 
bull,” etc.) in 30 occupations. Results indicated that actual achievement was 
perceived to be far more important than either ascription or gilded achieve 
ment, and the incumbent’s ethnic group significantly influenced this percep- 
tion. No significant effects were reported for the subject’s sex or religion. 


Often an employee's or applicant’s perception 
of an occupation can affect the organizational 
milieu and cause additional problems of career 
identity, rationalized underproductivity, and 
interpersonal tensions, Analyses of the status 
components inherent in numerous occupations 
suggest much experimental research, little of 
which has in fact been pursued. 

A number of standard sociology texts (e.g., 
Broom & Selznick, 1963) discuss two different 
types of status; ascribed and achieved. Ascribed 
status is defined as a person’s position in a par- 
ticular system that results from characteristics 
or attributes over which he has no control, that 
is, family background, race, sex, etc. Achieved 
status, on the other hand, refers to the position 
a person holds within a system that results from 
characteristics or attributes over which he has 
control, that is, educational attainment, perform- 
ance of specified role tasks, etc. Thus, in the 
case of ascribed status one’s position within a 
particular system is independent of the incum- 
bent’s behavior, whereas in the case of achieved 
status the incumbent's position is dependent on 
behavior. 

Recently, an alternative conceptual framework 
has been proposed that, rather than categorizing 
status as either ascribed or achieved, views any 
given status as including three components of 
achievement: (a) ascribed, (b) actual, and (c) 
gilded (Gitter & Mostofsky, 1972). In that 
framework actual achievement was defined as 
actual performance of occupational role tasks, 
and gilded achievement as appearance of per- 
formance simulated by prevarication, such as 
“throwing the bull,” etc. 


1 Requests for reprints should be sent to David 
I. Mostofsky, Boston University, Department of 
Psychology, 64 Cummington Street, Boston, Massa- 
chusetts 02215. 


The present study investigated. th 
between sex of perceiver, religion 
and the ethnic character of the 
status incumbent on the perceive 
of these three components of occup: 
Because of the extreme competitio 
market, coupled with a focused inte on 
employability of blacks and women 
incorporated within a single design -ll 
variables of interest. 


METHOD 


Subjects were 350 white undergr 
Boston University. Table 1 shows the bre: 1 
by sex and religion. Subjects were randomli 
assigned to one of three groups followi hi 
design shown in Table 1. A list of 30 occupa 
was compiled for the study. This list was almos 
identical to the one used by Gitter and Moste, 
sky (1972) in a previous investigation of occi 
pational status. It included selected occupation 
statuses from Hatt and North’s (1947) classie 
“prestige of occupations” study. 

Included in the list were 11 pairs of occupa 
tions, with each pair representing a “superviso 
nonsupervisory” dyad. An example of such 
dyad is foreman of bricklayers-bricklayer; fore- 
man of bricklayers represents a supervisory 
sition, and bricklayer represents its nonsupe 
visory counterpart. These dyads were included 
investigate differences in perceived magnitude 
of status components between supervisory 
nonsupervisory levels within the same occupa 
tion, as” well as differences among different 
occupations. 3 

A self-administered questionnaire containing 4 
list of the 30 occupations was completed by each 
subject during regularly scheduled class Period 
For each occupation, the subject was aske! 
estimate the relative importance of the th 


a * i Bo ba ad SO Ue 
Snort Notes 
TABLE I 
: Exrcautentat Diwan 
Subject 
Black | Jewish i WASP 
— ee — 1 
It | 3 

vie Remale | Male Female | Male | 
in = 38) 60 wa | 


c| P * e| y fre] P | 


=) | 


y irc) P| y fre] e] y irc 


i 11 {15 


N Roman Catholic: P = Protestant; J = Jewish, WASP = White Anglo-Sax0e Protemant. 


stati ponents (ascribed, actual, and gilded 


achic nt); however, these three terms were 
repla the questionnaire by the terms family 
backe ond, work, and bull. Early pilot data 
showc ‘hat when particular instances which ade- 
quati ñned ascription, achievement, and gild- 
ing ed, che terms are more easily understood 
by n subjects and require less instruction or 
expli n when the more general construct 
label resented by itself. Subjects were fur- 
ther ructed to assume that the incumbents 


of al occupations, which they were about 


to ra: were of particular ethnic background. 
One hv dred and fourteen subjects were asked 
to assume that the members of the 30 occupa- 
tions, which they were to evaluate, were black 
(black target group); another 114 subjects were 


e that the occupational members were 
Jewish (Jewish target group), and an addi- 
tional 122 subjects were to think of “people in 
general,” (white Anglo-Saxon-Protestant target 
group). After the subjects completed the ques- 
tionnaire, they were asked to rate each of the 
Same 30 occupations (disregarding any con- 
sideration of ethnic group) on a status ladder 
of occupations in the United States. 


RESULTS ° 


An initial analysis of the two nonmanipulated 
Variables (sex and religion of the subject) 
showed them to be nonsignificant. The remain- 
Ing analyses were concerned with the subject’s 
estimate of the magnitude of a status compo- 
nent as affected by (a) ethnicity of occupational 
Status component (black, Jewish, white Anglo- 
Saxon Protestant), (b) type of status compo- 
nent (bull, family, work), and (c) type of 
occupation. 

Soa 


? Detailed tables of the individual ratings are 
wailable from the authors on request. 


2 | | 24 | 32 | 18 | 9 | 


| 2 37 | 9 | 16 | 15) n 
ee as: PES ae 


Differences between Target Groups 


The influence of ethnicity on status incumbent 
was examined in three separate sets of analyses 
in which bull, family, and work scores were 
independently analyzed. 

A simple analysis of variance among the tanget 
groups was performed for each of the 30 occur 
pations. The results indicated that ethnic group 
significantly influenced the perceived magnitude 
of the bull component for 19 of the 30 occupa- 
tions. The 11 occupations, in which ethnic group 
was nonsignificant, tended generally to be non- 
professional positions at or near the bottom of | 
the status ladder. a 

In addition to the 30 analyses of variance 
investigating the effect of ethnic group, a pooled 
analysis was performed, retaining ethnic group 
as the independent variable and each subject's 
bull scores that averaged over all 30 occupa- 
tions as the dependent variable. Results of the 
pooled analysis clearly supported other findings 
(F=1.58, df = 60/634, p < .01). Similar analy- 
ses were conducted for family and work scores 
in which ethnic group was significant in the 
analyses for 8 and 9 occupations, respectively. 

The effect of type of status component was 
tested by an analysis of variance with repeated 
measures; each subject’s ratings of each of the 
status components were used as three re- 
peated measures for each of the 30 occupa- 
tions. The results indicated significant difierences 
(F = 168.20, df = 29/1000 2, p<.001); actual x 
achievement had approximately three times as 
much to do with being a member of an occupa- 
tion as either gilding or ascribed characteristics 
(as seen in the overall means). The size of the 
gild (bull) component relative to that of the 
ascribed (family) component, varied with the 
type of occupation, such that the ascribed com- 
ponent was relatively smaller and the gild larger 


98 SHORT NOTES 


when the occupations were either management or 
sales oriented, and vice-versa when the occupa- 
; tions were professional. When the results of 
the analysis were repeated using each of the 
~ three target groups separately, the findings were 
virtually identical to those obtained when using 
the sample as a whole. 
p In addition, statistically significant differences 
between supervisor and nonsupervisor were ob- 
tained in perceived magnitude of the bull 
(F = 102.26, df= 1/348, p<.001), family 
(F = 18.04, df = 1/348, p<.001), and work 
= (F=15.01, dj = 1/348, p < .001) components. 
= Highest bull scores were associated with white 
Anglo-Saxon Protestant incumbents and lowest 
with blacks, with both supervisor (F = 9.00, 
d = 2/346, p<.001) and nonsupervisor (F = 
4.45, df = 2/346, p<.05). Ethnicity of target 
groups did not significantly influence the magni- 
tude of either the family or work components. 


Discussion 


Several implications can be derived from the 
results of this study. First, this study provides 
q an empirical validation of a theoretical schema 
that views status as consisting of a number of 
components. It also supports the general findings 
in the classic study of occupational prestige by 

_ Hatt and North (1947). Second, from the meth- 

= odological standpoint, the study represents an 
advance over the technique used in the Gitter 
and Mostofsky (1972) study, in which subjects 
rated the magnitude of all three components 
for each of the occupations. 


Substantively, the findings add to the W 
social psychology of occupational st Act 
achievement (work) emerges as a tor 
proximately three times as impor in 


incumbency of an occupation as either «scrip 
or gilding. In addition, the ethnic gro p of W 
incumbent was shown to influence rentia 
the amount of actual, ascribed, gildi 
that is attributed to him. 

Caution should be exercised in © king 4 
qualified generalizations from the resu ‘s of U 
study, since the findings, at least in att, m 
be a function of the nature of the ple e 
ployed, as in this case, a sample co -posed 
rather liberal minority-group consci stude! 
from a large urban university. Fut resear 
in this area might take this consi ion it 
account by sampling from other opulatio 
including nonstudent adults who are themseh 
incumbents of the occupation studied. 
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were factor analyzed. Factors for 


situations. 

The Descriptive Index measures job satis- 
faction five scales (work, pay, promotion, 
supery and co-workers) and is in frequent 
use (e. Blood, 1969; Evans, 1969; Hulin, 
1968, Hulin & Waters, 1971; Soliman, 
1970; ers & Waters, 1969). 

The ; poses of this study were: (a) to deter- 
mine zer the convergent and discriminant 
validity chat was reported by Smith, Kendall, 
and H (1969) for industrial workers would 
hold fo- civil service workers; (b) to compare 
the factor structure for blacks and whites; (e) 
to test for a general factor as predicted by 
Herzbers, Mausner, Peterson, and Capwell 
(1957) - (d) to determine item discriminability of 
the five scales for three different white-collar 
Samples of workers. 

In the study by Smith et al. (1969) the sub- 


jects were predominantly industrial workers, and 
nearly all were white. The subjects of this study 
were all of the employees of a civil service 
accounting division except for absentees (212 
whites, 107 blacks), and 110 randomly selected 
bank employees (nearly all white) with duties 
similar to those of the accounting division. For 
the litter sample the investigators predicted 
higher satisfaction because of superior working 
conditions and fringe benefits. Management em- 
ployees of the accounting division predicted 


— 
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POR STRUCTURE FOR BLACKS AND WHITES OF THE JOB 
DESCRIPITIVE INDEX AND ITS DISCRIMINATION 
OF JOB SATISFACTION 


PATRICIA CAIN SMITH, OLIN W. SMITH, axo JAMES ROLLO 
Bowling Green State University 


The Job Descriptive Index was administered to civil service whites (217), 
blacks (107), and predominantly white bank employees (110). The purposes 
were to extend convergent and discriminant validity, to compare factor struc- 
tures for blacks and whites, to search for a general factor, and to check the 
discriminability of items for three disparate groups. The 72 item scores 
rer five scales, work pay, promotion, super- 
vision, and co-workers were found for two or three 
tion of the supervision scale which split into two factors for all groups. Item 
means discriminated among the three groups as predicted from their known 
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groups with the excep- 


lower satisfaction for civil service blacks than 


whites because of fewer promotional alternatives. 


A principal components analysis of the 72 
items (scree test of Cattell, 1966) with vari- 


max rotations yielded seven factors for each — 


group of subjects, The work scale for civil ser- 


vice whites was split into descriptive and evalu- 


ative factors. (These factors require further 
investigation.) Loadings for pay, promotion and 
co-workers showed remarkable consistency." 

The supervision scale for all three samples 
was split into two factors which have been 
tentatively identified as quality of supervision 
(acts) and personal characteristics of the 

ision. 

Although there was no 
neither blacks nor whites, 
factors were found. 
the specific contents, and the balanced positive 


general factor for 
two unidentifiable 


The method of construction. 


and negative wording of the Job Descriptive 


Index may account for the absence of a general 
factor. The scales of the Job Descriptive Index 
did not exhaust those factors which may prove 
to be useful in quantitative studies of job satis- 
faction. The bias of this study, however, was 
that the understanding of job satisfaction can 
be achieved more economically by 


the use of 


multiple variables rather than one general factor. 


Factor loadings of the three groups were cor- 
related (a valid but crude measure of similarity 
according to R. B. Cattell, personal communica- 
tion), The median correlation along the diagonals 
(likenamed factors) was .75 with a range of 
49-88 (see Footnote 2). Off diagonal correla- 
tions were low. The correlations of the loadings 


2 Complete tables may be obtained from the senior 
author. 
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of the two supervision factors were low or near 


vero. With the exceptions of the split in the 
work scale for civil service whites and the splits 
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in the supervision scale, the scales remained 
stable for these samples of subjects and extended 
_ the results of Smith et al. (1969) for convergent 
and discriminant validity, 

Comparison of means of the items for the 
three groups showed all of the differences in 
total items to be significant (% < 001) by sign 
test. Bank employees were satified with more 
items than civil service whites as predicted by the 
investigators. Civil service whites exceeded civil 
service blacks as predicted by management* 


ConcLusions 


Identities of factors for blacks were the same 
as for one or more groups of whites. Identities 
of the pay, promotion, and co-worker factors 
were the same for all three groups as for the 
subjects of Smith et al. (1969), For civil service 
whites, two work factors resulted, the stability 
of which requires further study. The original 
Supervision factor was divided into two factors 
for all three new groups. The estimates of 
factor -· matching of six factors across the three 
groups ranged from medium to substantial with 
the median substantial. Neither a race factor 
for blacks nor a general factor was obtained. 
In terms of means for scale items, the Job 
_ Descriptive Index discriminated among three 
_ groups of subjects. Bank employees had the 
highest job satisfaction followed in order by 
civil service whites and blacks. The generality 
An overall F test shows the differences in scale 
means to be significant (p< .01). The analysis by 
items is presented as more meaningful to users of 

the Job Descriptive Index. 


= a 


of applicability of the Job Descript 

to both white and black clerical subje 

to be established. 
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ROLE AMBIGUITY, LOCUS OF CONTROL, AND 
WORK SATISFACTION 
DENNIS W. ORGAN * axo CHARLES N. GREENE 
Graduate School of Business, Indiana University 

The present study sought to extend the generality of previous based 

in Rotter’s internal-external locus of control aes 9 

tonii ae seg pe perceived ambiguity in the environment, and 

satisfaction. A sample oi tists and engineers responded to questionnal: . 

containing Rotter's measure of locus of control, 8 Fa — * 

and two measures of job satisfaction. Results indicated that locus of control N 

was related to both role ambiguity and satisfaction and that Jocus of control 

provided a greater independent contribution to satisfaction tban did role 

— Implications for interpreting scores on role ambiguity measures are 

noted, 

Am of recent studies have found that merely symptomatic of a more basic relationship 

— role ambiguity—the individual's between locus of control and satisfaction. 
teling uncertainty about his organizational This research sought, therefore, to answer the 
nl o tions and/or the means to fulfill following questions: (a) Do internals report less 
them— egatively and significantly related to role ambiguity than externals? (b) Do internals 
= FA expressed by the individual (Kahn, also greater work satisfaction than exter- 
Wolfe in, Snoek, & Rosenthal, 1964; Rizzo, nals? (c) If the answers to (a) and (6) are both 
oa Lirtzman, 1970). These findings and affirmative, is there a significant relationship be- 
their s: sorting theory imply that role ambi- tween locus of control and satisfaction when 
zuity i sive and, further, that it is a product controlling for the effects of role ambiguity? 
of the ~zanization and its environment (e.g, (d) Which variable—locus of control or role 
extent change or turbulence affecting the ambiguity—provides the greater independent 
organiz. ion, degree to which job behavior is contribution to variance in satisfaction? 


specific: by formal rules and policies, clarity and 
consistency of organizational communication). 
There is evidence from other areas of inquiry. 
however, that the possession of job-related and 
other types of instrumental information is 
related ío the belief system of the individual. 
From studies based on Rotter's internal-external 
locus of control measure, there is evidence that 
the type of person who actively seeks out infor- 
mation of an instrumental nature is one who 
tends to believe he can control his own out- 
comes, rewards, or fates (Rotter, 1966; Seeman, 
1963). For example, internals (persons who score 
highly in the internal direction on Rotter’s in- 
strument, and thus believe their outcomes are 
shaped by their own actions rather than luck or 
Powerful others) tend to be better informed 
about their occupations (Valecha, 1972). If in- 
ternals report less role ambiguity and also report 
greater work satisfaction, then the relationship 
between role ambiguity and satisfaction could be 
— 
1 Requests for reprints should be sent to Dennis 
$ Organ, Graduate School of Business, Indiana 
University, Bloomington, Indiana 47401. 
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METHOD AND PROCEDURE 


The sample consisted of 94 senior scientists 
and engineers employed in the research, develop- 
ment, and engineering divisions of a large manu- 
facturer of electronics equipment. components, 
and supplies. Each participant filled out a ques- 
tionnaire booklet which contained the following 
instruments: (a) an Il. item instrument devel- 
oped by Rizzo. House, and Lirtzman (1970). 
used to tap subjects“ perceptions of role ambi- 
guity; (b) Rotter’s (1966) Internal-External 
Locus of Control scale, which assesses the extent 
to which the respondent believes his outcomes 5 
in life are shaped by his own behavior rather 
than luck or powerful external agents; (c) an 
index of work satisfaction (satisfaction with the 
actual content of the job), consisting of fre 
items drawn from Stogdill’s (1965) job expecta- 
tion questionnaire; and (d) a measure of gen- 
eral job satisfaction (satisfaction with the orga- 
nization and its management), composed of 12 
questions drawn from Stogdill’s (1965) job 
description questionnaire. 7 

Approximately half of the questionnaire 
booklets were completed at the work site and 
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returned immediately. The remainder were 
returned by office mail within 10 days of 
administration. 


Resvtts AND Discussion 


Locus of control (scored in the direction of 
increasing perceived external locus of control) 
correlated significantly with role ambiguity 
(r= A, p< 01), work satisfaction (r = —.36, 
p<.01), and general job satisfaction (r= 
—,27, p <.01). Role ambiguity also significantly 
related to work satisfaction (r = —.30, p < .01), 
but not to general job satisfaction (r= —.14, 
f p > .10). It seems possible, then, that role ambi- 
= guity may elicit negative attitudes that are spe- 
cific to the task and work behavior, but do not 
> generalize to the individual's overall attitudes 
concerning the organization. 

i The partial correlation between locus of con- 

trol and work satisfaction—controlling for role 
q ambiguity—was also significant (r=.27, p< 

01). On the other hand, the partial correlation 
between role ambiguity and work satisfaction— 
controlling for locus of control—was not signifi- 
cant (r=.18, p> .10). Thus, it appears that 
locus of control provides a greater independent 
contribution to differences in work satisfaction 
than does role ambiguity. 

One further finding of interest emerged from 
_ a median split of the sample based on locus of 
control scores, For the subsample scoring in the 
direction of internal locus of control (n = 45), 
the correlation between role ambiguity and work 
satisfaction was significant (r = —.29, p < .01). 
However, among those scoring above the median 
on locus of control (ie. in the external direc- 
_ tion, = 49), the relationship was nonsignifi- 
cant (r=—.19, p>.10). This suggests that 
role ambiguity is aversive primarily to those who 
are frustrated in the active attempts to secure 
job-related information. 

The results, when coupled with Valecha's 
(1972) findings that internals have more knowl- 
edge of the world of work (as measured by a 
28-item occupational information test), support 
Rotter’s (1966) hypothesis that “the individual 
who has a strong belief that he can control his 
destiny is likely to be more alert to those aspects 
of the environment which provide useful infor- 
mation for his future behavior.” Thus, measures 
of role ambiguity bear an uncertain relation- 
ship to objective characteristics of the organiza- 
tional environment, since they were also tapping 
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the respondent's belief system and 
mental value he placed on informati 


instru. 


Furthermore, the results would se o Sup. 
port conclusions drawn by Korma 1971), 
Korman, whose study involved colles d uni 
versity students, found environmenta biguity 
to be far less important as a det int of 
satisfaction than the control aspect he en- 
vironment, that is, the extent to w! udents 
felt they were expected to exert control 


itrol as 
some- 
th his 
> that 
ssarily 


in student life. While the meaning 
defined and measured by Korman < 
what from Rotter’s conceptualizati: 
study and the present research i: 
organizational ambiguity may not 


have the negative properties that writers 
have attached to it. 7 
Obviously, any conclusions drawn ‘rom the 
present study must be tempered by ti- correla- 
tional and self-report characteristics a data. 


It is quite plausible, as Kahn et al. ( ) have 
implied, that role ambiguity is an ant dent of © 
(or at least influences) perceived locu: of con- 
trol; or that both variables are concomitant 
effects of age, tenure, or other independent 
variables 
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VASIVENESS OF THE RELATIONSHIP BETWEEN INTRINSIC- 
EXTRINSIC NEEDS AND PERSISTENCE AT WORK 


ELCHANAN I. MEIR * axo AZY BARAK 
Tel-Aviv University 
A total of 1,027 persons from 10 diferent occupations responded to inven- 


tories of intrinsic and extrinsic needs, While persistence at work was posi- 
tively correlated with intrinsic needs, no correlation was found with extrinsic 


needs, These findings were generally 


ous study (Meir, 1972) that used 

31 nurses, 20 policewomen, 30 s0- 
and 35 youth counselors as subjects, 
at work was positively correlated 
sic needs while no correlation was 
extrinsic needs. It was consequently 
i that these findings may also apply 
and perhaps also to men, engaged in 
pations. 


MetHop 


tionship between intrinsic and extrinsic 
the one hand and persistence at work 
zer was investigated in five more occu- 
teachers, special education teachers, 
ry workers, psychiatric team workers 
iding workers. Replication studies were 
ie of two of the five occupations already 
ted previously: nurses and youth coun- 
The research population by group and 
is given in Table 1. 
insic-extrinsic need inventories were con- 
structed separately for each occupation. Each 
‘inventory consisted of a list of 7-19 intrinsic 
needs and 8-17 extrinsic needs. 
The needs were classified by 4-24 psycholo- 
on which at least 
ag included. The 
‘scoring of the inventories has been described in 


In 15 groups the criterion for persistence at 
the total number of 


months elapsed since graduation. In two gro 


1 Requests for reprints should be sent to Elchanan 
I. Meir, Department of Psychology, Tel-Aviv Uni- 
versity, Ramat-Aviv, Israel. 


consistent across occupations, 
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work itself, the longer is his persistence in 
The hypothesis for the extrinsic needs was 
tailed: On the one hand, it could be assu 
that since these needs are satisfied through 
work there will be a positive relationship 
tween the extrinsic needs and persistence at 
work. Yet, on the other hand, since there existi 
the possibility that extrinsic needs might be 
satised even better in other occupations, 
relationship can be assumed to be negative. 


Resvtts 


Table 1 shows the correlations of intrinsic 
needs and extrinsic needs with persistence 


group the 
tween salary and 
hand, and difficulties 
cation to accept a job 
tional work because of 
on the other, may p 


general rule. 
Of the 18 correlations between extrinsic needs 


and persistence at work, 7 (39%) were positive, 
This finding by T 
all occupations together the parameter of th 
relationship between the two variables is close to 
zero. The correlations J 
separate and in the mixed groups indicated the 
same relationships as for the women. 7 
Tt was preferred to use several smaller samples 
of different occupations rather than one big 
sample of one occupation since in the latter case 
the results would be applicable only to the 
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i TABLE 1 
4 CORRELATIONS OF INTRINSIC AND Extrinsic NEEDS WITH PERSISTENCE AT WORK 
g BY ÔCCUPATION AND SEX 
4 — — — — 
4 | Subjects Correlation 
Occupation | : Corre 
S Intrinsic Extrinsic 
| No. Sex needs needs 
Dentists* 288 F 34% 13 
Nurses* 31 F 30* —.14 
Nurses 68 F Shas oe 
Policewomen* 20 F 73* 39 
Social workers“ 30 F 24 —.14 
Youth counselors* 12 F 46 —.28 
Youth counselors* 11 F 47 —.15 
3 Youth counselors* 12 F 40 19 
Youth counselors 141 M&F Ky xa —.08 
} Special education teachers 38 F —.53 —.80** 
Special education teachers 29 M&F 08 17 
Teachers 35 F 24 — 22 
Teachers 95 iy N fu 09 
Teachers 5i F. 47** 06 
Teachers 28 M 42* =.10 
Laboratory workers 25 F .29 —.18 
Psychiatric team 27 F ,42* .03 
Building workers 86 M 13 2 


* Nate. The hypothesis for intrinsic needs is one-tailed and 


_ * Reported in Mi 
a ir (1972). 
*p <.05, Ei a 
* <.01, 


ation engaged in that particular occupation 
and not in other occupations. Although the occu- 
pations investigated were not chosen at random 
from the world of occupations, the relationship 
ound seems to be a fairly general phenomenon. 
Tf social desirability affected the subjects’ re- 
Ponses, it would lower correlations between 
intrinsic needs and persistence at work because 


for extrinsic needs is two-tailed. Abbreviations: 


show whether an intrinsic needs inventory has 
predictive value for persistence at work. 
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parisons made on the 


The ; 
vestigat 
binatior 
oral ar 
tions 
written 
hypoth 
oral pre 
combin 
nating 
which | 
presente 
The 


‘ose of the present study was to in- 
» relative effectiveness of four com- 
different orders of presentations of 
itten information, The four combina- 
oral and oral, oral and written, 
oral, and written and written. It was 

d that the treatments which had an 

tation as the first presentation of the 

n would be more effective in dissemi- 

information than the treatments 
d a written presentation as the first 
on. 
evelopment of the research hypothesis 
was infiuenced by previous research (Day & 
Beach, 1950) which had indicated that: (a) 
meaningful, familiar material was presented more 
efficientiy orally; (b) the relative effectiveness 
of the visual presentation increased as the mate- 
rial became more difficult; and (c) the higher 
the ability level of the receiver, the greater the 
advantage of a visual presentation. Travers 
(1970), however, believed that the latter effect 
could be attributed to the fact that those who 
obtain ihe highest scores on such receiving tasks 
tend to be the best readers. 

McCormick’s (1964) finding that the use of 
two modalities was not systematically preferable 
to the use of one modality (where the amount 
of exposure was constant) i 
the order of presentations 0 
information. The simplicity and saliency of the 
messages used in this study and the fact that the 
subjects were disadvantaged also influenced the 


1Requests for reprints should be sent to M. M. 
ns Institute, 


Petty, Manpower and Industrial Relatio 
University of Alabama, P. O. Box 2737, Alabama 
35486. 


R! \TIVE EFFECTIVENESS OF FOUR COMBINATIONS OF ORAL 
\ND WRITTEN PRESENTATIONS OF JOB RELATED 
INFORMATION TO DISADVANTAGED TRAINEES 


M. M. PETTY ! 


Manpower and Industrial Relations Institute 
University of Alabama 


Relative effectiveness of four different combinations of oral and written 
yresentations of information was investigated. It was hypothesized that the 
treatments which had an oral presentation as the first presentation of the 
combination would be more effective in 
treatments which had a written presentation as 
criterion of effectiveness was & multiple choice test designed to measure the 
subjects’ knowledge of the disseminated information. 
basis of the research hypothesis were in the expected 
direction, but only three were statistically significant ($ < 05). 


l 


disseminating information than the 
the first presentation. The 


All eight a priori com- 


nature of the research hypotheses (ie, the 
predicted superiority of the oral presentation). 


METHOD 
Subjects 


The subjects were 73 trainees who were al- 
tending a training program for the disadvan- 
taged in Oak Ridge, Tennessee, The average age 
of the subjects was 22.3 years, 37% were school 
dropouts, and 49% were from minority groups. 
Six industrial skills (e. g. drafting, electronics} 
were taucht as well as remedial basic education. 


Treatments 


Since it would not have been practical to have 
simultaneous presentations in the combinations 
where only one modality was used, an interval 
of 24 hours was provided between all presenta- 
tions, Each of the four treatments was adminis- 
tered to four intact groups of subjects, that is, 
the experiment was divided into four phases with 
each group receiving a different treatment for 
each phase. The order of presentations of the 
treatments was counterbalanced to eliminate any 
systematic variance that might be due to @ 
practice effect. The information was disseminated 
by the subjects’ math instructors in order to 
maintain the normal atmosphere of the training 
program. | 

The first presentation of each treatment was 
given on a Monday while the second presentation 
was given the following day. A week after the 
first presentation of each treatment the subjects 
were tested on the material that was dissemi- 
nated during the previous week. All subjects 
were tested at approximately the same time in 
their respective classrooms. Immediately follow- 
ing the tests, the message of the next phase was 
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distributed through the assigned methods. Four 
weeks were required to disseminate the four 
messages and administer the appropriate tests. 

The oral portions of the treatments were 
Standardized by having the instructor of each 
class read the information word for word. In 
the written presentations, each subject was pro- 
~ vided with an individual copy of the material to 
dead in class, Different information was used for 
_ each phase of the study, but all subjects received 
the same information during any one phase. 


Messages 


Four different messages, one for each phase, 
were developed to be disseminated to the sub- 
jects. Care was taken to ensure that the messages 
© would provide the subjects with valuable in- 
_ formation that would be beneficial to them when 
they began applying for a job. 

. The messages were edited and approved by 
_ the reading teacher in order to ensure their read- 

ability. Also, an analysis of the reading difficulty 
in each message was performed with the aid of 
_ the Flesch Reading Difficulty Index (Flesch, 

1948). The results of this analysis indicated that 
three of the messages were rated fairly easy 
while the fourth one was rated easy, 


Dependent Variables 


The criteria were four multiple choice tests, 
one for each message. Each test consisted of 13 
questions with five alternatives for each ques- 
tion. The test questions were phrased in a 
straightforward manner and in language similar 
to that used in the original material. 


| 


R 


Covariate 


Nr 


Due to the nature of this field experiment, 
che use of intact groups was required to main- 
tain the natural environment of the training 
Program. It was realized that the classes may 
have differed on some variable that may have 
= confounded the experiment. The most obvious 
d ension on which they might 


: eral intelligence, Therefore, it was decided that 


RESULTS AND DISCUSSION 


The data for each message was analyzed with 
a2 * 2 factorial analysis of covariance (Barr & 
. Goodnight, 1972, p. 144). The two independent 
_ Variables were the first presentation and the 
Second presentation both with two levels, oral 


E written. The results of these analyses indi- 


2 


SHORT NOTES 


R 


cated that the regression of the 
the covariate was significant (p < r 
of the messages—sources of jobs, te 


] appli 
cations, and the interview (Fs 11.8 
and 11,22, respectively; and dfs- 3, 1/55 
and 1/52, respectively). The on nifican 
treatment effect was the first presen for th 
message on tests and applications = 5.49 
df =1/55, p<.01). 

Eight a priori comparisons of adj trea 
ment means were made. These mea ere ad 
justed for differences in general inte nee b 
partialing out the covariance betwee che 
terion scores and the GATB scores. “ll ej 


comparisons were in the expected dire b 
only three were significant (p < .05), th- is, | 
oral and oral treatment was more effec. ye 

the written and oral treatment in dis 


the message on the interview G=1.76, 1 S 

while the oral and written treatment more 
effective than the written and written ment 
in disseminating the messages on tests appli⸗ 
cations and social security (is 2.11 4 2.23, 


respectively, p C. 05). . 
The three statistically significant con arisons 
provide some support for the following j 
tion: Initially, the best way to distribx: 
tively easy, job-related information to 
vantaged trainees is an oral presentation. This 
finding appears to Support the conclusions of 
Day and Beach (1950) and Travers (1970). 
The primary implication of this study is that 
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ity retesting at midwinter and at the end of winter. 


> 
There has been a great deal of scientific inter- 
est in the exploration of the arctic and antarctic 


regicvs to gain a more detailed understanding of 
the et on which we live. A number of expedi- 
tions ny many nations, have been sent to Ant- 
arc > in the last 25 years for purposes of chart- 
ing lecting samples of minerals, animal and 
plan‘ life, and to make weather observations. 
Som’. hat incidental to the general purposes of 
thes: -xpeditions has been an interest in studying 
the ects of arctic exploration on human ad- 
jus cnt. Individuals who participate in these 
prolo sed expeditions to Antarctica are exposed 
to extreme climatic conditions and the threat 
of physical danger as well as to the psychological 
stress inherent in the long period of isolation 


characterized by sameness, boredom, and irrita- 
tions occurring from being housed in cramped 
quarter 

Solitary voyagers and early polar explorers 
Bernicot, 1953; Byrd, 1938; Gibson, 1953; 
Merrien, 1954; Slocum, 1918) have reported 
personality or experiential disorders such as delu- 
sional and hallucinatory experiences resulting 
from isolation, lack of sleep, starvation and fear 
of the elements. For obvious reasons it has not 
been possible to acquire reliable personality data 
Pertinent to evaluating psychological adaption 
under extreme environmental conditions. Most 
of the existing knowledge of these personality 
factors comes from anecdotal report. 

1 The authors wish to thank Marjorie Sullivan 
and Richard Weber for assistance in the data 
analysis. 

Requests for reprints should be sent to James N. 
Butcher, Department of Psychology, University of 
ae Elliott Hall, Minneapolis, Minnesota 

De 

2 The data for this study were collected by Dr. 
Ryan who served as station physician with the 
United States Navy. 


PERSONALITY STABILITY AND ADJUSTMENT 


Personality characteristics of a group of Antarctic explorers were examined 
and successive administrations of the Minnesota Multiphasic Personality In- 
ventory (MMPI) and Personality Research Form (PRF) were used to assess 
personality changes resulting from the wintering-in experience, Personality 
assessment suggests that the volunteers are generally better adjusted, more 
achievement oriented, more self-sufficient and serious minded than a control. 
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sample of normal males. No changes in personality profiles were found in 


In recent years, since the 1957 geophysi 
year expedition, it has been possible to obtain 
objective data relevant to personality and sod 
behavior on individuals who are isolated in 
ous size groups for long periods of time. These 
wintering expeditions are somewhat differen 
from the earlier explorations in that (at least the 
American sponsored expeditions) a greater de- 
gree of creature comfort has been possible since — 
the U.S, Armed Forces’ bases are relatively < 
permanent, thus making possible the storage o 
supplies and luxuries such as movies, whiskey, 
etc. Despite the fact that greater materialistic 
comfort is reasonably assured, the situation still” 
presents isolation, boredom, potential hazard and 
the knowledge that return to the safety of home 
is not possible for some time. 3 

A number of studies have examined problems 
in adjustment to the antarctic environment. Sev- 
eral investigators have reported a high incidence 
of sleep disturbance, depression, headaches, ir- 
ritability, and other minor emotional problems 
(Gunderson, 1963, 1968; Gunderson & Nelson, 
1963; Mullin, 1960; Nelson, 1965; Seymour & 
Gunderson, 1971). 

The danger, hardship, and cold are not serious 
sources of stress according to Mullin (1960). 
He found that more stress resulted from: (a) the 
individual’s adjustment to the group, (b) the 
sameness of the environment and, (c) the ab- 
sence of usual sources of emotional gratification. 
‘All of the available studies focusing on emotional 
adjustment of personnel wintering in Antarctica 
report instances of adjustment problems and 
stress. However, there have been #0 documented 
psychiatric breakdowns of psychotic proportions 7 
(Nelson, 1965). 

The present study is an investigation of th 
personality characteristics of a group of antarc 
tic volunteers who wintered-in during the 1970 
winter, It was the investigators’ hope to con: 
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tribute an objective personality study of well- 
functioning individuals living in a stressful envi- 
ronment to the scientific understanding of per- 
sonality. The objectives of this study were to 
provide an (a) objective description of the men 
who volunteered (and were selected) for this 
difficult duty and (b) an assessment of person- 
ality change, if any, which may have taken 
place during the long antarctic winter. 


METHOD 


The subjects were 15 volunteers from a group 
of 21 scientists and naval support personnel, who 
wintered in Antarctica at South Pole Station in 
1970. All subjects had undergone extensive 
screening which consisted of physical examina- 
tion and interviews with a psychologist and a 
psychiatrist prior to deployment. The South Pole 
Station, a small underground facility, housed 
around 20 personnel through the 8.5-month 
period of isolation. During this wintering-in 
period (the base was entirely self-sustaining), 
the only contact with the outside world was by 
short-wave radio. 

The Minnesota Multiphasic Personality Inven- 
_ tory (MMPI) and Personality Research Form 
_ (PRF) were administered to the volunteers (a) 
just after the station was closed for the winter 
~ (March, 1970), (b) at midwinter and, (c) just 
á prior to the reopening of the station at the end 
j of the antarctic winter (September 1970). The 
Station physician administered the tests (see 
' Footnote 2). The subjects were given the test 
materials and asked to complete the inventories 

and return them as soon as possible. They were 
requested not to discuss or collaborate with any- 
one in responding to the inventories, All mate- 

tials were returned within two days. 

. The control subjects (n= 30 college males) 

Were randomly selected from a group of 200 

normal college males who had been administered 
the MMPI and Personality Research Form on 

_ two occasions eight months apart.® 


RESULTS AND Discussion 


5 The first question of interest to this investiga- 
tion concerns measured Personality character- 
: istics of the antarctic volunteers: How do these 
individuals differ from people in general? The 
MMPI profile of antarctic volunteers was one 
which resembled group mean profiles of non- 
pathological groups or normal subjects (Lanyon, 
1968). To compare the volunteers with a sample 


These data were collected by James Butcher and 


Paul Mauger in Cooperation with Bethel College, 
St. Paul, Minnesota. 


Suort Notes 


of normals, their initial MMPI and i 


pro- 
files were compared with those of a Ip of 
male college subjects.“ When compared 1 this 
group of normals, the antarctic volunteer ppear 
better adjusted in the following ways: They 
showed significantly less anxiety as re ed in 
scales Psychasthenia and Anxiety (F 18, p 
<.001; F=15.2, p <.001, respectiv (b) 
Less bodily concern shown by lower s on 
scales Hypochondriasis and Hysteria 6.3, 
p <.05; F=5.9, p<.05, respectivi (c) 
They appeared to be more conventiona less 
alienated as shown by lower scores ales 
Psychopathic Deviate and Schizophre: F= 
4.4, p < .05; F=8.4, p <.01) than the inger 
college subjects. The greater elevation the 
Ego-Strength scale (F = 15.2, p < .001 own 
by the antarctic volunteers suggest that are 
perhaps more able to withstand stress tia the 


college males. These findings were consistent with 
the expectation that self-selection and ite pre- 
liminary psychiatric screening operate to elimi- 
nate individuals who have lesser adaptive capa- 


bilities. The generally more stable or bet ad- 
justed profile of the volunteers may be re d to 
the fact that the volunteers were somewhat older 


than the college males. a 

The personality differences be een the vol- 
unteers and the college males on the PRF were 
informative. In general, the volunteers tended 
to describe themselves in a more favorable way 
than did college males (Desirability scale score 
difference F = 8.8, $ < .01). This finding was 
consistent with the MMPI results previously re- 
ported in this study. The volunteers present 
themselves as more achievement oriented than 
the college males (Achievement scale score dif- 
ference, F = 9.7, p < .01). This was not a sur- 
prising finding since a number of the volunteers 
were scientific professional individuals who had 
achieved both academically and professionally. 
The result added to the construct validity of the 
PRF achievement scale. 

Two other PRF scale differences pointed to 
the more serious-minded and more self-sufficient 
attitudes of the volunteers. The volunteers pro- 
duced lower Play scores (Play scale score dif- 
ference, F=12.9, p<.001) than the college 
males. This suggested less of an easy-going atti- 
tude toward life; for example, less participation 
in games, sports, or doing things just for fun by 
the volunteers than by the college males. The 
volunteers were lower on the Succorance Scale 
(F = 4.6, p < .05) which showed that the college 


4A more complete report of the findings may be 
obtained by writing the first author, 
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requently sought sympathy, protec- 
ivice and reassurance of other people 
volunteers. 


Asse of Personality Change 


An were performed on MMPI and PRF 
profi hich were obtained in pre- and post- 


winte sting sessions, The stability of the 
MM id PRF for both the volunteers and 
the ze males was shown by the lack of 
signi differences in mean scores over an 
8-m period of time. Only one score was 
signi at the .05 level out of 76 compari- 
sons nding which was consistent with chance 
expe y. 

A nd analysis was performed to compare 
diffe scores between first and second MMPI 
and ` administrations for the volunteers and 
the ege males. For this analysis the squared 
diffe- ce between the first and second adminis- 
trai on the 16 MMPI scales and the 22 PRF 
scal vere used. In the 38 comparisons of mean 
diffe ce scores between the volunteers and the 
col subjects only one difference reached sig- 
niñ: e. This can best be interpreted as a chance 
find The results of this analysis showed that 
MM profiles of antarctic volunteers did not 


cha (in terms of elevation) as a result of the 
ng-in any more than a control group of 
> subjects who had not undergone the ex- 


Bon of the previous analyses focused on 
s in scale elevation and did not take into 
scale configuration which is often con- 
sidered more crucial to profile interpretation than 
elevation, In order to evaluate possible shifts in 
scale configuration, a third analysis was per- 
formed to assess possible changes in the profiles. 
The MMPI profiles of volunteers and control 
subjects were coded following the Welsh coding 
system. The first four code digits (the four most 
elevaied scales) for both the experimental and 
control subjects were examined and the initial 
and retest configurations compared. Each sub- 
ject was given a score value of one for each 
scale which shifted position out of the first four 
code-digit group of scales between the initial 
testing and retest. The results of this analysis 
indicated that scale configuration for volunteers 
and for college normals remained essentially the 
same. The volunteers all had only one or two 
code shifts with an average of 1.4 and the college 
students had between zero and four code shifts 
with most subjects having only one scale shift. 
The average code shift for the college subjects 


was 1.4 or identical to the average for U 
volunteers, 

Thus both elevation and configuration for 
volunteers and for the normal college men ge- 
mained fairly constant between initial ng 
and retesting some months later regardless 
the intervening events. Replication of these 
ings on another antarctic expedition could 
valuable. 


Midwinter Data 


The MMPI and PRF were administered to 
the volunteers at midwinter since previous e 
search had indicated that adjustment problem 
were most apparent at this point in the winter. 
No significant differences for any of the scales 
on the MMPI or PRF were obtained between 
the initial test and the midwinter retest, Al- 
though some of the volunteers experienced som 
symptoms of maladjustment such as headach 
depression, irritation, etc., these problems wes 
not sufficient to alter deviantly their MMPI 
profiles. 
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Using galvanic skin response (GSR) data from an earlier study by Orne and 
Thackray, it was shown that variance of response was a suitable discriminator 
between liars and truthtellers when an individual’s deception is detected on a 
single trial. Subjects who were liars differed from subjects who were truth- 
tellers on all trials, not just on the ones in which they tried to deceive the 
experimenter. The discrimination between liars and truthtellers was facilitated 
by Gruvaeus and Wainer's method of cluster analysis, 


The use of galvanic skin response (GSR) as a 
Physiological means of lie detection has been 
under investigation for an extended period of 
time (Gustafson & Orne, 1963, 1964, 1965a, 
1965b). One typical procedure has been to form 
two kinds of groups, liars and truthtellers, in 
which the liar group has been required to learn 
number of words, These learned words (desig- 
mated hot“) are then mixed in a presentation 
“Sequence with neutral words and presented to 
two groups together. All subjects were asked 
Teport whether or not each word presented 
one they were required to learn. The liar 
oup was told to respond with “No” for all 
words, and hence was requested to lie when a 
bot word was presented. The truthteller group, 
ol course, always responded with “No.” The 
“GSR of each individual was continuously re- 
corded. A more detailed description of this ex- 
imental procedure was found in Orne and 
hackray (1967). 
Two objectives were to determine which words 
ere hot and which subjects were liars. As has 
been pointed out previously (Orne & Thackray, 
1967), this is essentially a problem in signal 
tion, in which it is essential to isolate the 
(physiological response to hot words) 
from the noise (physiological response to neutral 


One quick and easy way to increase the signal 
0 noise ratio was to average across hot words. 
and Thackray (1967) carried out this 
od and found that liars may be discriminated 
rom truthtellers, even though there was con- 

able overlap of the two distributions. How- 
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ever, when averages were taken of the subjects 
as well as the words the overlap of liar 
truthteller groups decreased considerably, 
finding indicated that there were differences be- 
tween the two groups that were discernable in 
maximum deflection, but as Orne and Thackray 
point out, this is of little use for discrimination 
of individual subjects. 
The purpose of this study was to indicate 
another quality of the GSR curve other 
maximum deflection might be of some use, OP 
servation of many individual response curves 
indicated that subjects who lie “jiggle around” 
more than truthful ones and, thus, it seemed 
appropriate to characterize this jiggle by each 
individual's response variance around his own 
mean. When this was done for each word it 
was found that a comparison could be made 
between each subject’s jiggle with each other 
subject by computing a distance between every 
pair of subjects. This distance was a function of 4 
their variances. More precisely the distance be- 


tween subject i and subject j was defined as di, 
where 


dy = | log [s%,/s2,] | . 


The log of the variance ratio was taken be- 
cause the distribution of distances was typically 
very skewed and the log transformation placed 
it in a more sensible scale. The use of the abso- 
lute value of the log was to conform to the 
symmetry requirement of a metric (dy = du). 
A distance matrix between the subjects for each 
word was then computed and clustered using 
Gruvaeus and Wainer's (1972a, 1972b) tech- 
nique. This kind of cluster analysis yields or- 
dered hierarchical cluster solutions, Two sample 
results are shown in Figures 1 and 2 for a a 
“hot” and “cold” word respectively for 20 sub- 
jects. The numbers along the top are subject 


2.244 
3.307 
1.689 


8,040 


Ficure 1. Ordered hierarchical cluster analysis of intersubject distances from a hot“ 
word. (L and T represent the subjects who are liars and truthtellers.) 
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Ficure 2. Ordered hierarchical cluster an 
word. (L and T represent the sub. 


alysis of intersubject distances from a “cold” 
jects who are liars and truthtellers.) 
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numbers and the L and T represent whether the 
subject is lying or truthtelling, respectively. The 
numbers along the side are the cluster diameters. 
It is necessary to keep in mind that the order 
of the subjects across the figure is meaningful, 
that is, in Figure 1, for example, Subject 3 is 
more like Subject 11 than like Subject 20. This 
order implies that even though Subjects 3 and 
11 are in different clusters Subject 3 is more 

_ like the subjects in the liar cluster than any of 

6 the other subjects in his cluster. Note also that 

the ability of the procedure to discriminate be- 

tween lying and truthtelling subjects is quite 
good for both hot and cold words. The differences 
for the hot words seem to be somewhat larger. 
The two cluster solutions shown are quite typical 
for the 75 trials that have been analyzed in this 
way. 

Although it was not necessarily the purpose 
of this study to propose a cluster analysis of 
intersubject distances as a way to determine who 
was lying, such a procedure may be conceivable. 
Rather, this study has been an attempt to indi- 
cate that by examining variance of response, 
large differences between lying and truthtelling 

subjects appear without having to resort to ag- 

gregating of the words or subjects. The fact 
that these differences showed up on cold words 
might be artifactual, caused by Orne and Thack- 
ray's (1967) differential treatment of groups in 
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information. 


i the most important characteristics of 
Scientific investigation of the detection of 
on by psychophysiological means is the 
a of some fallibility of the detection 
Process as an empirical fact. Thus attempts can 
be made to reduce error associated with the 
process using methods that are generally known 
to be effective in reduction of error terms. It is 
clear by now that special state methods or magic 
psychophysiological indices have by them- 
Selves little promise for dramatic improvement 
in accuracy, although they may help in providing 
| additional information for the reduction of error 
terms. Three independent alternatives for the 

eduction of error terms in this area present 
‘themselves: (a) the simultaneous utilization and 
optimal combination of many psychophysiologi- 
tal indices (Kugelmass & Lieblich, 1968); (b) 
the use of larger alternative ensembles into which 
e critical items are inserted (Lieblich. Kugel- 
„& Ben Shakhar, 1970); (c) the use of a 
er interrogation which utilizes many critical 
s indicating the sought information (Lyk- 
1959; Ben Shakhar, Lieblich, & Kugelmass, 
0). The present study explores a fourth al- 
mative, which is a special case of (c), when 
one critical item may be used to detect 
Ivement with information: (d) the use of 
er interrogation with repetition of the same 
critical item. Preliminary work (Nevo, 
er that the latter alternative might be 
ible. 
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The present study explored the possibility of reducing errors in the detection 
galvanic skin response (GSR), s 

interrogation with repetitions of the same critical Ayi engate bira s 
volved two groups of subjects, one tested under instructions intended to 
arouse high motivation to beat the machine; the other was tested under low 
motivation to beat the machine, It was found that it was powible to reduce 
considerably errors in information detection through a simple combination of 
GSRs obtained from 10 repetitions of the same Interrogation sequence. Moti- 
vation did not produce any noticeable effect on the efficiency of detection of 


This study was intended to investigate the 
effects of repetition of series, in which the same 
critical item is embedded, on the probability 
correctly identifying these critical items, throug! 
the use of changes in galvanic skin response 
(GSRs). Since it was assumed that the repeti- 
tion of the interrogation sequences would produc 
boredom, and since Gustafson and Ome (1963) 
showed that low motivation may reduce detec- 
tion rates to chance level, it was decided also to” 
manipulate motivation of the subjects along the | 
lines of Gustafson and Orne (1963). 


Memo 
Subjects 
Fifty-eight students of the Hebrew University 
served as subjects in the reported study. T- 
Instruments 


The subjects were tested with a constant cur- 
rent (20 pA), GSR apparatus (Lieblich, 1966 
coupled with a Brush mark II recorder. Beckman: 
electrodes were used and were attached to 
volar side of the second and fourth fingers of the 
left hand, using Beckman electrode paste. 


Procedure 


When each subject arrived at the laborato 
he was asked his first name, An interrogati 
sequence was constructed comprised of five 
names of which one was the first name of the 
subject, After the experimenter seated the si 
ject and arranged the recording attachments, 
experimenter entered the adjacent control ro 
where he conducted the experiment, The interro- 
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gation sequences were read by an experimenter 
who did not know the subject’s name, using an 
intercom. The subject was observed through a 
one-way screen. Two minutes were allowed for 
stabilization before starting. The subjects were 
randomly assigned to one of two groups, a high- 
motivation or a low-motivation group. 

The instructions to the high- and low-motiva- 
tion groups were very similar to those used by 
Gustafson and Ome (1963). Following the in- 
structions the interrogation sequence began, The 
first name read was a buffer item followed by 
the sequence names. The questions were posed 
at ten second intervals. Ten repetitions of the 
names sequence were presented continuously. 
The order of the names was different for each 
repetition. In all, 51 names were presented to 

= each subject. 


5 RESULTS 


In order to evaluate the effects of the high- 
motivation group instructions the subjects were 
_ interviewed immediately after the experiment. 
Each subject was asked if he had used any spe- 
_ cific technique designed to beat the machine. 
Eight of the 28 high-motivation group subjects 
reported that they had not tried to use any 
specific means of interference with the poly- 
` graphic detection. For purposes of evaluating 
4 the effects of the motivation instructions, it 
seemed appropriate to analyze the results of the 
" high-motivation group, both including and elimi- 
nating the eight subjects. The subjects in the 
_ high-motivation group who reported the use of 
a specific technique designed to interfere with 
the detection procedure were called high moti- 
_ ational countermanipulators. 
3 Analysis of the GSR data obtained during each 
interrogation sequence was carried out to de- 
termine the name that yielded the highest con- 
dudctance change within five seconds after a name 
Presentation. A hit was scored whenever the 
conductance change to the actual name exceeded 
the maximum response to the alternative names 
in a given sequence. 
The correct name detection rates ranged from 
60 in the first repetition to .48 in the tenth repe- 
tition in the low-motivation group, from .55 to 
Fo in the high-motivation group, and from .60 
in the first repetition to .42 in the tenth repeti- 
tion in the high motivational countermanipu- 
Jators’ group. 

Although there were fluctuations in the pro- 
portion of correct name identification, with repe- 
titions, there was still a significant correct detec- 

tion rate (p <.05) in the tenth trial that used 
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omial disti 
ddition to 
reviously 
of the 


a normal approximation 
tion (n, 1/5) in all grou 
analysis, the same decisior 
tioned was applied to the me 
to the interrogation stimul ross repetition: 
adding one repetition at a time. The pri 
of correct name detection in the three expe 
mental groups ranged from .60 with one t 
tion to .96 using all ten repetitions in the L 
group, from .55 to .93 in the Hi group and 
60 in the first repetition of the high mo 
tional countermanipulators group to 1.00 
all ten repetitions. 


DISCUSSION 


The results of this experiment may be vi 
as support for the research that has atte 
to increase the efficiency of psychophysiologi 
detection through a simple statistical co) 
tions of the same interrogation sequence. 
best of the experimenters’ knowledge, this is th 
first successful use of such an approach in 
context of information detection. The repo 
finding has immediate applicatory implicatio 
(a) it simplifies considerably the task of obtain 
ing critical items for information detecti 
procedures in cases where these items are d 
cult to obtain, in addition, (b) it makes it ea 
to keep a few or one critical item from 
knowledge of the population of potential suspects 

As to the fluctuations of the correct detectiol 
rates with repetitions, while there did not se 
to be any particular trend from trial to trial 
the low-motivation group, there appeared to 
a tendency for the correct detection proportio ni 
to drop with further trials in the high-motivatio 
group, and in particular, in the case of the high 
motivation countermanipulators. This trend 
indicated by a significant (p C os) Spea 
rank order correlation of —.82 between detectio 
rate and repetition number for the high motiva- 
tional countermanipulators group, —.55 for 
high-motivation group, and a nonsignificant cor 
relation of —.28 for the low-motivation group 
However, a two-factor, Groups X Trials, 
peated measures analysis of variance conducte 
on the conductance change data (ratio of the 
conductance change to the actual name, and th r 
maximum conductance change to the alternati 
names) did not yield any significant effects. Th 
the possible interaction of the state of motiva 
tion and trials suggested by the differential pe 
relations could not be detected using the in 
vidual data, and the continuous measure und 
lying the detection process. In addition, th 
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tion r 
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Bedou 
presen 
degree 
pecte< 

instru 

used 
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see Li: 
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Sr 


ed no differences in the actual 
indices between the motivation 


of this experiment suggest that 
deration be given to the role of 
the detection of information. It 
independent measures of motiva- 

ied to evaluate the impact of instruc- 
subjects. This study applied such a 
exemplified in the identification 
motivational countermanipulations 
ntrast to Gustafson and Orne (1963) 
ed insignificant detection under low 
the experimenters of this laboratory 
never observed insignificant detec- 
except for certain groups of very 
dren (Lieblich, 1971), or uneducated 
Kugelmass & Lieblich, 1968). The 
zta did not indicate any difference in 
í detection that might have been ex- 
result from the ifically different 
is. Gustafson and Orne (1963) had 
is while this experiment used the sub- 
nes as critical items. Names have high 
» for a subject and are frequently used 


avironment (for discussion of these facets 
blich, 1969, 1971). In cases where such 


are presented external motivation may 


not be critical, It is to be expected that in gen- 
eral, detection procedures would use highly rele- 


vant stimuli such as names. 


codes, etc. In such 


cases, it could be expected that the external mo- 
tivational conditions would have limited effects. 
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RELATION OF PRICE TO PERCEPTION OF QUALITY OF Ni 


ARCH G. WOODSIDE * 


University of South Carolina 


Six groups of 12 construction workers were shown a new product, an ele 

lunch box, and provided with a printed advertisement that contained heati: 
instructions and price information. Five different prices were used: $4, $5, $ 
$8, and $10, with one group of subjects not receiving price information. The 
subjects individually inspected the product and completed (a) 11-point rating 
scales on advantage, use and quality and (b) a questionnaire selecting favor 
able and unfavorable words to describe the product. Results indicated that a 
linear regression model significantly fit the scaled responses for the treatments 


( < 01). 


Price has been indicated as a determining fac- 
tor of perceived product quality whenever the 
consumer has little previous experience with the 
product or the potential risk and uncertainty 
involved in its use are great (Leavitt, 1954; Sci- 
tovsky, 1944-1945; Shapiro, 1968). Gabor and 
Granger (1966) have hypothesized a nonlinear 
price-perceived-product-quality relationship which 
has been supported by empirical findings by Mc- 
Connell (1968) and Peterson (1970). However, 
two recent studies (Gardner, 1971; Rao, 1971) 
raise serious questions about any generalized 
price—quality relationship. These studies demon- 

strated that the salience of price in quality per- 
ceptions was negligible under conditions when 
other informational cues are available to con- 
sumers, for example, prior experience or con- 
Sumer test reports. 
Tt seems apparent that the value of price in 
_ Consumer's perceptions of quality may depend on 
the availability of additional product information. 
The Stafford-Enis ( 1969) hypothesis would be a 
limiting case difficult to dispute: “Tt appears 
obvious to us that if price data provide the only 
clue to product quality, then perceived quality 
should be expected to vary directly with price 
[p. 456].” 

The Gardner (1971) and Rao (1971) studies 
were performed with products that subjects could 
easily relate to prior purchase and use. Thus, the 
nonsignificant findings of price and perceived 
product quality might have been expected under 
such high-prior- information conditions. 

Peterson (1970) used a product for which 
consumers had a relative lack of information: a 
recently developed but unmarketed soft drink 


* Requests for reprints should be sent to Arch G. 
Woodside, College of Business Administration, Uni- 
versity of South Carolina, Columbia, South Carolina 
29208. 
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concentrate packaged in a pressurize: 
when mixed with water resulted in a zavel car- 
bonated soft drink (Gardner, 1971). Sabjects in 
the experiment estimated quality in part by com- 
paring the product with other soft d= 
they had purchased and used; however, 
prior experience with the product in 
was not possible. A significant price anc 
quality relationship was found by Peters 
Thus, the relationship jeen pric 
ceived product quality ma dend on the ¢ 
of prior information held by the consu 
paradigm shown in Table 1 sum 
argument and representative research findings. 
The present study examines the relationship 
between price and perceived quality for a product 
for which consumers would have a relative Jack 
of information. The selected product was hy- 
pothesized to be a “dynamically continuous” in: 
novation (Robertson, 1971) about which con- 
sumers would not have direct prior knowledge but 
would be prone to compare its benefits to some 
perceived standard. Thus, a significant, nonlinear 
relationship was hypothesized. 


METHOD 
Product 


The product used was a recently developed, 
though not yet marketed, electric lunch box that 
was made of reinforced plastic with a galvanized 
soleplate and a five-foot electric cord. 


Subjects 


The subjects were 72 construction workers liv- 


the product and were provided with little infor- 
mation as to its potential performance. 


Procedur 

The su were randomly assigned to one of 
six expe: ıl conditions. Each subject was 
independ called aside during his lunch break 
and req to participate in evaluating a new 
product product was displayed to the sub- 
jects, v ere given a printed advertisement 
with th owing headline: “hot box: electric, 
* lunch box warms your lunch on the 
job!” 

All i ation given to the subjects was con- 
tained e advertisement, with price being the 
only y on. Five different prices were used: 
KA, $5 38, and $10. (The actual market price 
was to 34.99 retail.) One group of 12 subjects 
served a control group and did not receive 
any pr information with their advertisement. 

Afte nspecting the product, each subject 
comp! a questionnaire by selecting 3 of 12 
words hich best describe the electric self- 
warm inch box to you.” The following words 
were :ded in the questionnaire: 

jeedless Quality 
rong Bad idea 
fad Poorly made 
reat Useful 

sood idea Unreliable 

Worthless Dependable 

The subjects also completed 11-point rating 
scales on relative advantage, ease of use, and 
quality perceived in the product. 


RESULTS 


A Lotal of 12 subjects were in each treatment 
condition, The subjects’ word choice responses 
were separated into favorable and unfavorable 
selections. Favorable words included strong, 
great, good idea, quality, useful, and dependable. 
The percentages of favorable responses were 
53, 78, 82, 89, and 90 for the $4, $5, $6, $8, and 
$10 treatments, respectively. A total of 71% of 
the word choices made by the control subjects 
were favorable, which is midway between the 
results of the price categories. This finding pro- 


vides credibility to the control procedures, since 
without knowledge of 


the median number of favorable words used by 
f which was given 


the full range of groups, each o 
4 different price for the product. 

The percentages of favorable and unfavorable 
responses were significantly different for the 
treatment groups using the chi-square test ($ < 
.01). The control group was not included in the 
analysis. 
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The mean perceived quality ratings were 4,83, 
5.33, 633, 8.67, and 7.33 for the $4 to $10 
treatment groups, respectively. (The larger the 
mean, the greater the perceived quality.) 

The calculated F = 3.43 (df = 4/59) for one- 
way analysis of variance was statistically signifi- 
cant ($ < .05). 

The Duncan multiple-range Test (Winer, 1962) 
was used to test for differences between indi- 
vidual treatment means. This analysis revealed 
that the mean quality ratings of subjects in the 
$4, $5, and $6 ranges were significantly less than 
the mean for the $8 treatment subjects (p < 05). 

An orthogonal test (Winer, 1962) was per 
formed to determine the appropriate form of the 
relationship. A linear regression model was found 
to fit the data significantly (p <.01), while a 
quadratic model was not significant. b 


Discussion 


The results partially support the hypothesis, 
A significant relationship was found between price 
and perceived product quality. The finding of a 
direct rather than a quadratic relationship may 
be due to limitations of the study, that is, an 
upper price limit may be near $10 for this prod- 
uct. Beyond the upper price limit, the product 
could be too expensive for quality perceived 
(Gabor & Granger, 1966). To test this hypothe- _ 
sis in the context of the present experiment, 812 
and $14 price treatments would have been useful. 
Any upper limit on price might also be a func- 
tion of the product’s market, for example, execu- 
tives or members of other higher income groups 
who carry their lunch to work may have a sig- 
nificantly higher price limit than the group of 
subjects studied. = 

The purpose of the present study has been to 
the conditions by which signifi- § 


examine 
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cant price- perceived · product - quality relationships 
_ would be found. The direct relationship found in 
the experiment is an extension of the Gabor and 
Granger (1966), McConnell (1968), and Peter- 
son (1970) studies. Given a novel product to 
evaluate and a relative lack of information, the 
consumer may be expected to use price as an 
indication of quality. However, when additional 
information cues are available, the relationship 
between price and perceived quality may dimin- 
ish. Within certain information limits, a gen- 
; eralized price-perceived-quality relationship may 
j be specified, 
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CONSUMER PROFILE ANALYSIS OF THE SELF-SERVICE 
GASOLINE CUSTOMER 


J. M. McCLURG 
Shell Oil of Canada 


Fifty drivers were interviewed at self-service gasoline stations and another 
fifty drivers were interviewed at nearby conventional service stations. Re- 
sponse differences between these two samples confirmed the predictions that 
the self-service customer would have an economic shopper orientation, that 
he would have the characteristics of people who are innovators in new 
product use, and that he would be less dogmatic in his thought processes. 
This last finding provided an interesting field replication of a multiproduct 


laboratory study by Jacoby. 


I e hypotheses about possible differences 
bet» n self-service and conventional service 
gaso e station customers were derived from a 
rey of the general literature. The first hy- 
pot’ is stated that because the self-service sta- 
tio ‘eatures lower price and minimized contact 
wW station personnel, the customers at these 
t. ns would be more like Stone’s (1954) 
pr- sensitive economic shopper and less like 
hi- clationships oriented personalizing shopper. 
nce the self-service station was a recent in- 
no tion in the market area surveyed, it was 
further hypothesized that customers at these 
ions would tend to possess the characteristics 
ociated with innovative product use. According 
to Rogers and Stanfield (1968) new product 
users tend to be better educated, more literate, 
have higher incomes and higher standards of 
living, have higher achievement aspirations for 
their children and are more knowledgeable about 
world and local events. 

The third hypothesis was taken from a labora- 
tory study by Jacoby (1971) who found that 
for many products, innovative use was in 
related to degree of dogmatism (Rokeach, 1960). 
On the basis of Jacoby’s results and the fact 
that self-service stations were new in the area 
surveyed, the investigators of this study pre- 
dicted that self-service customers would be less 
dogmatic in their thought processes than custo- 
mers who patronize conventional gasoline sta- 
tions. 


METHOD 


Fifty drivers were interviewed at five different 
self-service gasoline stations and 50 additional 


1 Requests for reprints should be sent to I. R. 
Andrews, Department of Economics and Commerce, 
Simon Fraser University, Burnaby, British Colum- 


bia, Canada. 
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drivers were interviewed at five different co 
tional gasoline stations chosen because of 
proximity to the self-service stations. The 
part of the interview included four highly struc 
tured questions and one open-ended question 
about the customer's gasoline purchase practic l 
Also included were two semantic-differential type 
items, one about the kind of person “who would 
buy gasoline here” and one about “attite 
toward new products.” 
At self-service stations only there were add 
tional questions which sampled customer opin 
about such things as the intercom between 
gasoline pumps and the cashier, knowledge ab 
how much money was saved at a self-se 
station, etc. 9 
The last part of the interview consisted ol 
questions about age, amount of education, inco 
level, etc, When the 15 to 30 minute inte 
was completed, the driver was given a ques 


sues.” This 0 
Form-E of Rokeach’s (1960) dogmatism se: 
Since this scale includes a variety of exotic items 
such as, “It is better to be a dead hero than a 
live coward.” the standard instructions e 
modified to include a consumer opinion phras 
Also the interviewer gave each subject a set 
steel steak knives in appreciation of his partici 
pation in the study. Lastly, the prestamped v 
turn envelope was addressed to a department of 
economics and commerce rather than to an oil 
company or to a department of psychology. 
These efforts produced a very high return 

73 out of 100. 


RESULTS 


The economic shopper orientation of the seli- 
service customer was verified by several pieces of 
data. For example, when the customers were 
asked in an open-ended question about why 
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they decided to buy gas at the station where they 
were interviewed, 28 of the self-service patrons 
mentioned price while only 8 mentioned good 
service. By contrast, none of the conventional 
Station patrons mentioned price and 21 men- 
tioned good service. 

When the self-service customers were asked 
to pick the 5 most important reasons out of a 
list of 12 possible reasons for buying gas at a 
particular service station, 23 of the self-service 
patrons assigned the number one rank to price 
and 11 more gave price a rank of two. Among 
the conventional station customers, pleasant staff 
received 16 first place rankings and 9 second 

place rankings. Two additional indications of the 

economic shopper orientation of the self-service 
customers were their much greater tendency to 
purchase gas at other kinds of discount stations 

and the fact that 48 out of 50 self-service patrons 

knew that they were buying a premium quality 
product at some discount in price. 

The hypothesized tendency for greater product 

_ innovativeness among the self-service customers 
_ Was examined with two semantic-differential type 
questions and with demographic data. The first 
f question asked the respondent to imagine the 
Find of person who would be apt to buy gas at 
the particular station where the interview was 
conducted. On four out of six semantic-differen- 
tial scales. the group means for self-service and 
conventional service customers were significantly 
different at the .05 level (df = 98, one-tailed). In 
their projected self-image, the self-service custo- 
mers imagined themselves as younger, bolder, 
more lively, and more modern. 
* The second semantic-differential type question 
_ —attitudes toward new products—produced sig- 
7 nificant group mean differences on five out of 
Seven scales (df=98, one-tailed), The self- 
service patrons were more apt to be early 
_ adopters, were more interested in new products, 
more carefree, more influential, and more ex- 
_ cited by new products. 

When the two samples were compared on 
demographic items the conventional service 
customers were more apt to own two cars and 
to drive a luxury car or a sports car, and they 
had a slightly higher income. The self-service 
patrons were younger, more apt to be students 
and somewhat better educated, The tendency for 
greater education in the innovative group was 
consistent with data reported by Rogers and 
Stanfield (1968) but the difference in income 
level and standard of living was not. Presumably 
this departure from previous results was pro- 
duced by the cost savings associated with the 
new product sampled in the present study, 
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SHORT Notes 


To test the third hypothesis a sepa: dog 
matism score was calculated for each of he 7% 
subjects who completed and returned th: ‘Cons 
sumer opinions on social issues” quest! naire 
(in reality, Form-E of Rokeach’s dor atism 
scale). As predicted the mean score for self- 
service customers (—15.85) indicated zroup 
tendency for low dogmatism. By cor the 
mean score for 32 conventional se re- 
spondents (3.37) was quite near the mid- 
point of the dogmatism scale. Include: mong” 
the 32 conventional service respondents e six 


persons who reported making only 20% 30% 
of their gas purchases at conventional : 
whereas the modal members of that grou; ma 
80% to 90% of their purchases at conve tio 

stations, When these atypical gas purchase: : Wen 


deleted, the mean dogmatism score for Ape 


tional service respondents was plus .67 mos 

exactly on the scale midpoint. Without dele- < 
tion of atypical gas purchasers from the ven- 
tional service group, the mean differ be- 
tween groups 5 et r= d, 
=71). With their deleti wever, the differ- 
ence in mean dogmatism scores for two 


groups was significant at th 
df = 65, two-tailed), 
An alternate nonparametric analysis v 
73 questionnaire respondents showed that 
proportion of subjects with dogmatism scores 
above the scale midpoint (zero) was quite 
different for the two groups—17 out of 32 for 
the conventional service customers versus only 
12 out of 41 for the self-service customers. This 
difference in proportions was highly significant 
(Z=2.08, p < .05). y 
Also confirming the personality difference 
between the two samples was the difference in 
the proportion of subjects who returned the 
dogmatism questionnaire—32 out of 50 conven- 
tional station interviewees versus 41 out of 50 
self-service interviewees (Z= 2.02, p < .05). 


Discusston 


As predicted from the cost-saving and deper 
sonalized nature of the self-service operation, the 
drivers who were sampled at self-service gasoline 
stations had a strong economic shopper orienta- — 
tion. Also, as predicted from the fact that self- 
service stations were still new in the region sur- 
veyed, the self-service drivers had several of the 
characteristics of people who are innovators in 
new product use. Lastly, the results provided 
field research confirmation of Jacoby’s (1971) s 
laboratory finding that new product users are 
less dogmatic in their thought processes than j 
persons who prefer established products. 
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MOTIVATIONS OF BLOOD DONORS AND NONDONORS 


ROBERT M. OSWALT? anp MARIE NAPOLIELLO 
N Skidmore College 
" 
4 


A survey of over 85% questionnaire returns from a random sample of two 
college populations indicated that 40% were blood donors (compared with 
nondonors. Motivations for donors were: humani 
À tarian, peer pressure, and curiosity experience. Motivations of nondonors 


fear of needle, general apprehension, and 
fear of aftereffects, Recommendations for blood procurement agencies are 


3% nationally) and 60% 
were: legitimate medical excuses, 


discussed. 


Population growth, advances in surgery and 
medicine, and expansion of hospital facilities 
have significantly increased the demand for whole 
human blood (American National Red Cross, 
1972). As a result of this increased blood usage, 
the American Association of Blood Banks (“The 
Blood Business,” 1972) has estimated that 8.5 
million units of blood are needed every year. 
Nonetheless, only 3% of the eligible population 
is now donating blood, whereas 4% is required 

to meet the need (American National Red 
Cross, 1972). Often the supply of blood Jags 
behind the demand because the source depends 
entirely on the voluntary donation of individuals. 
Although there is an increasing need for blood, 
7 information concerning the motivation of blood 
f donors is scarce. Most prior studies have investi- 
_ gated social, economic and demographic charac- 
" teristics of donors (Grace, 1957: Korzekwa, 
_ Jordan, & Alsever, 1960; London & Hemphill, 
_ 1965; Phillips, 1961), either compared the char- 
_ acteristics of first-time versus repeating donors 
(Condie & Maxwell, 19 70a), or studied the dif- 
ferences between voluntary and paid donors 


of the popula- 
tion of the United States does not volunteer to 
donate blood (American National Red Cross, 
: This study was supported in part by National 

ience Foundation Grant GY-9965. The authors 
wish to thank Robert P. 


with obtaining blood donors. 
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4 e, Saratoga Springs, New York 12866. 


1972), no study has ever been condu 
United States on the motivation of 
The purpose of this study was t 
motivations of a random sample of bl 
and nondonors. All of the previously me 
studies limited their subjects to those 
a previous commitment to donate bloo i 
mation about why people do or do not 
blood could be of practical significan 


in the 
Tonors. 
mine 


METHOD =“ 


Subjects 


Ten percent of the entire populations of two 
eastern liberal arts colleges were chosen for this 
survey by taking every tenth name in their stu- 
dent directories, beginning at a randomly chosen 


name. The sample consisted of 187 females and 
152 males. 


Procedure 


A questionnaire and return-addressed, stamped 
envelope was sent to each subject with a cover 
letter explaining the purpose of the survey 
and emphasizing its significance. Subjects were 
assured that all responses would be treated 
confidentially. 

The questionnaire: requested demographic data 
as well as specific data on the subject's age, 
whether or not the subject ever donated blood, 
geographic location of first blood donation, reas- 
ons for donating, the total number of times 
donated, the presence of any negative feelings 
before and after donating, whether the subject 
donated for remuneration or not, and self- 
opinions concerning the effect of incentives. 


RESULTS 


Eighty-seven percent of the subjects in the 
sample returned usable questionnaires and their 
responses indicated that approximately 40% of 
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Snour 


id donated blood while 60% had 
t that 40% of the subjects were 
s significantly from the American 
Cross estimate that only 3% of 


the sa 
not. TI 
donors 
Nationa 


the elig vational population are donors. 

Most iors in this survey (39%) have 
donate one time (although one third of 
the do: reported they would donate as often 
as eve ‘ree months), have donated at Red 
Cross i drives at college (899%), and have 
donate r no remuneration (979%). 

Dor eported the following as major reas- 
ons í mating blood: humanitarian reasons 
(66% er pressure (199%), convenience of 
college odmobile (18%), and curiosity in ex- 
perier the process of donating blood (139%). 
Prior donating they reported they were 
nerve 77%), afraid of the needle (379%), and 
concer | about feeling weak, faint, or dizzy 
(28% Subsequent to donating they were less 
fearfi but essentially afraid of the same 
aspec’ -nervousness (23%), needle (249%), and 
faintness (11%). 

The majority of the nondonors (60%) re- 
porte? ‘hat legitimate medical reasons (under- 
weigh hepatitis, mononucleosis, low blood pres- 
sure, .od the use of medication) prohibited 
them om donating blood. Other reasons which 
deter nondonors were general nervousness 
and isar (37%), fear of the needle (27%), 


inconvenience (19%), and apathy (16%). 

Fighiy percent of the donors and nondonors 
reported that incentives (eig., money, free re- 
placement blood if needed) would not increase 
or decrease their rate of giving, but cash pay- 
ments or income tax deductions would be the 
best incentives to encourage other people to 
donate. The Federal Government’s present pol- 
icy (Federal Tax Guide, 1973) which requires 
one to pay tax on money received for blood sold 
(property) and does not allow one to deduct 
the value of blood donated free (service rather 
than property) was seen as unreasonable by 
the majority of respondents. 


Discussi0N 


The results of this study suggest the following 
"recommendations to increase the donation of 
blood: 


1. Blood procurement agencies should contact 
More college students as perspective donors in 
order to satisfy the demand for blood. This 
study indicated that approximately 40% of the 
sample college populations donated blood com- 
pared to only 3% of the eligible national 
Population, If 40% of the approximately eight 


million college students (Chromicle of Higher 
Education, 1972) in the United States were t 
give only one pint of blood a year, over 
million pints could be obtained. This one 
(colleges) could provide almost the entire 
requirements of the Red Cross and one 
of the national needs, Yet 1971 figures indic 
that the Red Cross obtains less than 8% of it 
blood supply from colleges and universities 
(American National Red Cross, 1971). 
2. Blood procurement agencies should n 
the donation of blood more convenient 


Since 89% of the donors reported that their first” 
donations were at college bloodmobile units, 
convenient placement of bloodmobiles could in- 
fluence more individuals to donate blood. Very 
few students went to hospitals or bloodmobile 
centers to donate blood. 

3. Blood procurement agencies probably could 
obtain more donors if they made an effort to 
contact donors at some specified intervals. 
survey indicated that 34% of the donors would 
donate blood regularly every three months if 
called upon, and others would donate at 
intervals if contacted. Perhaps cards could be 
mailed at intervals agreed upon by the donors 
to remind them that they are eligible to give 
blood again. These cards could be patterned after 
the cards now sent to remind an individual that 
he is due for a dental appointment. In addition, 
individual appointments could be arranged in 
order to eliminate waiting and make the specified 
time more of an obligation. 

4. Blood procurement agencies might consider 
eliminating the emphasis on emergency situa- — 
tions as a means of motivation and begin to use 
financial incentives to encourage donations, — 
Emergencies, as incentive factors, received low — 
percentages from both donors (10%) and non- 
donors (159%). In fact, 80% of the respondents 
reported that the use of any incentive would 
not change their rate of donation. However, 
the other 20% could be motivated to donate - 
if monetary rewards were given either in cash 
or as income tax deductions. s 


These suggestions are, of course, based on the j 
results of the two colleges sampled for this z 
survey. Whether or not these results would gen- 
eralize to other colleges and universities is an 
empirical question which could and should be 
answered. ‘a 
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SOME FOLKWAYS OF POLITICAL LEADERS: 


THE RULES 


Temple 


voters before, during, and after 
eight specific folkways. The major 


voters increasingly, over the three ti 


l personal contact, while politicians remai 


paigning. It is, thus, suggested that 


folkways regardless of voter preference. 


understanding political campaigning 


Wha the rules, or folkways, used by 
politic s cues to social and policy positions 
and me ative strategies? While we know a 
good < ibout voter behavior and experience, 
we hav tle systematic knowledge of the cam- 
paign rience for the politician; thus, our 
unders ing of this important voter-politician 
linkag limited. 

Rer studies have attempted to describe 
folkwa of national and state legislators (Mat- 
thews 0; Wahlke, Eulau, Buchanan, & Ferg- 
uson, 2). Although there have been fewer 
systen e attempts to examine such folkways 
for the politician campaigning for office, the work 
of Levin (1962), Eldersveld (1966), and others 
have indicated that such folkways do exist and 


that they provide constraints on the behavior of 
politicians, Perhaps these constraints tend to pre- 
clude learning experiences during the campaign, 
and politicians’ performance of such folkway be- 
haviors may lend a ritualistic quality to cam- 
paigns. The present exploratory study (a) 
identifies some specific campaign folkways, (b) 
examines the extent to which the opinion of 
politicians regarding specific political folkways 
agrees with that of voters, and (c) determines 
possible changes in these opinions during a 
campaign. 


METHOD 


One hundred and eighty politicians from both 
major parties, active in local, state, and national 
politics, were requested to give political “sayings” 
they believed were applicable as the “do’s and 
don'ts” of political campaigning in four contrast- 
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| his study compares opinions about political folkways of political leaders and 
liticians and 141 voters were administered a questionnaire composed of 


l were substantially in agreement on seven out of the eight folkways, and (6) 


OF THE GAME 


University 


an urban mayoralty election, Twenty 
findings are (a) politicians and voters 


me periods, rejected the effectiveness of 
ned strongly in support of such cam- 
liticians adhere to specific campaign 
The significance of this trend in 
is discussed. 


po! 


ing states (California, Minnesota, Pennsylvania, 
and South Carolina). Their most frequent 
responses were then cross-tabulated into eight — 
specific campaign folkways and resubmitted to 
the original sample who unanimously (170 
answered) agreed that these were basic campaign 
rules. The second stage involved the presentation 
of these sayings, by questionnaire, to 20 poli- 
ticians and 208 voters; of these, 141 completed 
all three interviews before, during, and after 4 
Philadelphia mayoralty election. Politicians were 
asked, “Do these following statements guide your 
behavior?” Voters were asked, “Do the following 
sayings or actions by politicians attract your 
vote?” Controls for socioeconomic status differ- 
ences (income, education, occupation, race, and 
ethnic background) were applied to voters’ pêr- 
ceptions. Differences between samples were mea- 
sured by chi-square. 


RESULTS 


The results are summarized in Table 1. The pri- 
mary difference found between politicians and 
voters was on the folkway pertaining to effective- 
ness of personal contact. Differences between 
Samples at each stage of the campaign differed 
significantly (os level) on this issue. In contrast 
to previous findings in the social sciences (e.g. 
Cutright & Rossi, 1958), limited to contacts by 
committeemen, voters here rejected the rule per- 
taining to the effectiveness of personal contact— — 
“press the flesh” campaigning. Table 1 shows that 
prior to the campaign, voters marginally sup- 
ported this rule (53%), but over time a majority 
of the voters increasingly changed their opinion 
(48% and 40%), while the politicians were in 
higher agreement initially and remained relatively 
constant in their support at all three stages. 
Table 1 also suggests that politicians increasingly 
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Snort Notes 


TABLE 1 
POLITICIANS’ AND VOTERS’ AGREEMENT WITH CAMPAIGN FOLKWAYS Ar THREE STAGES 


Folkway 


hands, supermarkets, walking tours, and so forth.* 
People vote their pocketbook. 


against your opponent or his party is highly effective, 
on the swing vote. 
_ You can't beat somebody with nobody. 


q People vote against not for. 
You have to peak at the right time. 


Voters 
(% who agree) 


Politicians 
% who agree) 


A candidate has got to get out and mect the people—shaking 
A scandal, exposé, in short a bombshell during the campaign 
The majority of voters usually have made up their minds be- 


fore the campaign even begins, so you have to concentrate 


Stick to one or two issues in campaigning instead of several. 


Campaign stage Campaign stage 
| 7 y 
| So Tae 3 i 2 3 
| | | 
80 70 80 53 48 40 
70 80 80 91 92 97 


60 80 60 58 42 68 
80 80 70 94 90 94 
70 80 | 100 | 93 | 79 | 79 
70 | 80 70 63 11 3mm 
70 80 | 70 66 71 60 


Ss between samples for each of the three stages. 


~ agreed that it's better to stick to one or two 
= issues.” Voters also tended to agree more at all 
Stages that “you can’t beat somebody with no- 
_ body,” and “people vote their pocketbook.” There 
y no differences on any folkway between 
Voters on the basis of party, income, or ethnic- 
racial variables. The fact that more differences 
between politicians and voters were not found 
indicates some general saliency to these folkways 
and to the politician’s insights on general cam- 
paign strategies. 
he one issue on which there was substantial 
ge in the opinions of the politicians was in 
need to stick to one or two issues; politicians 
ingly agreed with this folkway as the cam- 
proceeded. In general, however, the cam- 
gn resulted in little change in politicians’ 
“agreement with these folkways. 
These results are at least consistent with 
earlier findings (McClosky, Hoffman, & O'Hara, 
1960) that politicians are groups of like-minded 
men. Smith (1958) found that when politicians 
were questioned as to whom they most com- 
municated with during a campaign for infor- 
mation, they cited fellow politicians as their first 
source, with their secondary source being family 
and friends. This social reinforcement from col- 
leagues may form the primary feedback linkages 


for fellow politicians (Dexter, 1963), with result- 
ing constraints on changes due to politician- 
voter interactions during a campaign. % 
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Forty laboratory groups, each consisting of three members, were arranged to 
discuss structured decision issues. Participation in only one of three decision- 
making phases (generation, evaluation, or choice) constituted partial participa- 
tion, in contrast to complete participation. Group members were committed or 
not committed to carrying out the decisions reached, Results indicated that 
perceived intragroup influence and satisfaction were greatest with complete 
With partial participation, influence and satisfaction were 
greatest in the choice phase. Although perceived influence was greater under 
commitment conditions, satisfaction was greater under noncommitment condi- 
tions. Decision riskiness was affected by neither treatment. 


participation. 


earch and theory in social and organi- 

za nal psychology have suggested that par- 
tit ation in group decision making enhances 
m: .oers’ influence and satisfaction. Further, 
decision making often elicits riskier 
co ses of action than does individual deci- 
si making, although individuals who are 
m committed to their decisions tend to 
me © more cautious decisions. These general 
fo: nations provided a base for the specific 
hy; theses tested in this study. 
up decision making, which has been a 

advocated means of sharing power 
rt, 1967; Tannenbaum, 1968; Wood, 
, has recently been conceptualized and 
studied as a multiphased process of varying 
activities in which group members may par- 
ticipate. Based on theoretical descriptions of 
organizational decision making and empirical 
assessments of group problem solving, Wood 
(1972a) formulated a three-phased process 
involving the generation, evaluation, and 
choice of alternative resolutions to specific 
decision issues. Wood’s research showed that 

This study is based on a portion of the first 
author’s dissertation submitted as partial fulfillment 
of the requirements for the PhD to the Ohio State 
University (Cooper, 1972). The contributions of 
Milton D. Hakel, S. Rains Wallace, and Robert F 
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the influence of members and the total group 
(leader and members) iniluence pie were 
greater when members participated through- 
out the process than when they participated — 
in only two phases. Results suggested further 
that members’ influence was weakest when 
they were unable to participate in the choice 
phase. : 
In the present study, participation was ma- 

nipulated within the context of the three- 
phase process. However, partial participation 
conditions were designed to permit member 
participation in only one phase, thus reflect- — 
ing commonly cited organizational participa- 
tion programs, for example, suggestion boxes. 7 
The participation conditions were defined as 
follows: (a) generation, evaluation and choice 
complete participation with group members 
able to participate in each of the three 
phases; (b) generation—members participate 
only in generating alternatives (a theoretical 
external group, analogous to upper-level or- 
ganizational decision-making groups, was re- 
sponsible for the evaluation and final choice 
of alternatives partially based on the alterna- : 
tives generated by the experimental group) ; : 
(c) evaluation—members participate only in 
evaluating alternatives, with the external 
group being responsible for generating alter- 
natives and later making final choices par- 
tially based on the experimental group's — 
evaluations; and (d) choice members par- 5 
ticipate in choosing alternatives based on the hi: 


5 á 2 
external group's generation and evaluation. ; 
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The following hypotheses concerning the 
effects of these participation variations on 
members’ influence, satisfaction, and the riski- 
ness of decisions were tested. 


1. Perceived member influence and satis- 
faction are greatest with complete participa- 
tion. When members have participated in 
only one phase of decision making, their 
influence and satisfaction are greatest when 
that phase involves final choice of alterna- 
tives, 

2. The riskiness of group decisions is great- 
est when members have participated through- 
out the process. Decision riskiness does not 
vary across forms of single-phase participa- 
tion. Previous research suggests that in- 
creased group interaction permits an in- 
creased diffusion of responsibility leading to 
a shift toward more risky decisions (Pruitt, 

1971). Since the generation, evaluation, and 
choice condition implies a greater interaction 
potential, compared to single-phase partici- 

_ pation, riskiness is greatest in the generation, 

evaluation, and choice condition. 


Commitment to decisions has, for the most 
part, been investigated as a dependent varia- 
ble. The studies that have used commitment 
as an independent variable have suggested 

that people produce more cautious decisions 
n they anticipate personal commitment 
to their choices (Dion, Miller, & Magnan, 
1971; Mann, 1971). Similarly, seriousness of 
decision consequences has been found nega- 
lively related to shifts toward risk, particu- 
larly when there are negative consequences 

associated with risky choices (Clark & Wil- 

lems, 1969; Johnson & Andrews, 1971). The 
_ following hypotheses were tested concerning 

effects of decision commitment on influence 
. satisfaction, and risk taking. A 
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3. Perceived member influence and satis- 
faction are greater when group members are 
committed to carry out the actions dictated 
by their decisions than when they are not so 
committed. 


4. Group decision riskiness is greater when 
members are not committed to carry out the 
actions dictated by their decisions than when 
they are committed. 


Micnart R. Cooper ann MICHAEL T. Woop 


METHOD 
Overview 


Effects of participation and cor 
sion making were investigated ir 
groups. Forty groups, with three 
discussed structured decision issues 
of varying participation (generation, 
choice; generation; evaluation; or ch 
and commitment (committed or 
Questionnaires were developed and a 
measure pretask perceptions of task ir 
posttask perceptions of participation, 
influence, and satisfaction. The princi; 
were tested with 4X2 analyses o 
questionnaire responses, 


in deci- 
boratory 
rs each, 
nditiong 
tion, and 
ditions) 
mitted). 
tered to 
ance and 
mitment, 
potheses 
ance in 


Participants 


Group members were 120 general sychology 
students at Ohio State University who ; »rticipated 
for required research credit. There were -7 females 
and 53 males, Their mean age was 18 us; 94% 
were freshmen or sophomores. Of the roups, 32 
Were sexually heterogeneous, 5 were a) nale, and 
3 were all male. Seventy-eight percen the peo- 
ple did not know either of their comenn -s prior to 
the experiment. Participants were rando assigned 
to treatment conditions, except that partici- 
pating for two hours instead of one bur were 
assigned to complete participation. The: 
group replications per condition. 


were five 


Decision Task 


The task focused on determining att 
ments to be included in a campus survey of student 
attitudes. Group members were instructed to se- 
lect, evaluate, and/or make final choices of state- 
ments for inclusion in the survey. They were also 
told that they would conduct the survey themselves 
in face-to-face interviews with students on campus 
(commitment conditions) or that other students 
would conduct the survey (noncommitment). This 
task was intended to represent organizational deci- 
sion tasks and to be relevant to student partici- 
pants. 

Determination of attitude topics and items to 
Comprise the task was accomplished by having 260 
undergraduate students generate four topics they 
felt should be posed to students in an attitude sur- 
vey and also having them generate 6 items designed 
to tap attitudes within each topic. They were asked 
to develop two “risky” items, two low-risk items, 
and two intermediate-risk items per topic. This 
Procedure yielded a pool of topics and items that 
was reduced to six topics with 12 items each. These 
72 items were then presented in random order to 4 
sample of 15 industrial-organizational psychology 
graduate students who scaled the items on a risk 
dimension. They were asked to evaluate the poten- 
tial riskiness of each item by indicating the prob- 
ability, on an 11-point scale, of a college student 
interviewer not wanting to personally confront 


another student with the attitude question. 
This deñ . of risk is similar to that of Belovicz 
and Fin 71) who view risk as a function of 
the desi and probability of decision conso- 
quences graduate students similarly evaluated 
the risl of the six topics without reference to 
specific ‘ 

The task consisted of items measuring atti- 
tudes t d sex (topic risk value = 6,2)*, abor- 
tions racism and discrimination (4.8), drugs 
(4.7), military draft and Vietnam (2.1), and 
ecolog „. There were 12 items per topic, and 
the i varied along the risk continuum. For 
exam; igh-, intermediate-, and low-risk ques- 
tions the topic of sex included (a) “Have you 
ever sexual relations with a member of the 
same as your own?” (risk value=9.6); (b) 
“Do believe that it is important to marry a 
virgi (4.3); (c) “Should sex-education courses 
be o ed in the public schools?” (.7). 

Pre and Dependent Measures 

00 the task, members completed a back- 
ground form including measures of char- 
ac stics and task importance, that is, “Do you 
bc ve that a major survey of student attitudes is 
or can be important and should be conducted? (yes 


or no)?” 
fter the decision-making session, members com- 
pitied a feedback form, consisting of (a) treatment 
manipulation checks, (b) perceived influence, and 
(©) satisfaction. The manipulation checks 
ci the following items to which members responded 
on g, point scales 
to “not at all true”: (a) participation in the gen- 
cration condition—“I was able to participate in the 
preliminary selection of items”; (b) participation 
in the evaluation condition—“I was able to actively 
discuss my opinions, choices, etc., with the other 
members of my group”; (c) in the 
choice condition—“I was able to play a direct part 
in the final choice of questions to be actually asked 
of students on campus”; (d) ved commit- 
ment—*I realize that I will actually 8 
measure 
sum of (100) hypothetical influence points among 
the three members of his group 


4 Items were also scaled for quality on a S-point 


scale of comparative goodness—poorness as an atti- 
tude item. A curvilinear risk-quality relationship 


was initially expected with high-quality items being 


of intermediate risk. However, in this scaling sam- 
risk and quality were strongly, inversely, and 


ple, 
linearly related (r=—51, p< 01). Moreover, 
treatment effects were essentially opposite for deci- 


sion riskiness and quality. Thus, decision quality is 
not dealt with as an independent criterion in this 
report. 

5 Risk values 
the mean item- 
equivalent. 


based on topic evaluations and on 
risk values for a given topic were 


— 


EFFECTS or MEMBER PARTICIPA’ — 9 


decision-making groups. Satitfaction wat 
by 12 items with point scales (completely 
dissatisfied, with reversed ane 
for hall of the Items). Items comcermed the dic 
outcomes, one's own role in the group, the de 
making procedure, intermember 
the decision task. Dees risk was determined 
the basis of the mean scale valucs of attitede 
included in the groups” selections, evaluathons, 


Treatment Manipulations 


item domain were specified a 
a modified “structured generation of alterna 
decision phase or selection. 
$ The other group did not exist. The experimenter 

served as coordinator throughout group sessions cre- 
ating artificial feedback, which the other group sup- 
posedly produced and Which was transmitted do 
members by the research assistants. This group was 
created to function as a perceived source of varying 
amounts of influence, in the manner of upper level 
decision-making groups that remain external 10 
most organizational members yet usually exert great 
influence in organizational decision making. 


E 
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5 cluded in the final survey. This other group is 
made up of graduate students who are more 

. qualified and trained in evaluating attitude 

l items and making final choices. However, your 

~ input is important because the other group 

p will evaluate and choose from those topics and 
items which you select. 


Instructions were similarly 
evaluation and choice groups. In each case, the 

= groups spent 20 minutes on the task. During the 
time that the hypothetical other group was process- 
ing their information, the members individually per- 
formed a filler task involving the specification of 
campus locations for the survey interview. The 
generation groups received feedback on the other 
group’s choices of two topics with three items each. 
J The evaluation groups were provided with Three 
Topics X Six Questions Presumably generated by the 
other group and (later) the other group's choices 
on their evaluations, They evaluated the items 

on 4-point scales of relevance, importance, and 
informativeness.? These groups began the evaluation 
immediately and performed the filler task during the 
other group's choice Phase. The choice groups spent 
first portion of their time on the filler task then 
“Made final choices (2 x 3) from a base of Six Top- 
ies & 12 Items generated and evaluated by the 
other group. The generation, evaluation, and choice 
groups spent 60 minutes conducting all three phases 
of the task. The research assistant was absent from 
the room during group deliberations. After 20 min- 
utes, he returned to collect decision results to con- 
vey to the other group (generation and evaluation), 
or to convey the other groups inputs to the ex- 
perimental group (choice), or to provide instruc- 
_ tions and forms for evaluation and choice to the 
‘generation, evaluation, and choice groups at the 
appropriate intervals, 


developed for the 


an 
slightly lower risk item. Th TN eee 
— depended, in fact, only Partially on 
generation. For the evaluation groups, 
and lowest risk topics were chosen, 


the group’s 
the highest 
and, for each 


With limited time, 
pected to evaluate all 
given the 18-item subset in topic-j 


they were 
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topic, the group’s most favorabi 


ted ii 
was included along with two ot that 
group had evaluated as slightly c on 
three favorability rating scales. 

Decision Risk Measure 

Risk scores were directly derived e ite 
risk values. For the generation » so 
were mean risk scale values for attit 
items generated by the group, For oice 
the generation, evaluation, and cho ups, the 
means were based on the 6 items che iring the 
final choice phase. For the evaluat nditions 
the risk means were based only on ti ems that 
were evaluated favorably, that is, item received 
a total positive rating across all thre orability 
scales. Unfavorable evaluations w ynsidered 


analogous to those items in the s tion and 
choice conditions that were, respectiv: 


not gen- 
erated or chosen. 

RESULTS 

Manipulation Effectiveness 
Both the participation and nitment 
manipulations were effective. P: Ucipation 
was perceived to have been greg, in the 
partial condition allowing participa:ion in the 
phase intended by the treatment and in com- 
plete participation than in the partial condi- 


tions allowing participation in eithcr of the 
other two phases, In other words, mean per- 
ceived participation in the choice phase was 
significantly greater for the choice (7.86) and 
generation, evaluation, and choice (8.56) 
groups than for the generation (3.96) or 
evaluation (4.07) groups (F = 36.71, df= 
3/112, p< 001) and similarly for mean 
Perceived participation in the generation 
phase (F = 11.27, p < .001) and for mean 
participation in the evaluation phase (F = 
12.83, p < 001). Perceived commitment was 
greater in commitment conditions (M = 7.75) 
than in noncommitment conditions (M= 
2-17, E = 210.50, df = 1/112, p < .001). No } 
Participation x Commitment interactions sig- 


nificantly affected perceived participation or 
commitment. 


Decision-M. aking Influence 


Results pertinent to the effects of partici- 
pation and commitment on influence are pre- 
sented in Table 1. Perceived influence of one- 
self was greatest with complete participation 
(F = 31.65, df = 3/112, p < 001, n? = 44). 


EFFECTS or MEMBER PARTICIPATION 


TABLE ! 


MEAN INFLUENCE, SATISFACTION, AXD Decision Risk ix Partyman 
K COMMITMENT EXPERIMENTAL CONDOMS 


* * 


AND COMMITMENT 


i Commit- Dependent measure 
uci- ment con- a ae om a= 
dition Others’ in External Total satis- 
ma) fluence influence l lac then 
Cc 2 37 =r . = 
tio a | 70 an 
p NC | S89 50.67 
* 0 | 4847 0 
NG 3650 7867 
€ 852 3.33 PAT 
NC 29.00 92.13 
1 c 87.07 
iplete NC = sass 


© = commitment; NC = noncommitment. 


F- ner, when members participated in only 
or phase, influence was greatest when that 
pi se involved final choices. Orthogonal com- 
p isons among means showed significant 
diierences for complete versus partial par- 
ticipation (p < .001) and for the choice phase 
versus the generation and the evaluation 
phases (p < 001). Influence did not differ 
between the generation and evaluation condi- 
tions, Influence was also significantly greater 
in the commitment conditions (F = 6.17, dj 
= 1/112, p<.05, * = 03). The overall 
commitment effect was attributable to the 
large difference in choice, which was the only 
significant difference within participation 
conditions (p < .05). 

The influence attributed by the individual 
to the other two group members and to ex- 
ternal groups corroborated this pattern of 
results. The influence of one’s fellow group 
members was significantly affected by par- 
ticipation (F = 32.43, dj = 3/112, p < 001, 
a? = 43) and by commitment (F = 9.79, 
dj= 1/112, p < 01, F = 04). As was the 
case for self-influence, others’ influence was 
greater in complete than in partial partici- 
pation conditions ($ < 001), and their in- 
fluence was also greater in the choice condi- 
tions than in the generation or evaluation 
conditions (p < .001). Commitment produced 
greater influence than noncommitment, espe- 


cially in choice conditions. The influence o 
groups was lowest in complete par- 
$4.53, df = 3/112, p < O01, 
ae ( 
14.05, df=1/112, p<.00l, „ = 0 
Within partial participation conditions, e 
ternal influence was lowest in the choice 


perceived 
self-influence and the sum of Member B and 
Member C’s attribution of influence to Mem- 
ber A was .73 (P <.01). Second, each indi- 
vidual's description of his own influence was . 
positively related to his assessment of the 
influence exercised by both of the other mem- 
bers (r = .68, p < 01). 


Satisfaction (sum of item ratings) was 
significantly affected by participation (F= 
4.99, dj = 3/112, p < 01, „ 11) and by 
commitment (R = 5.92, df =1/112; $ <S 
05, „ = 04). The directions of differences 
caused by participation were the same as 
those obtained for the influence variables 
(Table 1). Satisfaction was greater in com- 
plete than partial participation ($ < .02) and 
greater in the choice conditions than in the 


Satisjaction y 
1 
i 


generation or evaluation conditions (p < .01). 
Commitment, on the other hand, affected sat- 
isfaction and influence differently.“ Satisfac- 
tion was significantly greater with noncom- 
mitment to the decision than under commit- 
ment conditions. There was no interaction of 
this effect with participation treatments. The 
major commitment difference (p < .05) oc- 
curred in the choice conditions, where mem- 
bers were much more satisfied if they were 
not committed to implementing those choices, 
Satisfaction in the choice-commitment condi- 
tion was more similar to satisfaction in other 
partial participation conditions than to com- 
plete participation. In contrast, satisfaction 
in choice-noncommitment was more similar 
to satisfaction with complete participation, 
regardless of commitment, than to other forms 
of partial participation. A similar, but non- 
Significant, commitment difference occurred 
within generation conditions. 


Decision Risk 
There were no significant effects of either 
participation or commitment on decision risk. 
The means in Table 1 indicate that risk was 
t quite low across all conditions (overall M = 
2001, SD = 9s, from a possible range of 
1 -5-9.6). Interpretation of these results, which 
failed to support the hypotheses, requires 
bo additional bits of information. First, the 
effects appear to depend on perceptions of the 
importance of the decision task prior to 
group decision making; that is, pretask at- 
tributions of high importance to the task may 
represent a high degree of personal commit- 
ment to the conduct and outcomes of that 
task prior to experimental manipulations of 
commitment to implement the decision. This 
precommitment then tends to enhance the 
degree to which members strive for high- 
quality decisions. This situation occurred, as 
participants’ pretask ratings of task impor- 
tance were uniformly high (M=1.97, Sp = 


‘Perceived self-influen 
generally independent. 


tional moderators of 
Ships. 
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-18, where a mean of 2.00 indicates tha 
participants rated the task as impo 

Moreover, a posttask questionnaire it 
showed that on a 9-point scale of the deg 


quality criterion M = 7.02, SD = 1.70. § 
ond, if the groups were precommitted 
high-quality decisions on the present task 
emphasized the quality criterion, the risk 
their decisions is necessarily low, since qu 
ity and risk were inversely related, 


Discussion 


Group members’ participation in di 
phases of the decision-making process, 
their commitment to the decision, aff 
perceptions of influence and satisfa 
However, within the constraints of the hig 
perceived importance of the task, these m 
nipulations did not affect the level of risk 
group decision outcomes. 

Risk and quality criteria should be 
amined with varying tasks and research de 
signs. For example, with variance in pretas 
commitment, one could investigate changi 
levels of commitment as well as decision ri 
and quality that result irom group deci 
making. Further work is needed to 
findings on task-specific commitment to 
more general hypothesis that participa 
enhances commitment to organizational g val 
(Patchen, 1970). Subsequent studies 4 
group decision risk and quality might 
compare the effects of individual inputs t 
decision phases with those of a group proc 
since the interaction process may diffe 
tially affect outcomes of different p s 
(Vroom, Grant, & Cotton, 1969). Grou 
decision making may also enable disclaim 
of individual responsibility or accountabilit 
for decisions (Quey, 1971). However, 
may be different effects of individuals 
groups deciding issues that affect themsel 
other members of their task group, or 0 
people who have no role in decision m 
but for whom the decision makers bear 
social responsibility. É 

The hypothesized effects of participation 
decision phases on members’ influence : 
satisfaction were clearly supported. The 1n- 
fluence and satisfaction of group me 


Erricrs or Mesmer —— 


were 
thro: 
the c 
influe 
resu! 
dica 
seve 
197 
by 
cho 


est when members participated 
the decision process. Moreover, 
phase was found to yield greater 
han generation or evaluation. These 
roborate earlier evidence that ab- 
participation in the choice phase 
mits subordinates’ influence (Wood, 
Particularly, if the ideas generated 
up are rejected by others making the 
participation may generate hostility 
nation (Strauss, 1963). Participation 
ut the decision process, and particu- 
ı the final choice of alternatives, ap- 
‘itical to the sharing of power through 
decision making. When members had 
pa: ipated in the complete process, the 
tot group influence pie (Tannenbaum, 1965) 
w- largest. Moreover, members’ assessments 
oi eir own influence were positively related 
to iheir assessments of the influence of the 
© cr members of their group. Thus, the en- 
|. ved pie was shared among all group mem- 
_ rather than being attributable to the 
ater influence of one person at the expense 
others. 
It must be noted that the present groups 
were Jeaderless. Previous research has shown 
iat when groups begin the decision process 
with a legitimately defined leader, complete 
participation enhances the influence pie by 
increasing members’ influence without de- 
tracting from (but not necessarily increasing) 
the leader’s influence (Wood, 1972a). Further 
research on intact organizational groups over 
time, where leaders and subordinates are 
jointly affected by decision feedback, is 
needed to clarify the mechanisms of influ- 
ence sharing and its effects on organizational 
behavior. 
Participative t has been gen- 
erally shown to relate to the sharing of in- 
fluence across organizational levels and func- 
tions and to favorable organizational out- 
comes (Likert, 1967). The present research 
indicates that these relationships (and, more 
importantly, the effects of participation) de- 
pend on the specific manner in which the 
participative management approach is applied 
to the process of making organizational deci- 
sions. 
Group members’ commitment to the deci- 
sion task also affected influence and satisfac- 


ä 


ax ComerrMrnt 


tion, although this treatment accounted f 
much less variance in the dependent 
than did participation. Influence effects 
in the predicted direction where : 
generated greater influence. However, 
commitment led to greater satisfaction, capex 
cially when members participated only 
choosing alternatives generated and . 
ated by others. A post hoc Interpretation of 
this finding may be based on the assump 
of less pressure toward compliance in 
evaluative or analytic phase of decision - 
ing. In contrast to generation or cholce, U 

open group discussion and evaluation of 
ternatives, which the members have not p 
duced and do not have to adopt, may 
erate as much satisfaction when members are 
committed to implementing the decision as 
when they are not (Doyle, 1971). Future 
research should study additional situatio 
parameters that may affect, or interact 

commitment to affect, outcomes of g 
decision making. 
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‘RAINSTORMING PROCEDURE, GROUP SIZE, AND SEX 
AS DETERMINANTS OF THE PROBLEM-SOLVING 
EFFECTIVENESS OF GROUPS AND INDIVIDUALS 


THOMAS J. BOUCHARD, Je? JEAN BARSALOUX, ave GAIL DRAUDEN 
University of Minnesota 


A 2X2X2X2 analysis of variance was uwd to ames: (q) a mediied 
brainstorming procedure in which a physical environmental inf@urece war 
concordant with the task versus standard brainstorming, (6) sx of group 
(males versus females), (c) size of group (four persons versas seven periost), 


n a previous study Bouchard (1972a) 
© otrasted two group problem-solving pro- 
ures—standard brainstorming and a modi- 
ition of brainstorming which forced greater 
volvement of the participants. The modified 
ocedure made use of a tactic called “per- 
nal analogy” (cf. Gordon, 1961) which re- 
uired the participants to play the role of an 
bject or concept central to the problem being 
vorked on. This was called the synectics con- 
dition, The synectics treatment was shown to 
be more effective than was standard brain- 
storming on a variety of divergent thinking 
casks. The procedures used in that study, 
however, confounded active involvement with 
the task and object role playing or personal 
analogy. Further, for one of the tasks the 
experimenters also varied an important en- 
vironmental parameter in the synectics condi- 
tion. The task was the “blindness problem.” 
The problem required the subjects to come 
up with a list of consequences that would 
occur if suddenly everyone went blind. Dur- 
1 This investigation was entirely by 
Public Health Service Grant 1 Roli HD 5600-01 to 
the first author. The report was written while the 
first author was at the Oregon Research Institute 
supported by General Research Support Grant RR- 
05612 from the National Institutes of Health and 
Grant MH-12972 from the National Institute of 
Mental Health, United States Public Health Service. 
2 Requests for reprints should be sent to Thomas 
J. Bouchard, Jr., Elliott Hall, Department of Psy- 
chology, University of Minnesota, Minneapolis, 
Minnesota 55455. 


w ost- 
performed four-person, nominal groups, but there was so ise effect for real 
groups, Nominal groups outperformed real groupe. 


lights off during work 
lem constitutes what we call a Problem 
Environment. concordance. 

This experiment is an attempt to further 
investigate the factors discussed above and to 


group (real, nominal, individuals), and size 
of group. This study is restricted to testing — 
the hypothesis that Problem x Environment 
concordance alone is a sulficiently powerful “ 
treatment to generate an increase in perform- 
ance over normal problem-solving procedures. 
The treatment does not require either active 
involvement or role playing. The hypothesis: 
has a plausible basis in that there are nus 
merous anecdotal reports that creative dis- 
coveries have occurred when the problem 
solver put or found himself in the context of 
the problem. The classic example is that of 
Archimedes who is claimed to have discovered 
how to measure the volume of a gold crown — 
(by measuring its displacement of water) 
while sitting in a bathtub. Koestler (1954). 
and Krippner and Hughes (1970) cite addi- 
tional examples. 

The experiment is a four-way analysis of 
variance designed to assess the treatment 
effects on two different-sized groups (four- 
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TABLE 1 
MEANS AND STANDARD Deviations ror SIMPLE EFFECTS AND VARIOUS 


Main Evrects ror ToTaL NUMBER OF IDEAS 


Group size four 


| Group size seve 
Nominal | Real | Nominal 
Measure — — — — | C — 
Light Dark Light | Dark Light | Dark Ligt Dark 
— Ee — „ — 
| Fe- | F | Fe- Fe- F 
Mate | ES. | Mate | ES | Mate | Lie Se Ee | Mate] EC . male | Male | € te | Fe 
f 1 
i Simple effects 
4 Ng aTi 1 7 7 1 — á | ~ = a Gert 1 pE. 
} * 2845 | 357.0 |2490 | 217.0 | 140.0! 10% 144.0 113.0 | 415.0] 454.0 81 2640 1830 | 149 os | 1440 
' SD | 6S) 180 60) soj 40| 8 10.0) 22.0 68.0 toy SF] 8.0 | 160| 15 85| so 
ii | | 1 
x Light vs. dark 
= 252 2 . J 2 
Ex | 6 noa |" 2330 | 124.5 128.5 434.5 | 7 181.3 146.8 
4 SD 29. | 17.5 22.1 23.0 20.9 127.3 | 15.8 20.6 
— ö = 
Nominal vs, real 
— 2 = 2a E 
J 1 271.6 126.5 4111 4 
Sb 45.5 | 22.7 | 94.17 
and seven-person groups), groups of both followed by the forced-participation procedure 
Sexes, and real versus nominal groups. Treat- described below. The brainstorming proc dure pro- 
ment effects on individual problem solvers scribed criticism and emphasized free-wheciing, quan- 
f also assessed tity, and the combination and improvement of ideas. 
J : The forced-participation procedure required ideas to 
Meran be contributed by participants in sequence. Each 
Tee subject had to say “pass” if he had not to say. 
ect Subjects were also encouraged not to get bogged 


he subjects were male and female students from 
psychology course. They were re- 


_ Stuited to experiment by offering them points 
or expe Participation (required of all intro- 
ductory students). 
Design 
The experimental design, a 2 N 2 N 2 N 2 analysis 


Procedure 


One experimental unit 
each 


The first part 
cal to those used by 
r groups. These were 


are available directly from 


3 Complete instructions 
the author. 


down in long discussions about trivial points. 

When the instructions were over the experimenter 
returned and answered any questions. Subjects were 
then told “Here is the problem you will be working 
on.” They were then handed the problem. If the 
group was under the lights-on condition, the prob- 
lem read as follows: “What would be the conse- 
quences if suddenly everybody went blind? You 
will have thirty-five (35) minutes to work on the 
problem. The experimenter will tell you when to 
stop.” 

If the group was under the lights-off condition, 
the problem read as follows: 


What would be the consequences if suddenly 
everyone went blind? You will begin working 
on this problem in dim light. During the course 
of the task the experimenter will decrease the 
illumination until it is pitch black. If for Some 
reason you must have the light back on, just 
Say so and the experimenter will put it on. This 
will disrupt the experiment so please don’t ask 
unless it is necessary. You will have thirty-five 
(35) minutes to work on the problem. The 
experimenter will tell you when to stop. 


ae ee NT 


s were given time to read the problem 

d: “You will have thirty-five minutes 

it. I will tell you when the time i 
about 


HH 


ness groups 
th a 60-watt 
uring this time 
sible to see all 
expressions. 
imp was extremely dim and served 
lability to the situation. It 
wiy with a rheostat. 
ts could not see each other even a 
a. After the second five minutes, the 
ied off completely and the room 
itch black. 
| vidual brainstorming conditions. 
the instructions were identical to those used 
by uchard and Hare (1970) for individuals. 
re der of the instructions were essentially 


z HR 


u ifor the various group 


ring 
de dependent variable analyzed was the number 
different ideas generated. Previous work (Bou- 
ard, 1969, 1972b) has shown that the total num- 
of ideas is the most sensitive indicator of 
atment effects and correlates highly with quality 
ores. For the real groups, all 
ithin a group 
apes. The remaining 
fluency score for the group. 
a fluency score was created by counting 
overlapping ideas of four who 
alone in the appropriate Interjudge reli- 
ability of the deletion procedure is generally very 
high and only one judge was used (cf. Bouchard & 
Hare, 1970). 


REsULTS 


The means and standard deviations for sim- 
in Table 1. The analysis 


of variance is given in Table 2. All of the 


any interpretation of main effects. 


“Treatment (Light versus Dark) and Type of 
Group (Nominal versus Real) 

The significant Type X Treatment and 
Treatment X Sex interactions can be easily 


interpreted when the Significant Type * 
Treatment X Sex interaction is plotted.* The 


The reader can easily plot all interaction from 
the data presented in Table 1. 


PROBLEM-SOLVING — ee 
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TABLE? 
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logas Paoorcen U n Kuw 


AE SE 

Source l 45 us 
Sis (A) 1 SAMD 
Type (B) 1 AM Oh 
Treatment (O) 1 7701 
Sex (D) 1 aims 
AXB 1 % 
AXC i N20 
AXD | e f ama | 
BXC 1 7.601 
RxD | 1 10820 
CXD i ass 
AXBXC 111 
AXBXD 1 tops | 
AXCXD ET se 
BXCXD I teI uns 
AXBXCXD 1 40% | 
rey l 10 sess | 
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triple interaction (as well as the 
interaction) occurred because the real g 
were not responsive to any experimental 
nipulations, while the nominal 
The nominal groups were, however 
perior to the real group (341.4 
Focusing on the nominal group 
the Sex X Treatment interaction 
significant (F = 61.06, dj = 1/8, p< Ol) 
Women better than men 
the lights-on condition, while men performed 


7 


sligh 
condition to the lights-off condition for thë 
male nominal groups occurred only for } 
group with seven subjects and did not ear 
to be a meaningful treatment effect. The 
lights-off treatment depressed the perſormanc 
of the nominal groups relative to the lights- 
on treatment (310.4 versus 372.4; F = 27.14 
dj = 1/8, p < .01). Over all conditions, th 
lights-on treatment was superior to the lights. 
off treatment (255.13 versus 224.0; F= 
13.24, df = 1/16, p < 01). ; 


Group Size â 


The signiñcant Size X Type and Size X 
Sex interactions can be easily interpreted 


=a a ae Se 
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when the Size x Type * Sex interaction is 
plotted (see Footnote 4). The nominal groups 
were superior to the real groups, and nominal 
group performance increased with group size, 
especially for males, while real group per- 
formance were not. The Size X Sex interac- 
tion were highly significant when the nominal 
groups were analyzed alone (F = 23.34, dj = 
1/8, p < .01). 


Individual Performance 


In order to construct the nominal groups, 
_ 44 males and 44 females were run individu- 
ally, half of each sex with the lights on and 
the other half with the lights off. There were 
no treatment effects, sex differences, or inter- 
action effects, The male and female means 
under the light conditions were 97.00 and 
_ 100.05. Under the dark conditions, they were 
98.48 and 112.59, 


Discusston 


The dependent variable was shown to be 
"responsive to a variety of factors (sex, size 
“of group, type of group); thus, the signifi- 

“cant decrease in the performance of the nomi- 
groups under the lights-off treatment and 
“the failure to detect any treatment effect 
“under the individual conditions suggest that 
the Problem x Environment concordance hy- 
"pothesis as formulated in this study is false.® 
Creating a passive congruence between the 
task and the environment does not increase 
productivity. 
N 
z We have also failed to confirm the Problem 
; Environment concordance hypothesis in two previ- 


dus experiments. Technical rej rts i 
a Po! are available 
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T. J. Boucwarn, Jr., J. BARSALOUX, AND G. DRAUDEN 


The real group versus nominal group coj 
parison allowed a suggestion by Bouchai 
(1972b) to be tested, namely, that the force 
participation (used in this study) might li 
the performance of real groups to the 
of nominal groups. That suggestion 
clearly disconfirmed. The forced. particip 
tion effect found in the Bouchard (1972 
study must have been due to the specii 
nature of the groups used. They were all lo 
scorers on a personality factor called 
personal effectiveness. 

The Size X Type effects are a striking rep 
lication, using a new problem, new procedure 
and a longer working time, of results prev 
ously obtained by Bouchard and 
(1970). Increasing real group size adds 
little to performance, while increasing nomi 
nal group size increases performance dramati 
cally. 
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rior to the Hawthorne research (Roeth- 
erger & Dickson, 1939), job performance 
bably was the major dependent variable 
died by industrial researchers. During the 
man relations era following Hawthorne, the 
iew of work environments expanded to in- 
ude employee attitudes, especially job sat- 
faction. It was during this period that a 
ausal relationship between satisfaction and 
erformance was assumed, that is, high satis- 
‘action leading to high performance. 
Literature reviews in the middle 1950's 
(Brayfield & Crockett, 1955; Herzberg, 
Mausner, Peterson, & Capwell, 1957) cast 
doubt on both the form, strength and causal 
direction of the satisfaction and performance 
relationship. Although Brayfield and Crockett 
were more pessimistic than Herzberg et al., 
both reviews illuminated the conflicting re- 
sults of empirical research, The net effect of 
these reviews, especially Brayfield and 
Crockett's, resulted in what Schwab and 
Cummings (1970) have called “the develop- 
ment of uncertainty” or “satisfaction—?— 
performance.” Uncertainty was not the last 
phase in this research, however. A more re- 
cent view is that performance leads to satis- 
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Job satisfaction and performance data were gathered from about 50 newly 
hired female telephone operators after one and three months work experience 
This was not an experimental study, but tentative causal inferences were drawn 
from these longitudinal data by using a combination of crow-lagged and 
dynamic correlations. The overall relationship between satisfaction and per- 
formance was slightly positive, but the direction of causality was unclear 
When job satisfaction was split into extrinsic and intrinsic components, the 
data suggest that performance causes intrinsic satisfaction and that extrinsic 
satisfaction causes performance. Both the type of job and the fact that these 
were new hires were offered as alternative explanations for these results 


faction, whereby performance results from the 
joint effects of effort, ability, and role per 
ceptions (Porter & Lawler, 1968). 4 


Recent Literature on the Satisfaction and 
Performance Relationship S 
In the early 1970's alone both theoretical < 
statements (Locke, 1970; Sutermeister, 
1971) and empirical research (Cherrington, 
Reitz, & Scott, 1971; Siegel & Bowen, 1971; 
Slocum, 1970, 1971) have been published, 
attesting to continued interest in this topic. 
Locke (1970) has suggested that satis- — 
faction is primarily a result of performance 
and only indirectly a cause of performance, 


The strength of this relationship is said to 4 


on “, . the degree to which per- 
formance entails or leads to the attainment 


Of the individual's important job valeii 


(without neglecting his other important 


values) [1970, p. 485].” Locke found 


3 


that emotions (such as satisfaction) do not 


usually determine behavior, because job be- | 
havior results from an individuals own = 


task goals. 
Sutermeister (1971) has stated a cyclical” 


model of the satisfaction and performance 
relationship. Life style (“climber, conserver, E 


or slider”) is viewed as an important determi- — 


nant of level of aspiration, which, in tum, 
affects individual effort in the work situa- 
tion. The cyclical model states that satisfac- 
tion and performance are causes of each 
other, however the closer (and presumably 
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Stronger) linkage runs from performance to 
Satisfaction. Thus, the theoretical positions 
of Porter and Lawler, Sutermeister, and 
Locke are all in agreement, that is, per- 
formance is a much stronger cause of satis- 
faetion than the reverse. This view seems to 
prevail today, 
Several recent empirical studies have also 
been published. In two articles, using the 
_ same sample of about 200 lower and middle 
level supervisors, Slocum (1970, 1971) ana- 
_ lyzed this relationship using concurrent (not 
predictive) correlations. Slocum’s first study 
= focused on satisfaction/performance correla- 
tions for each different need level. This was 
] a test of the view that performance and 
Satisfaction were more closely related for 
higher than lower need levels. A significantly 
. higher correlation was found for self- 
a ctualization needs than for either security 
P or esteem needs, similar to the findings of 
an earlier study by Lawler and Porter 
(4967). In the second article Slocum’s study 
focused on satisfaction/performance correla- 
tions for different managerial levels. Per- 
formance and satisfaction were found to be 
more highly correlated for upper as opposed 
to lower level managers. 

Siegel and Bowen (1971) gathered longi- 
tudinal data, permitting an analysis of both 
~ concurrent and predictive relationships. Their 
sample was 86 M. B. A. students from a 
classroom situation where performance and 
_ Satisfaction measures were based on course 


4 work. They found strongest support for 
performance as the cause of satisfaction. 
= 


Cherrington, Reitz, and Scott (1971) con- 
ducted an experiment in which student sub- 
E jects were either randomly, appropriately, or 
inappropriately rewarded for performance. 
Depending on the situation an individual was 
in, the satisfaction/performance correlations 
were either zero (random rewards), positive 
- (contingent rewards appropriate for per- 
_ formance), or negative (contingent rewards 
inappropriate for performance). 


Le Present Research 


Although many studies have focused on 
4 this topic, the present effort appears to be 
different. First, multiple measures of both 


satisfaction and performance were gathered 
S 


Jonn P. 
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and combined in composite form. Twi ely 
accepted job satisfaction scales wer ed. 
Second, both cross-lagged and dynamic or- 
relations were computed. Third, job s ic- 
tion was split into extrinsic and in sic 


composites and each was separately cor ed 
with performance. 


METHOD 

This research was conducted in an caste le- 
phone company, using a sample of about 80 wly 
hired female operators who volunteered. Qi N- 
naires designed to measure a number of at les 
toward the job were administered after on and 
three months working experience; only t} ob 
satisfaction attitudes are reported in this arti 

Three measures of job satisfaction were ga d: 
(a) the Job Descriptive Index (JD, the 
Minnesota Satisfaction Questionnaire, Short m 
(MSQ), and (c) two items designed to : ire 
general satisfaction. One of these direct glob ns 
asked for satisfaction to be rated directly a 
5-point Likert-type scale. The other item ed 
subjects to rate on a 5-point scale how the c ĉ- 


teristic “being very satisfied” was for them at . 
The correlation between these two items was 
one month and .80 at three months. Altho ch 
Wanous and Lawler (1972) have shown that oh 
satisfaction has been measured in nine dific nt 
ways, only well known, direct measures were d 
in this study. 

The MSQ is used in two ways, to measure ovcr- 
all satisfaction and to measure intrinsic versus 
extrinsic satisfaction. Previous researchers (Weiss, 
Dawis, England, & Lofquist, 1967) intended that 
the sum of all 20 items reflect global job satisfaction 
and used factor analysis to derive extrinsic and 
intrinsic factors. The JDI is also used to measure 
overall job satisfaction, although this was not its 
intent as originally designed (Smith, Kendall, & 
Hulin, 1969). However, the sum of all five scales 
of the JDI has been used at least three times prior 
to this research (Ewen, 1967; Hulin, 1968; Mikes 
& Hulin, 1968), and in the present case correlated 
-71 at one month and .72 at three months with the 
MSQ total. Thus, it is felt that the JDI total does 
have meaning and can be used as a global measure 
of satisfaction. To measure extrinsic versus intrinsic 
satsisfaction, the JDI was split into its Work and 
Promotions scales versus Pay, Co-workers, and 
Supervision, consistent with Herzberg (1959). 

Corresponding to the multiple measurement of job 
satisfaction, several measures of job performance 
were gathered at one and three month intervals: 
supervisory ratings of employee quality and quantity 
of performance, as well as company indexes of 
quality and quantity of performance. 

Because multiple measures of both satisfaction and 
performance were obtained, it was decided to com- 
bine them into composites for simplified analysis. 
To form a composite of overall job satisfaction, the 


nd global items were each first treated 
cales. Only the totals of each scale were 
e of widely differing variances for cach 
scale, each was standardized before being 


A similar procedure was followed for 
supervisory ratings with company indexes 
composite of overall job performance. 

Fe ng analysis of the overall satisfaction and 
ce relationship, satisfaction was split into 
«c and intrinsic components. The perform- 


lobal items were dropped. Second, the JDI 
‘SQ were split into extrinsic and intrinsic 
vents, as described previously in the study. 
omponent was then standardized before com- 
the JDI and MSQ into intrinsic and extrinsic 
nents. 

‘ta analysis involved two types of correlational 

dures: (a) cross-lagged (eg, see Lawler, 

and (b) dynamic (Vroom, 1966), Although 
lity cannot be proven by the combination of 
œ two procedures, possible causal relationships 
y be suggested. The cross-lagged method is used 
on two variables are measured at two points in 
ic, so that both concurrent and predictive rela- 

ships between all pairs of variables are repre- 
nted by correlations. 
if performance is the cause of satisfaction, then 
ne expects to find a pattern of correlations some- 
ning like re>n=re> rs. One problem with this 
.pproach is that few theories of organizational be- 
havior specify time lags for causal effects. In the 
resent case a lag of two months was used, but this 
may or may not be appropriate for possible causal 
effects to be detected on the two variables studied 
in this research, 

Another weakness of cross-lagged correlational 
analysis is that it may show significant results when 
some third variable has influenced the observed rela- 
lationship between performance and satisfaction. In 
an attempt to deal with the weakness, Vroom 
(1966) proposed the use of dynamic correlations. A 
dynamic correlation relates changes in one variable 
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Sauiſac in — .7 5° 


Figure 2. Overall job satisfaction and p 
relationship. (% < 01). 


relationship in Figure 2 shows all positive 
correlations. However, only the test-retest 
correlations for job satisfaction and per- 
formance were significant. None of the 
concurrent or predictive satisfaction and 
performance correlations were significant. 7 

When job satisfaction was split into ex- 
trinsic and intrinsic composites? a slightly 
different pattern of correlations emerged. 


2The correlations between intrinsic satisfaction 


Performance Performance 
and extrinsic satisfaction were .18 (us) at one mon 
and .23 (ns) at three months. 
750 
71 77 TABLE 1 
1175 DYNAMIC CORRELATIONS 
Satisfaction 
Satisfaction Satisfaction 
T: 
: Performance Overall Intrinsic | Extrinsi 
Ficure 1. Cross-Lagged correlations for satisfac- 
30* 21 


tion and performance. (ra and 1: Test-retest reli- 24 
abilities; 1 and rs: Concurrent relationships; rs and 
ry: Predictive relationships.) 


* <05. 
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30° 
14 18 
» | 
Satisfaction 68˙ Satisfaction 


Ficure 3. Extrinsic satisfaction and job perform- 
ance. (*p< 0s; *#p < 01). 


Figure 3 shows the relationship between ex- 
trinsic satisfaction and job performance. 
Again, all correlations were positive, and sat- 
isfaction and performance were most strongly 
related to themselves, In contrast to Figure 2, 
however, a significant correlation appeared 
between extrinsic satisfaction at one month 
and job performance at three months. Al- 
though this particular relationship was sta- 
tistically significant, it was not significantly 
different from any of the other three correla- 
tions relating satisfaction and performance. 
When job satisfaction was intrinsic, as 
shown in Figure 4, the correlations were all 
Positive. Again, satisfaction and performance 
correlated most highly with themselves. The 
results in this study show that performance 
at one month was significantly related to 
Subsequent intrinsic job satisfaction. This re- 
sult was just the reverse of the findings when 
extrinsic satisfaction was related to perform- 
ance. Not only was the r= 37 significant, 
but it is significantly different from both con- 
current correlations? The difference between 
the r = 37 and the two cross-lagged correla- 
tions was only close to significance (p < 09). 


2 When testing for a Significant difference between 
-37 and the two concurrent rs, .10 and .15, a f test 
which accounts for the common array was used. The 
results showed that .37 is significantly different from 
10 (% Sol) and from 15 (p< 05). 


Performance Art Performance 
12 
10 AS 
sr | 
Satisfaction 


14*# ————— Satistaction 


Ficure 4. Intrinsic satisfaction and job performance. 
(Pp < 05; **p <.01), 
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Table 1 shows the dynamic co! itions 
corresponding to each of the Figures 2, 3, 
and 4. Although all three were positive, only 
the intrinsic satisfaction and job performanee 
correlation was significant. This res was 
parallel to the findings in Figure 4. | fact, 
finding a positive, nonsignificant correlation 
for overall satisfaction and performance also 
paralleled the results in Figure 2. The dy- 
namic correlation between extrinsic ıtisfac- 


tion and performance was not sig cant 
and did not lend support to the results in 
Figure 3, 


Discussion 


Theory and research concerning job satis- 
faction and performance date back a number 
of years in studies of organizational behavior, 
Although some have called for a moratorium 
on research of this relationship (Schwab & 
Cummings, 1970), it was felt that some im- 
provement might be made over previous re- 
search in this area. Using both theoretical 
and methodological contributions from recent 
years, a more comprehensive strategy for 
analysis was designed which employed com- 
posites of satisfaction and performance based 
on multiple measures, the distinction be- 
tween intrinsic and extrinsic satisfaction, and 
the combination of cross-lagged and dynamic 
correlations. 

The results are somewhat similar to previ- 
ous research, as well as suggestive of new 
directions for theory and research. Any con- 
clusions drawn from this research must, how- 
ever, remain somewhat tentative. This is be- 
cause of limitations on subject and job 
samples used in this study. These data were 
based on females only and were limited to 
those working as telephone operators. Gen- 
eralizations to other samples and other jobs 
must be made cautiously. On the other hand, 
it is possible that relationships reported in 
this study were attenuated by restrictions in 
range of both subjects and jobs, 

Caution must be exercised in trying to 
interpret these results because the particular 
nature of subjects, their jobs, and the orga- 
nization may account for the differences be- 
tween extrinsic and intrinsic satisfaction. It 
must be kept in mind that this sample was 
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all ne hired, had never been telephone 
operat sreviously, and was entirely fe- 
male. s possible to explain the relation- 
ship | en initial extrinsic satisfaction and 
subse. at performance in terms of a desire 


to good” to supervisors. This seems 
to b: casonable as these people were all 
the organization, Perhaps the hu- 


new 

man lations’ view (“a satisfied worker is 
ag worker“) is a correct description of 
new ployee motivation. 


I also possible to interpret the initial 
per mance and subsequent intrinsic satis- 
fac a relationship as a result of this par- 
tic ır situation, The telephone operators’ job 
is ot one with potential for satisfying 
“h` her order” needs or being intrinsically 
savsfying (Hackman & Lawler, 1971; 
W nous, 1973). However, it is often experi- 
e: ced as initially challenging. The monotony, 
l-k of variety and low autonomy are not 
typically experienced by newly hired oper- 
ors. Thus, it is possible that the significant 
relation between initial performance and 
ubsequent intrinsic satisfaction found in this 
study might not appear in a sample of more 
xperienced employees. 

The results and discussion indicate that 
there probably is no single “correct” rela- 
tionship between satisfaction and perform- 
ance, Sometimes there appears to be no rela- 
tionship at all, which is consistent with the 
view that situations can be created by reward 
systems that will support any causal model. 
However, the view taken in this research is 
that there are additional conditions besides 
reward systems when certain performance 
and satisfaction relationships are significant, 
for example, type of satisfaction, work ex- 
perience of individuals, and type of job situ- 
ation they are in. This research should be 
considered as a first step in the identification 
of such conditions and in the empirical 
mapping of possible causal relationships. 

Future research and thinking might con- 
sider further the occasions when performance 
seems to be the cause of satisfaction and 
vice versa, as suggested by the present study. 


4 The sample was, however, reasonably “typical” in 
its distribution of age, race, and educational back- 


ground. 
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The data in this research are in general agree- 
ment with a reciprocal causation view of job 
satisfaction and performance, as recently 
stated several times. To elaborate such a view 
of satisfaction and performance, more 
search is needed which specifies in detail 
effects of different job settings, sub 
samples, and measures of performance and 
satisfaction on the causal and feedback rela- 
tionships between satisfaction and performance, 
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-ecent studies of blue-collar-white-collar 
d erences in sources of job satisfaction and 
d satisfaction have used three basic sam- 
‘ing methods: (a) population or random 

mpling of different job levels within the 
ume plant (Armstrong, 1971; Gluskinos & 
-estelman, 1971) or of different job levels 
+ one location of a multiunit organization 
‘Friedlander, 1965); (b) stratified random 
-ampling of a cross section of employed per- 
Ins in one urban area (Centers & Bugental, 
1966) or in the United States as a whole 
(Gurin, Veroff, & Feld, 1960; Morse & Weiss, 
1972); and (c) “accidental” sampling of 
employees in different occupations living in a 
single urban area (Locke, 1973). 

Despite these different sampling procedures, 
the findings have been generally consistent 
with Herzberg, Mausner, Peterson, and Cap- 


1 This study was supported by Contract No. 68- 
03-0041 between the Environmental Protection 
Agency (EPA) and Applied Management Sciences 
(AMS), Silver Spring, Maryland. The computer 
time for the analyses in this paper was supported 
in full through the facilities of the University of 
Maryland Computer Science Center. 

The contents of this paper represent the views of 
the authors only and do not represent the official 
position of EPA or AMS; nor does the publication 
of this paper imply that the authors agree with the 
policies of EPA. 

The authors wish to thank David Snyder and 
Todd Tucker of AMS, and John Hill of EPA for 
their help in designing, conducting, and analyzing 
this study. 

2 Requests for reprints should be sent to Edwin 
A. Locke, College of Business and Management, Uni- 
versity of Maryland, College Park, Maryland 20742. 


SOURCES OF SATISFACTION AND DISSATISFACTION 
AMONG SOLID WASTE MANAGEMENT EMPLOYEES * 
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A national random sample of 911 employces in the solid waste management 
industry were interviewed concerning their job attitudes. While white-collar 
employees were more satisfied with their jobs than were blue-collar employees, 
both the latter and the secretarial employees were more satisfied than were 
the employees at the same job levels in other types of work. These differ- 
ences were attributed to pay and work differences between solid waste man- 
agement and other occupations, White-collar employees were more likely to 
derive satisfaction and dissatisfaction from intrinsic sources and less likely to 
derive them from extrinsic sources than were blue-collar workers. Herzberg 
motivator-hygiene theory of attitudes was not supported by the data. 


well’s (1957) conclusions based on an inte- 
gration of the earlier literature, White-collai 
workers are more likely to derive pleasur 
from intrinsic factors such as the work itself 
and work achievement, while blue-co 
employees more often get pleasure from ex- 
trinsic factors such as physical or social work 
ing conditions and pay or security. e 
Minimal job level differences were found, 
however, by Myers (1964) at Texas Instru- 
ments, Dallas, who used Herzberg, Mausner | 
and Snyderman's (1959) critical incident 
procedure and classification System. Herz- 
berg’s (1966) more recent work also de- 
emphasizes individual, and therefore job level, 
differences in sources of satisfaction and dis- 
satisfaction on the job. Schneider and Locke 
(1971) have criticized Herzberg's classifica- 
tion system on the grounds of logical incon- | 
sistency. Locke (1973), using the same criti- 
cal incident method but a new classification 
system developed by Schneider and Locke 
(1971), obtained results consistent with the 
findings of Herzberg et al., 1957 with one 
accidental sample of white-collar and blue- 
collar employees, but not with another sam- 
ple. Since there was some evidence to indi- 
cate that the latter failure to replicate was 
attributable to differences in occupational 
sampling, Locke recommended that future 
studies use employees from single occupa- 
tional groups. l 
One purpose of the present study was to 
attempt to replicate earlier findings on white- 
collar-blue-collar differences using a nation- 
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wide random sample of employees from a METHOD 
single industry—solid waste management The employee data were gathered a of a 
(e., collection and disposal of garbage and larger nationwide study of the solid was 
bother solid waste), To date, according to the ment industry. (A detailed description 

x present investigators’ knowledge, this is the study is found in Applied Managemer 


neces, 

2 2 ? 1972). Management representatives from pub- 

only study of sources of satisfaction and dis- lic and private collection agencies were i ewed 
satisfaction using a sample of this type. The concerning company policies and manpow: some 
Present study used the Herzberg critical inci- of these firms employees were also intery con- 


dent method, and the revised incident classi- cerning their attitudes and opinions. 
fication system (Locke, 1973), and included 


à Sampling Plan 
Other questions as well, 8 5 8 ; * 
rpose f e basic design involved a nationa ifi 
* er, 22 ri of e agi Hes to test random sample of agencies engaged in soli vaste 
ore TES motivation- Biene neory using management. The stratification variables we geo- 
the revised incident classification system graphical region (7); city size (4) based the 
which calls for separate classification by Standard Metropolitan Statistical Area syste. with 


events (what happened) and agents (who random selection of cities within cach tum 
made it happen) Previous results using more (though in the largest two Size categories th was 


` 100% sampling); and public versus priv vith 
samples have shown consistently random sampling in each stratum for < city 
vator events were more likely to selected, though in the largest cities close tc 00% 


both Satisfaction and dissatisfaction of the public agencies were sampled. 
re hygiene events, but that the self All ikea in anon chosen by the previous =. shod 
y R — 9 5 A were ed to allow a management represe ive 
3 pee likely to be Siven credit for satisfy- to submit to an interview concerning mana ent 
7 events and others blamed for dissatisfy- Policies. In addition, in approximately one ou. of 
events rather than vice versa (Locke, every three or four agencies, the interviewer «ked 
in Press; Schneider & Locke, 1971). Permission to conduct one employee interview 

Due to the limited sample sizes these 2 few cases two or three employees were inter- 

7 rii r atifvi 

previous studies did not report event agent viewed from the same agency.) The stratifving 


8 r : variables for the employee sample were geographi- 
interactions ona category-by-category basis. cal region; public versus Private; and job level (5). 
‘The third purpose of the Present study was Equal numbers of p 


to report such interactions for the first time, supposed to be interviewed within each region since 


„which agents are seen as primarily there are more in the p 


responsible for each? (b) With respect to levels were: (a) managerial, 
“agents, which events a 


the population. Within each company, the employee 
Waste employees to other Occupational groups was chosen at random from within the job category 


in terms of degree or amount of satisfaction, desired. (Sometimes the chosen empl 


in Sales & House, 1971). Therefore 


„it was The 46 questions asked of the employees fell into 
Possible to compare soli fo i 


d waste employees at ur main areas: (a) demographic information (age, 
ese groups. Since ihe le etc.); (b) present job information (job 
; F 9 title, tenure, hours of work, reason for taking present 
d 10 b 1 . . is almost universally job, etc.) ; (00 rewards (previous raises, present pay, 
£ = ered to e prototype of a “bad benefits, etc.); and (d) attitudes (amount of satis- 
job“ (eg., low status, physically tiring, dirty, faction, sources of satisfaction, etc.). Some informa- 
£ smelly, too hot, too cold, etc.), one might 590 (name, job level, etc.) was obtained in advance 
_ &pect employees in this field to be less satis- he Management. ; 
k 8 Th i i 
fied than employees in most other fiel dst fe ae becuse 1 items used for this report 
z 


ers 


or working in solid waste field was de- 
om the following open-ended item: “How 
ome to work in the solid waste manage- 
Wat made you decide to work in this 


t of satisfaction with each of 11 job as- 
s measured by means of a 7-point “faces” 
ith verbal anchors. The sum of the faces 
a combined overall satisfaction measure.“ 
zle, indirect measure of satisfaction was de- 
‘rom a yes or no response to the following 
used in previous studies; cf, Sales & House, 
“If you had your life to live over, would 
ke to wind up in the same line of work as 
e you're doing now?” 
[ rces of satisfaction were determined from four 
it two of the Herzberg type (one for satisfac- 
and one for dissatisfaction); 


>w would you try to think of a specific time 
hen you were satisfied [dissatisfied] with your 

esent job, a time when you felt especially 
vod [bad] about your job. Can you think of 
uch a time? Exactly what happened to make 
you feel good [bad]? What led up to this good 
[bad] feeling? Who or what was mainly re- 
sponsible for this event? 


ad two of the conventional type: 


What two things about this job give you the 
most pleasure [displeasure]? What two things 
make you happiest [most unhappy] about your 
job here? 


Che order of asking for satisfiers and dissatisfiers 
was alternated by each interviewer. 

Another open-ended item asked for the individu- 
als work preoccupations: 


Now I would like you to think back to this 
morning [or any typical morning] when you 
were getting ready to come to work or were on 
your way to work. I would like to know what 
you were thinking about as you got ready to 
leave or as you were on your way. Can you 
remember this? . . How did you feel? What 
kind of mood were you in? 


Coding categories for the open-ended items are 
described in the Results section.* 


Interviewers 


All questionnaires were administered by inter- 
view. The interviewing staff consisted of 15 mili- 


3 The 11 dimensions were: work itself, work vari- 
ety, amount of work, pay, chances for promotion, 
supervision, co-workers, subordinates, hours of work, 
equipment, and benefits. 

+All responses that were uncodable (according to 
the coding system used) were omitted from the 
analyses along with missing data caused by non- 
response to the question. Thus, the us vary from 
table to table. 


SATISFACTION AND DISSATISFACTION AMONG MANAGEMENT Eurtoxtes 


tary retirees selected from a pool of over 40 
the basis of previous experience and other p 
factors. All interviewees received from three to fiw 
full days of training, including: (a) an exp i 
of the study; (6) information concerning 
waste industry; (c) an explanation of the emp 
questionnaire including the purpose of each 
(d) a chance to listen to tapes of previous i 
views conducted during the pretest stage; and (el 
practice interviews with simulated and/or act 
employees followed by critiques from senior st 
members. In addition, each interviewee was pro- 
vided with a highly detailed interviewer's ma: 
covering all aspects of the questionnaire and th 
interview process, and was quizzed as to its cont 
before being allowed to begin interviewing. 
Employee interviews were conducted on-site 
in complete privacy with a guarantee of total cor 
fidentiality, Interviews were verified by follow-up 
post cards mailed to the interviewees. 4 


Editing and Coding 
Completed interview forms were mailed back t 
conducting 


either by calling the employee directly (at home 
calling the interviewer. Interviewers were kept 
stantly informed of observed errors, 

All open-ended items were coded 
by at least two coders. Disagreements, except in 
case of the Herzberg-type questions, 
by discussion among the coders, The the 
Herzberg-type questions was checked by a third — 
coder (the first author was the third coder for the 
first half of the sample), since these were the most 
difficult items to code.* Records of interrater agree- 
ment could not be kept (due to production pres- 
sures on the coders), but generally disagreements 
occurred only when the interviewer himself had 
been incomplete or unclear in the answers he clic- 
ited and/or wrote. Often such ambiguities were 
clarified by call-backs. 


RESULTS 


Demographic and Pay Data 


The second through fifth lines of Table 1 
show pertinent demographic and pay data 
for the five job levels. Excluding the secre- 
tarial/clerical employees for the moment, it 


may be raised here. In answer it should be pointed 
out that in a previous study (Locke, 1973, Study — 
2) this author agreed with a naive rater on 84% 

of the event and 88% of the agent categorizations. — 
Furthermore, this author was only one (and last) 
of three coders on half of the sample and did not 
look at the coding of the remaining half of the 
sample. None of the other coders were acquainted 


with the Herzberg theory. ; 
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TABLE 1 


DEMOGRAPHIC AND OVERALL SATISFACTION DATA gy Jos Lever 


Employee 
. x Secretarial Í 
Unskilled Skilled Supervisory 1 Managerial 
Demographic data (= = 526) | (ne = 303) (m = 146) 4 5 (m = 61) J 
* $D $ | sp $ SD * SD X So 
Tenure (in years) 449) 6.01 | 6.20] 6.90 10.40 7.70 4.77 | 5.24 8.30 8.19 | 4 0.18" 
Age (in years) 33.78 11.45 36.56 | 11.55 41.97 | 11.93 37.32 | 12.63 | 40.53 | 10:63 ja 4.024 
Education (in yearn) | 95i | 281) 1015 | 242 11.00 2.48 12.19 | 1,74 | 12.90 | 2.81 | 4/5 8.05% 
eekly gross pay 144.00 | $0.25 | 164.70 | 43.03 206.10 | 69.02 | 139.00 | 47.83 | 245.30 | 113.72 | 4/8 5.998 
Total satisfaction | | Í 
Cum of I faces | | | 
J sasó | 10.91 | 60.19 925 | 64.90 7.69 65.84 8.83 6445 8.90 | 4/85 21.50* 
Indirect satidaction | | f | | 
(% yea) | | | 
(Choose mme line | | | | | 
of work again?) | 40.4 | | ss4 | 69.0 736 | | 92.8 | 1 7.660" 
l U l 


Nole. n for some means are fess than the total figure gi 


o e me: iven due to missing data. 
* Chi-square; Yates correction for continuity applied. 
"$ < oo. 


when age, education, and tenure w par- 
older, have more education and tenure and tialled out. 


are more highly paid than blue collar work- The percentages in the bottom 


are 
ers. All F ratios are significant at the .001 for the item: “If you had your life live 
level. The pay and education differences re- over, would you like to wind up in the same 
quire no explanation. The age and tenure line of work as the one you're doing now?” 
differences can be attributed to the higher Sales and House ( 1971) reported that in 


5 previous studies between 16% and 21% of 
ees and the fact that garbage collection, being unskilled steelworkers and automobile work- 


cally demanding, is a younger man's ers answered yes to this question, whereas 
job. The secretarial /clerical employees more 46% of unskilled solid waste employees gave 
closely resemble the blue collar than the the same answer. Similarly, between 41% 

white collar employees with respect to ten- and 52% of skilled printers, steelworkers, and 
ure, and pay. automobile workers in previous studies gave 


Amount of Satisfaction a yes answer, while 55% of skilled solid 
5 The bo waste employees said they would choose this 
- e bottom two rows of Table 1 show the same line of work again. Thus, at the blue- 
K usual Positive association between job level 


collar level, garbage collectors appear to be 
more rather than less satisfied with their jobs 
x 9 8 than workers at the same job level in other 
Sociated with job level differences by their fields, : 
5 Sales and House reported no satisfaction 
education, or tenure could have been te. data for supervisors but they noted that a 
á 1 ren nationwide study of clerical employees yielded 
tween job levels. To test this aoe job 42% yes responses compared to 74% in the 
icho ue collar-white present sample. This difference in the favor 
ag oming the secretarial employees of the solid waste group is even more striking 
mio did not consistently follow the pattern of than that for the blue-collar employees. 
85 18 155 „ Job level and € Managerial employees in this study 
e variables in Table 1 were then intercor- ich i 
Related e ah esta: gave 62% yes responses which is somewhat 


The results lower than the 60% fi i ionwid 
‘Oss ; t gure in a nationwide 
showed a significant (p < 00¹) relationship Sample of managers reported by Sales and 
between job level and total Satisfaction even House. 


= 


Satisfiers | Dissatisiers Total 
Event — — — — us 
i 
n - Tee N A i * b 
4 | ——— — —ů 
r | ! f 
activity 28 | 36 32 | 24 » 54 
int of work M 43 107 182 lar | 9.8 
ithness 59 75 138 197 197 13.2 
ievement 8 105 | 33 47 [116 7.7 
notions %0 114 26 37 | 116 77 
sponsibility 33 42 10 14 as 29 
ognition 124 15.7 96 / 137 2 ns 
I ene i 
ney á 185 234 61 | 27 20 165 
terpersonal atmosphere 33 42 45 | 64 "| 83 
Vorking conditions 120 15.2 | 134 | 19.1 24 17.0 
Totals 789 702 | | 1aon 
“Au y faia are based on | degree of 3 aiie — 


"p<. 
* <.001. 


Herzberg Theory and Event-Agent 
Interactions 


The categories used for coding the Herz- 
berg-type questions are the same as those 
used by Locke (1973) as adapted from 
Schneider and Locke (1971). Table 2 shows 
the results for events for all subjects com- 
bined on a category-by-category basis. 

The overall chi-square was significant (p < 
001) but not of direct relevance to the 
Herzberg theory. More pertinent are the fre- 
quencies for the individual categories. Since 
the rows of the table are not independent, the 
chi-squares are meaningful mainly as descrip- 
tive (rather than inferential) statistics. 

Herzberg’s theory (or Theory 3 according 
to King’s (1970) classification of its five 
versions) predicts that motivators are more 
likely to act as satisfiers than as dissatisfiers, 
while the opposite holds true for hygienes. 
However, the data show that three of the 
seven motivator categories (task activity, 
amount of work, and smoothness of work) 
are more likely to act as dissatisfiers than as 
satisfiers. Three other motivators (achieve- 
ment, promotions, and responsibility) show a 
pattern in agreement with the Herzberg 
theory, while one category (recognition) 
shows no significant trend. With respect to 
hygienes, money is more likely to act as a 
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TABLE 2 
TOTAL FREQUENCY DistaimTion vor SaToorme axo eee Evers 


nificant trend. 8 

Thus, in 4 of the 10 comparisons, Herz- 
berg's theory was contradicted; in 3 cases it 
was not supported; and in 3 cases it was sup- 


The results for the combined motivator and 
hygiene categories are shown separately by 
job level in Table 3. The chi-square values 
for the 2 X 2 contingency tables are signifi- 
cant for the unskilled and skilled employees 
only. Furthermore, the relationships for these 
two subsamples are directly opposite to what 
the Herzberg theory predicts. In these groups, 
motivators are more likely to act as dissatis- 
fiers and hygienes as satisfiers rather than 
vice versa. Generally, motivators were more 
likely than hygienes to cause both satisfac- 
tion and dissatisfaction in the employee 
groups. = 

The results for the individual agent cate- 


gories are shown for the combined sample in 


Table 4. Again the chi-squares for each ro w- 


are presented separately for descriptive pur- 
poses. The self, the supervisor, and the or 
ganization are more likely to be seen as cau 

ing satisfier than dissatisfier events, while the 


opposite is true for co-workers, subordinates, 
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TABLE 3 


FREQUENCY OF MOTIVATORS AND HYGIENES AS 
SATISFIERS OR DISSATISFIERS BY Jos LeveL 


Dissat- | 


Sat- 
Employee | isfiers | isficrs * 
n % „ |% 

Unskilled 

Motivators 110 | 41 | 151 | 60 | 18,75%* 

Hygienes 161 | 59 | 101 | 40 
Skilled 

Motivators 143 | 53 | 159 65] 6.57* 

Hygienes 125 47 80 35 
Supervisory 

Motivators 100 76 78174] 05 

Hygienes 3124 2726 
Secretarial/clerical 

Motivators 51178 42|72| 32 
__ Hygienes 1422 16 |28 
Managerial, professional 
à and technical 
_ Motivators 47 87 32|76 1.24 
_ Hygienes 7 13 1024 
Es parers correction applied. 

* Si. 

customers, and nonhuman agents (e.g., the 


weather). 


policies are primarily dissatisfiers, the oppo- 
‘Site of the results obtained in this study. 

There was a definite Overall tendency for 

self to be given credit for satisfying 


The event-agent interactions are shown in 
Table 5 for those subjects for whom com- 
plete data were available. (Those indicating 


The satisfying and dissatisfying interactions 
re given in contiguous columns, with the 
otal for each pair. (The chi-square for the 
interactions are 
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highly significant, ps < .001.) W there 
are definite differences within ‘Ms in 
the frequencies of satisfiers ve: ssatis~ 
fiers (as expected, based on Ta 2 and 
4), the most frequently mention: sfying 
event attributed to every agent wi o the 
most frequently mentioned dissatis event 


attributed to that agent. For ex , the 


self was most often mentioned as cause 
of both achievement and failure. 

The supervisor was most freque: men- 
tioned as the source of both posi). e and 
negative recognition. The supervisor -ıs also 
mentioned often as a source of pre» otions, 
money (e.g., raises), working conditi <. and 
interpersonal relations, Co-workers w- © seen 
mainly as affecting interpersonal 1: tions, 
while subordinates were most often po eived 
to affect the smoothness of the work 

The company or organization was ‘ewed 
as causing the highest percentage both 
satisfying and dissatisfying events, wi em- 
phasis on money, working conditions mo- 
tions, amount of work, and recognition. Cus- 
tomers were seen mainly as agents of re. ogni- 
tion, and unions as a source of higher pay 
(money). Typically the nonhuman agent was 


nature or the weather that caused unpleasant 
working conditions and made the work go less 
smoothly (e.g., as when it snowed or rained ). 

Overall the organization and the super- 
visor were seen as being by far the most 
frequent causes of both Satisfying and dis- 
satisfying events (with the exception of non- 


TABLE 4 


TOTAL FREQUENCY DISTRIBUTION FOR SATISFIER 
AND DISSATISFIER AGENTS 


A Dissat- 
Satisfier isfier Total 
Agent * 
n n % n Mo 
Self 95 12.3 46] 65| 141 9,5 12.65%" 
Supervisor 212 | 27.5 | 122 172 334 |226 | 16.961 
o-workers 23) 30| 56| 7:9 . 
Subordinates 14| 1.8) 35 | 40 49| 3.3 9.89%% 
‘ganization |307 39.9 188 26.5 495 33.5 19.28 „ 
Customers 49| 6.4) 710.0 1208.1] 504“ 
Nonhuman 57| 7.4/176 24.8 233 18.8 70.00 
Union e l 28| 1.9 17 
Totals 770 709 1,479 150.30*** 


0 * All chi-squares are based on 
Vates's correction applied. 
*p <05. 


51 
* 001. 


1 degree of freedom. 


TABLE 5 
FREQUENCY OF EVENT/AGENT INTERACTIONS FOR SATISFYING AND DISSATISFYING INCIDENTS (ALL SUBJECTS) 


Event 


Total | S. | De! Torat | s+ | D»| Torat S- 


22 ͤ A O A OA) 21 3 EE — — S | D> 
Task activity 2 2 $ — — 
Amount of work 17 131 13 1131 4 13 1 28 4% 
Smoothness 7 1⁄2 | 4) th] 13 10 25 3 1 2| 9% 
Achievement 7 a 1 1 $i 2 7 7 ~ aln 
Promotion 7 1 1 — | 44 — 22 St 
Responsibility 2 5 r9 UE N ES ON — | aS 2 | a) 35 
Recognition 2 6) 4) w 1 1 2w = 33) 10 
Money 4 2 = — 1112 — 1122 | 95 
Interpersonal atmosphere | g Sa e e a hese lh ee aes. sh sas | 68 
i 
Nees 93 36 12 HO jasso] 1 % 4 25 * fias | 137 
io |737 | “se 
a Satisfying. 


Patel of Satisfying lus Dissatisfying, 
si 3 A 
4 Agent Sen duta omitted for subjects indicating event but no agent or agent but no event. 


mentioned agent for two others 
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TABLE 6 
SATISFIER AND DISSATISFIER EVENT FREQUENCIES FOR BLUE-CoLLAR AND Wurre-Co1 s 
Satisfier Dissati 
Event Blue-collar | White-collar® Bluc-collar | White-c: 
— — — ** ** 
n % | n % n % n 

Task activity EEE 25 3 | 70| 13| s 27 
Amount of work 31 S8 |a 11 5.56* 89 |179| 10 | 638 39%% 
Smoothness 30 56] 20 10.8 471 | #5 15.1 48 32 30.7 
Achievement | 33 | 61 |} 41 | 22.2 3.1516 3.2 12] 8 27° 

Promotions | 43 | 80| 45 | 24.3 28.90 | 4 15 

Responsibility 14 26 10 54 2.84 12 10 

Recognition } 8 | 158 | 21 | 11.4 1.85 71 | 143 17 | 11.6 H 

Money 148 | 27.5 | 29 | 15.7 7.32" |. 48 | 97 7 | 48 68 

Interpersonal atmosphere 211 4.72˙ 25 5.2 128.2 17 

Working conditions | 110 | 20.4 7 | 38 | 22.54 | 113 |227 | 18 | 122 3 
Totals | 539 | 185 111.24 | 497 | 147 son" 


* Secretarial and 
è All chi-squares 
*p s. 


derical employees omitted. 


human, which played a major role in causing 
dissatisfying events). The company, for ex- 
ample, was seen as the major causal agent in 
5 of the 10 event categories (task, amount of 
work, promotion, responsibility, and money), 
while the supervisor was the most frequently 
(recognition 
and interpersonal atmosphere). 


Blue-Collar-White-Collar Differences in 


Sources of Satisfaction and Dissatisfaction 


For the purpose of making blue-collar- 
white-collar comparisons, the secretarial / 
clerical employees were deleted. The main 
reason was that a substantial proportion of 
this group was female. Since employees in 
the remaining job Categories were all male, 
this would have led to a confounding of sex 
with job level when making comparisons. The 
blue-collar category, then, consisted of the 
unskilled and skilled employees while the 
white-collar group was composed of super- 
visory and managerial (professional, etc.) 
personnel. It should be noted that in this 
study the terms job level and white-collar— 
blue-collar are used interchangeably. They 
are not viewed as separate concepts but as 
alternative ways of referring to the 


same 
basic distinction. 


are based on I degree of freedom. Vates's correction applied, 


Table 6 shows the Herzberg eve: 
gory results by satisfier versus dis 


cate- 
isfier 


and blue- versus white-collar. The col:::nn of 
chi-squares were again calculated for de- 
scriptive purposes. Considering first the satis- 
fier events, blue-collar employees were more 
likely to cite amount of work, money, inter- 


personal atmosphere, and working conditions 
as satisfiers, while white-collar employees 
were more likely to derive satisfaction from 
smoothness of work, achievement, and promo- 
tions. The pattern for dissatisfaction was 
parallel, with blue-collar employees more 
often being dissatisfied by amount of work 
and working conditions, and white-collar em- 
ployees being more frequently dissatisfied by 
(the lack of) smoothness and (the lack of) 
achievement in the work. 

The results for agents shown in Table 7 
reveal that white-collar employees were more 
prone to see the self and subordinates as 
causing satisfaction, while blue-collar work- 
ers were more likely to see the organization 
as a source of satisfaction. White-collar em- 
ployees more frequently attributed dissatisfy- 
ing events to subordinates and the union, 
while blue-collar workers more often saw 


their supervisors and co-workers as agents of 
dissatisfaction. 


g3 J = * 
N 
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TABLE 7 


FISIER AND DISSATISTIER AGENT Fexquencurs rox BLog axo n H Eurtovers 


| Satisfier Dissatistier 
t | Blue-collar | White-collar* Blue-collar | White-collae | 
| I 
—— — * 5 
| n = i m % 1 * 1 % | 
42 | 80 | aeaa | ware | | ot | 9 | 032 
rvisor 136 | 26.1 5S | 88 0.20 100 | m2 * 10.7 
vorkers 15 29 3 16 om | 45 91 6 a8 
vrdinates 3 06 | 1 | 60 | 17.39% 7 14 ar | no 
nization | 231 | 443 50 | 321 46 | 129 | 261 11 288 
tomers 39 tarras 43 1.85 $2 10.5 9 1 $7 | 
nhuman 4 84 | 10 54 1.23 126 | 25.5 #0 | 252 
ion 12 23 1 0.5 L43 5 10 10 63 
Totals 522 184 | 40. J KRAJ 159 


Secretarial and clerical employees omitted 


Since each employee was instructed to 
nake two responses to each sources of plea- 
sure and displeasure item, the two responses 
were combined for purposes of analysis. (In 
some cases, the employee only made one re- 
sponse.) Since the employees were not asked 
specifically to cite agents or events, the cate- 
gorizing system was necessarily based on the 
answers they gave spontaneously. The coding 
categories turned out to be similar to the re- 
vised Herzberg categories used in the previ- 
ous analyses except that events and agents 
were not kept separate. 

The results (not shown) were that white- 
collar employees mentioned the work itself, 
achievement, smoothness, subordinates, and 
customers as sources of pleasure more fre- 
quently than did blue-collar employees. The 
latter more often derived pleasure from the 
amount of work, pay (benefits/security), co- 
workers, and working conditions. Blue-collar 
employees were also more likely to derive 
' displeasure from these same sources than 
were white-collar employees. The latter were 
more often displeased by subordinates and 
customers. 

The above pattern of results was further 
verified by the employees’ descriptions of 
(a) their thoughts or preoccupations while 
coming to work in the morning and (b) of 
their reasons for working in the solid waste 


‘All chi-squares are based on 1 degree of freedom. Vates’ cotrection applied. 


field. Blue-collar employees were more pre- 
occupied with being on time, the amount of 
work, and working conditions than were 
white-collar workers. They were also more 
likely to be thinking of quitting and of off- 
the-job activities, presumably due to heir 
greater dissatisfaction with their jobs as 
compared to white-collar employees (ce 
Table 1). Blue-collar employees more often 
took their jobs mainly because they needed 3 
a job. In contrast, the white-collar employ- 
ees were more preoccupied with the content a 
of the work to be done and with subordinates 9 
and were more likely to have taken their job 
because they like the type of work.“ 


Discussion 


The previously obtained differences be- 
tween white-collar and blue-collar employees 
with respect to sources of satisfaction was 
replicated for the first time with a nationwide — 
sample of workers in the same occupation. 
The intrinsic-extrinsic dichotomy affords ae 
reasonably accurate way of summarizing the 
nature of these differences. The white-collar 
employees were relatively more often affected 
by the work itself, achievement, smoothness d 


¥ 


o Tables showing the actual category frequencies 
and relevant descriptive statistics for the items 
described in the latter three paragraphs are available 
from the authors upon request. 
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(a kind of short-term achievement), by re- 
wards that are associated with changes in 
work content (promotions), and by the 
agents most often associated with the fore- 
going events (self in the case of achieve- 
ment, and subordinates in the case of smooth- 
ness). 

In contrast, blue-collar employees were 
more often satisfied or dissatisfied by extrinsic 
rewards such as money and benefits, by work 
context factors such as working conditions, 
and interpersonal relations, and by co-workers 
who were associated with the latter type of 
event. The one exception to this trend was 
that blue-collar workers were more affected 
than white-collar employees by the amount 
of work. While amount of work is not ex- 
trinsic to the work itself, it is separate from 
work content, work progress (smoothness), 
and work outcome (achievement). The blue- 
collar orientation in this sample appears to 
involve a view of work as a means to an end 
rather than an end in itself; the work itself 
is liked to the extent that there is not very 
much of it. (Blue-collar employees rarely 

complained about too little work.) Presum- 
4 ably avoidance of fatigue is a major goal of 
7 employees with physically demanding work. 
4 Chadwick-Jones (1969) found reduced physi- 
cal effort to be a major source of satisfaction 
for English workers who had been transferred 
_ from a preautomated to an automated steel 
plant. 

These white-collar—_blue-collar differences 
should not necessarily be interpreted as indi- 
cating that the two groups of employees have 
= (on the average) different value systems. It 

is equally plausible to assume that the two 

groups have simply had different work expe- 
riences. For example, in the present study, of 
the 455 satisfying events reported. by em- 
ployees whom the company claimed had never 
received a promotion, only 10 (2%) de- 
scribed a promotion as a satisfying event, 

(Presumably these 10 individuals defined pro- 

motion differently than their bosses or else 

the company records were incorrect.) In 
contrast, of the 334 satisfying events reported 
by employees who had received a promotion, 

80 (24%) described a promotion as a satis- 

fying incident. To take another example, 

white-collar employees were much less likely 
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than were blue-collar emplo 


rention 
the weather as a source oi tion or 
dissatisfaction. One reason > been 
that the former do their work while 
most of the latter work outdo: nilarly, 
blue-collar employees did not ention 
subordinates as sources of sa ion or 
dissatisfaction (Table 7) becar ost of 
them did not have any subordir 
To restate the obvious, an eve: ust ac- 
tually happen before it can be de dasa 


source of satisfaction or dissati n. If 


an event is not mentioned often could 
mean either that it did not occu: that it 
occurred but the individual did no praise 
it as significant enough to be worth rting. 

With respect to Herzberg’s the little 
support for its main tenets was fou t any 
job level in this study. Previous stu using 
the same classification system en ed in 
this study found that motivators > the 
major source of both satisfaction . dis- 
satisfaction. The same result was ob d in 
this study, except that the trend w.- more 
marked at the higher job levels. 

The major event-agent interactions are 


most easily interpreted from an instrumental 
or functional point of view. The agenis can 
be viewed as facilitating or blocking employee 
value attainment (Locke, 1970, in press). 
These values include task values (content, 
amount of work, smoothness, achievement), 
valued rewards for task performance (respon- 
sibility, promotions, money, and recognition), 
and conditions which are congruent with 
physical needs and which facilitate goal at- 
tainment (working conditions). The super- 
visor and the organization were found to have 
the most pervasive influence as both value 
facilitators and value blockers, whereas the 
other human agents were seen as instrumental 
in attaining only particular values, for ex- 
ample, the self and achievement, subordinates 
and smoothness, customers and recognition, 
and unions and money. 

Interpersonal relations, associated mainly 
with co-workers and supervisors, seem to en- 
tail valuing the agent as an end in himself 
rather than for his functional value. This 
type of relationship has been described by 
Locke (in press) as an entity relationship to 


h it from the functional relation- 
the type previously described. En- 
tionships did not prove to be of great 
ince to the employees in this sample. 
val issue concerns the reasons for the 
satisfaction among the unskilled and 
employees in this study as compared 
hose at the same job level in different 
tions. One could begin by asking 
er the item used to measure satisfac- 
ictually measures it. This item corre- 
37 (p< .001) with the total of the 
fa - scales for all subjects combined. While 
th correlation was not very high, the single 
ity suffered from restriction of range and 
m ht not be highly reliable for a given in- 
di dual: however, this should not be the 
cı when groups are compared. Also, the 
dem was the same one used in previous stud- 
ies, What then accounts for the different 
results? 

With respect to pay, the gross wages of 
solid waste employees were somewhat higher 
than those of employees in other industries. 
The gross weekly base pay of nonsupervisory, 
privately employed, nonagricultural workers 
in the United States in 1971 was $125 (Sta- 
tistical Abstract, 1971) as compared to a 
mean figure of $128 for unskilled and skilled 
solid waste employees combined (based on 
data obtained from management in the pres- 
ent study; see Applied Management Sciences, 
1972). The latter figure does not include 
overtime pay which raised the mean weekly 
gross pay of the blue-collar employees in this 
sample to over $150. There were also other 
respects in which the unskilled and skilled 
solid waste employees may have advantages 
over other occupations. The unskilled workers 
are typically collectors while many of the 
skilled workers are truck drivers, who may 
help with collection. (The national average is 
approximately two men per truck.) The fact 
of being alone with co-workers on a truck 
means considerable autonomy in terms of 
freedom from supervision. The fact that a 
different neighborhood is covered each day 
means that there is some degree of variety 
in the work task. In addition, the type of 
solid waste collected may differ day to day, 
from season to season, and from location to 
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location. Since the blue-collar workers are i 
charge of their own truck, there is 
able freedom to control pace of work 
even freedom to choose methods of 
(e.g., where to stop the truck, how to 
the waste containers, how to coordinate team- 
work). All of these characteristics have been 
found to be associated with high work sa 
faction in previous studies (for a summary 
see Locke, in press). In these respects 
job of solid waste collector differs 

from the typical assembly line factory job 
where close supervision, repetitiveness, and 
little freedom of choice with respect to either 
pace or method of work are characteristic. 

Finally, while the following did not show 
up in the figures of this study, there is rea- 
son to believe that solid waste collectors have 
shorter working hours than most other skilled 
and unskilled workers. Many solid waste 
collectors, especially those in the public sec- — 
tor, are on the task system, whereby they 
may leave whenever they complete their as- 
signed route for the day (Applied Manage- 
ment Sciences, 1972). This route Sometimes 
requires only 5-6 hours work to complete. 
The reasons that these employees did not 
actually report working such short hours 
(mean reported hours were 43) may be that 
they are officially paid for a 40-hour week 
plus overtime. 

While the above factors may help to ex- 
plain the greater satisfaction of the blue- 
collar employees as compared to those in 
other professions, none of these considerations 
apply to secretarial /clerical employees. The 
high degree of apparent satisfaction own 
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by this group may be partially explained by F 
their high pay. The reported weekly gross 
pay of the secretarial/clerical employees in 
this sample was $114, whereas the national 
median (mean was not available) for female 3 
employees in this category was $80 in 1969 
(Statistical Abstract, 1971). Even adding S 
10% to this to compensate for the two-year i 
difference (the present data were collected 
during 1971 and early 1972) yields a re- 
maining difference of $26 per week. This 
means that solid waste employees in the sec- 
retarial/clerical category are paid 29% 
higher than the national average- 


14 — 4 = 4 
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Since Webster's (1964) publication, nu- 
merous ‘actors influencing interviewer's deci- 
sions ave been identified. Broadly viewed, 
these any studies constitute two streams 
of research: (a) the impact of information 
characieristics including ordering of informa- 
tion ., Carlson, 1971; Hakel, Ohnesorge, 
& Dunnette, 1970; Hollmann, 1972; Rowe, 
1963; Springbett & Crowell cited in Web- 
ster, 1964; Wexley, Yukl, Kovacs, & Sanders, 
1972), and the roles of stereotypes (e. g., 
Hobel, 1971; Hakel, Hollmann, & Dunnette, 
10 Mayfield & Carlson, 1966; Springbett, 
Sydiaha, & Rowe cited in Webster, 1964). 
This article reports a study which combines 
streams, and also illustrates the impact 
of ye: another source of variance in judg- 
ments, the interviewer's conception of the 
expected typical applicant. 
It is clear that stereotypes have an influ- 
ence on interviewers’ decisions. Sydiaha 
(Webster, 1964) found that stereotypes used 
by an interviewer are not unique to the indi- 
vidual but are common to all who have a 
reasonable amount of experience in a specific 
employment situation. Rowe (Webster, 1964) 
found that the interviewer has a stereotype 
of the “good” rather than the “poor” worker 
and Sydiaha found that they all tended to 
the same information. A possible 
ion for this generality of results is 
„at the interviewers used as subjects were 


both 
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EFFECTS OF APPLICANT STEREOTYPES, ORDER, AND 
INFORMATION ON INTERVIEW IMPRESSIONS’ 


MANUEL LONDON ® axo MILTON D. HAKEL 


Ohio State University 


Carlson's study of the effects of favorability, order, and type of information 
on interviewers’ decisions was replicated, The results showed that unfavorable 
information is more important to interviewers’ ratings than favorable informa- 
tion. However, this effect is moderated by the expected typical applicant—an 
expected typical applicant of high caliber leading to a more favorable first 
impression of unfavorable information. Unfavorable information showed 
contrast effects moderated by the type of information, Interviewers’ confidence 
in evaluations was greater for favorable than for unfavorable information. 


individual interviewers. It is likely that inter- 
viewers from widely divergent backgrounds Ç 
having had different experiences might have , 
different stereotypes and 4 
different decisions. ` 
Interviewers in divergent situations are 
exposed to differing populations of applicants, 3 
and even if the applicants come from the 
same population (as within a local labor 
market at a single company location), inter- 
viewers see different samples of applicants. — 
These population and sample differences may 
be manifest in interviewers’ perceptions as 
differences in expectations about the overall 
level of qualification of the typical applicant. 
The expected typical applicant, similar to the 
stereotype of the ideal applicant in that both 
are abstracted from experience, can be con- 
ceptualized as the interviewer's estimate of 
mean level of overall qualification of a sample 
of applicants. Thus, while interviewers may 
share the same ideal applicant stereotype, 
their differing experiences may have produ 
differing expectations about the typical appli- 
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cant. The expected typical applicant could 
function as an anchor, or perhaps even the 
adaptation level, on an interviewer’s overall 
evaluation continuum, and judge to judge dif- 
ferences in the location of that anchor could 
be related to interjudge disagreement. This 
study investigates both ideal applicant stereo- 
types and expectations of the typical appli- 
cant as determiners of interview impressions, 
using Carlson's (1971) methodology as a 
Starting point for replication and extension. 
Carlson (1971) tried to determine to what 
extent an initial valid impression established 
by knowledge of a test score is altered by 
_ Subsequent information differing in favor- 
ability, order of presentation, and quality of 
preceding applicant. He found that informa- 
tion which is consistent with an interviewer's 
initial impression added less to his total im- 
pression than information of opposite value 
_ detracted from it. Further, the effect of the 
Same unfavorable information was greater 
after a positive impression had been estab- 
shed than was the effect of positive infor- 
mation after a negative impression had been 
established. 
Carlson's results are in line with Spring- 
bett's finding, reported by Webster (1964), 
a that interviewers are more influenced by un- 
favorable than by favorable information. It 
_ Seems that the interview is primarily a search 
lor negative evidence, since the unfavorable 
impression was followed by rejection in 90% 
of the cases investigated. Rowe (Webster, 
1964) found similar results with unfavorable 
and favorable typewritten descriptions of 
candidates. 
This experiment examines the joint impact 
of several independent variables: experimen- 
tally induced ideal applicant stereotypes, 
_ experimentally induced typical applicant ex- 
_ pectations, order of information, favorable- 
ness of information, and type of information, 
The type of information evaluated was sub- 
jective self- descriptions given by the ratees 
versus a valid selection test score given at 
the end of the interview. 


os 


METHOD 
Subjects 


Webster (1964) commented with some exaspera- 
tion on the difficulties of getting experienced inter- 
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viewers to adopt a common 


clerence, 

Therefore, eighty inexperienced ents em 
rolled in introductory psycholo and an 
upper level psychology course o State 
University served as raters. No is made 
for sex. 
Experimental Design 

Four induction combinations wer r study- 
ing the impacts of ideal applicants vical ap- 
plicants: (a) high qualified ideal nt, high 
qualified expected typical applicant (H b) high 
qualified ideal applicant, low qualified ıl appli- 
cant (HL); (c) low qualified ideal int, high 
qualified typical applicant (LH); and w quali- 
fied ideal applicant, low qualified ty ipplicant 
(LL). Within each of these comi s there 
were four types of information pr ion de- 
scribing two applicants. The sequences vorable 
and unfavorable information were cl o allow 
for examination of relevant order efie ach ap- 
plicant consisted of two Paragraph de ons and 
a test score described as a valid achi nt test 
(the test score was given last so as not found 
the initial effect of the stereotypes). T orabil- 
ity of the test score corresponded to t orabil- 
ity of the first paragraph of information he first 
applicant presented while the favorab of the 
test score for the second applicant cor: ded to 
the favorability of the second paragraph < forma- 
tion. The possible sequences of favorabili: of each 
description for the two applicants presente: io each 
subject were as follows: fff, uuu; fuf, uff u, fuu; 
uuu, fff (u and f indicating favorability) 
Materials 

Descriptions of the ideal person for the job and 


the type of employee to expect were given to the 
interviewers. As a manipulation check and in an 
effort to get the interviewers to concentrate on and 
“internalize” these descriptions, they were asked to 
evaluate them on the same response scales as they 
would later use in the experiment. A 2 X 2 analysis 
of variance performed on this data indicated signifi- 
cant differences in favorability among the ideal 
applicant and the expected typical applicant descrip- 
tions, 

The favorable and unfavorable items of informa- 
tion consisted of hypothetical self-descriptions of an 
employee’s previous morning on his present job. 
The employees’ descriptions were selected on the 
basis of ratings by 33 judges drawn from the target 
Population. Ratings were made on the scales used in 
the main study. The descriptions were selected for 
use in the experiment according to the following che 
teria: (a) Their respective scale values differed sig- 
nificantly from the neutral point (0) but in opposite 
directions from it. (b) Favorable and unfavorable 
descriptions did not differ significantly in terms of 
absolute distance from the neutral point. (c) The 
descriptions conceivably could have referred to the 
same individual (Briscoe, Woodyard, & Shaw, 1967). 
The descriptions were systematically randomized for 
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ale measure such that no subject received 
d the descriptions in the same order. 
csponse sheets contained thirteen 7-point 
i the interviewers’ impressions of how well 
ployee met the job requirements listed on 
les. The scores ranged from 1 (very poor) to 
good) and a sum of all 13 scores was taken 
total rating. An overall rating and the conf- 
the rater had in his decisions were abo taken 
ch description, each on a 7-point scale. 


dure 


ating is that feeding information to a judge piece 
y piece affects the decisions (Webster, 1964). A 
judgment is both different and better when informa- 
tion is given to a judge simultaneously than when 
it is fed to him piece by piece. The studies cited by 
Webster indicate that it makes no difference in the 


effects oc- 


information. Briscoe 
to be true of unfavorable information when their 
data were examined for 
from the favorability of 
interpolated mode of response d 
result in recency effects. 


Analysis 

The design of the experimental material was a 
2X2X4XK%2X3 factorial. The independent varia- 
bles were the ideal applicant stereotype, the expected 
typical applicant, the four orders of information 
- favorability combinations, the position (first or 
L cond) of the two applicants rated by each sub- 
ject, and the three paragraphs of information for 
each applicant rated. Factorial analyses of variance 
with repeated measures on the last two factors were 
performed on the treatment combinations for the 
total, overall, and confidence ratings. 


RESULTS 


Means and standard deviations were com- 
puted for the totals of the individual ratings, 


INTERVIEW IMPRESSIONS 


7* 


the overall ratings, and the confidence rat 
ings. The Pearson product-moment ce 
tion for the total score and the overall 
was 8, and therefore only the results 
the total scores are presented (sce London, 
1972, for means, standard deviations, and 
other results). f 

The results of the analysis of variance of 
the totals scores are given in Table 1 which 
includes the values of omegas squared (WI), 
the proportion of variance accounted for by 
the independent variable (Hays, 1963). 

The analysis of variance on the totals o 
raters’ evaluations for each descriptio 
showed that the interaction of order of infor- 
mation favorability with applicant position” 
(first or second) accounted for the largest 
amount of variance: 45%. Differing combi- i 
nations of paragraphs and order of informa- ö 
tion favorability accounted for 11% of we 
variance. Levels of ideal and typical appli- 
cants in combination with differing para- 

and orders of information favorability 

also showed significance but accounted for 4 
much smaller percentage of variance. The 
significant effect of the ideal applicant condi- 
tions indicates that the ratings were not made 
merely on a basis of each description as in 
the prescale ratings. 

Several post hoc hypotheses were devel- 
oped and tested for statistical significance. 
were as follows: 


of consistent information—that is, all favor- 
able or all unfavorable: (a) The difference 
between the first two favorable descriptions 
is not statistically greater nor less than the 
discrepancy between the second and third 
descriptions for consistently favorable appli- 
cants. (b) The difference between the first 
two unfavorable descriptions for a consistently 4 
unfavorable applicant is statistically greater 
than the discrepancy between the second and x 
third unfavorable descriptions. 1 
2. For hypothetical applicants constructed 
ol inconsistent information—that is, fuf, uff, * 
ufu, and fuu: (a) Unfavorable information 
has a statistically greater effect on a raters 
evaluations when presented after rather than 
before favorable information when summed 
over all ideal applicant and typical applicant — 
conditions. (b) A comparison between the 
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TABLE 1 


ANALYSIS OF VARIANCE FOR TOTAL SCORES FROM THE 13 RATING Scar: 
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Source | Error df | MS 
Ideal applicant (A) F 1 2851.88 6.60" 
Expected typical applicant (B) F | 1 795.68 1 
Favorability combinations (C) | F | 3 1097.77 2.54 
i Position (D) | DXF 1 53.33 | 1 
i Paragraphs of information (E) EXF } 2 123.73 1.74 
r AXB | F 1 26.13 06 
AXC F 3 490.38 1.13 
BXC F | 3 1035.78 2.40 
r AXD PD er 1 1235.21 2.92 
BXD | DXF 1 2.41 01 
A CXD DXF 3 35249.41 83.2 
2 AXE EXF | 4 63.53 89 
5 BXE E X F AES 187.05 2.63 
l CXE EX F 12 2034.21 28.63 
A AXBXC F 3 499.24 1.15 
5 AXBXD DXF 8 403.33 95 
H AXCXD DXF 3 41666042 3.92 
4 BXCXD DXF gr] 288.79 08 
Subjects (F) 64 432.36 
i AXBXE | EXF 4 60.55 98 
4 AXCXE EX E 12 47.09 60 
4 BXCXE E N F 12 207.50 2.9257 
2 AXBXCXD DXF 3 595.23 141 
DXF 64 423.37 
AX EN CNX E EXF 12 58.24 82 
E F 256 7106 | 


i. 


high. Favorable information, however, did not 
seem to be altered by either induction.) 
These hypotheses Were tested by the 
Scheffé method of orthogonal Contrasts (Hays 
1963). Except for hypothesis 1(a), all were 
significant at the .05 level. Thus, unfavorable 
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information does have a greater effect th 
favorable information though the effect of un- 
favorable information is modified by 

the ideal applicant and expected typical aj 
plicant conditions, Unfavorable informati 


Discusston 


The original analyses and the post hoc 
analyses yielded the following conclusions: 
1. An interviewer who expects to find a high 
caliber applicant gives applicants higher rat 
mgs. This was shown by the significant dif 


nces among interviewer judgments for 
ideal applicant conditions and by the 
nificant interaction of expected typical 
plicant with favorability combinations and 
ragraphs. The post hoc analyses, however, 
vowed that it is the unfavorable information 
hich diminishes in effect when an inter- 
ewer has the concept of a high caliber typi- 
ıl applicant and even more so for a high 
leal applicant also, but the effect of favor- 
ble information remains unchanged by 
tereotype condition. The moderating variable 
n this study seems to be the expected typi- 
al applicant, a high typical applicant leading 
to more favorable ratings of unfavorable in- 
formation. It does not seem, as Carlson 
(1971) found, that the rater overreacts to 
information of opposite view to his initial 
impression (or in this case the induced stereo- 
type). But this finding does not hold for a 
low expected typical applicant making favor- 
able information any less favorable than 
when it is preceded by a high typical appli- 
cant. Initial favorable information is rated 
about equally no matter what the stereotype, 
and a candidate who is overqualified, tends 
to be rated highly regardless of stereotype. 
2. Consistently unfavorable information 
has stronger effects than consistently favor- 
able information as shown by the results of 
the post hoc analysis 15). That is, the sec- 
ond paragraph of unfavorable information 
was rated more unfavorably than the first for 
an applicant comprised of consistently un- 
favorable information following an applicant 
of consistently favorable information. This 
hypothesis was not supported in the present 
experiment for the uuu order when it de- 
scribed the first applicant rated with no 
applicant of favorable quality preceding it. 
Analysis of the first bit of information in 
this sequence for the high qualified ideal 
applicant, high qualified typical applicant 
stereotype condition indicates that unfavor- 
able information has a diminished effect after 
this stereotype is induced. It may be, then, 
that the interviewer is more unsure of his 
ratings on the first unfavorable description 
aiter rating the consistently favorable appli- 
cant or after having been induced to expect 
a high caliber applicant and therefore tends 
to decrease the ratings made after the second, 
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but leveling off for the third and Gaal 

J. Unfavorable information results in 
cency and sandwich effects, That is, fa 
sults in a more unfavorable final rating i 
a uf sequence when a favorable test score is 
given last. That is not the case, however, 
when the same ratings are compared for 
order three, the final u of the ulu order ape 
plicant being no less unfavorable than the 
final u of the fuu applicant, It may be, then, 
that when an unfavorable test score is pre- 
sented to an interviewer, the maxim ie, as 
Carlson (1971) suggests of his failing test 
scores serving as initially valid information, 
“When the selection test is failed, forget the 
man, and when the selection test is passed, 
forget the test and evaluate the applicant 
using other tools,” i 

4. The effects of induced stereotypes di- 
minish or are altered by subjective informa- 
tion as the rater evaluates the applicant de- 
scriptions. Such distinctions as made above 
about the effects of stereotypes on first im 
pressions cannot be made for final ratings 
for the second applicants since no such 
trends appear in the data even though the | 
interviewers were asked to review the stereo- 
types before they read the first description of 8 
the second applicant. 

5. Finally, the analysis of variance for con- 
fidence ratings (reported in London, 1972 
and post hoc analyses on these data showed _ 
that interviewers seem to be more confdent 
in making ratings of consistently favorable — 
descriptions. This result is in line with de 
finding that while favorable information 
tends to be rated at the same level of favor- < 
ability regardless of stereotype induction, the 
effects of unfavorable information seem to be 
moderated by a high expected typical appli- 
cant. Thus, interviewers may be less sure of — 
their unfavorable ratings but believe that 
favorable applicants can succeed on the job 
no matter what the induced stereotype. s 

Since the type of methodology used in this 
study is commonly used in studies of impres- 
sion formation (cf., Hastorf, Schneider, z 
Polefka, 1970, pp. 49-52) and investigations 
of decision making in the employment inter- 
view (eg, Carlson, 1971; Wexley et fA E 
1972), it is useful to look at several prob- 
lems with this type of method. = 
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One problem is in generalizing results to 
the selection interview from studies of im- 
pression formation using a paper and pencil 
instrument with college students as subjects. 
Carlson (1971) tried to counteract this prob- 
lem by using life insurance agency managers 
who actually do the hiring and are familiar 
with what the best man for the job should be 
like and the type of person who generally 
applies. The present experiment sought to 
manipulate the stereotypes held by the inter- 
viewer and therefore used naive subjects, that 

is, college students, with no interviewing ex- 
perience. 

A methodological shortcoming with this 
type of study, pointed out by Hollmann 
(1972), is that the information units are 
neither independent nor show equivalent, 
symmetrical patterns of intercorrelation. To 
counter this, he asked raters for a base rate 
of success for the company and a probability 
of success after each item of information 

Was presented. This procedure might be al- 
| tered to distinguish the ideal applicant and 
expected typical applicant by asking inter- 
_ viewers for base rates of success of each 
Stereotype applicant and analyzing error 
Scores of each bit of information compared to 
= each base rate. This is a matter for future 
experimentation. 

Finally, there is the problem that the con- 
trast effects for unfavorable information ac- 
count for less than 1% of the variance. Wex- 
ley et al. (1972) point out that this problem 
may be due to possible ceiling and floor ef- 
fects caused by the extreme levels of suita- 
bility or that these extreme levels do not 
represent ambiguous stimuli and hence there 
was no opportunity for selective perception. 
Modifications in the stimulus materials used 
in this study might lead to shifts in the per- 
centage of accounted variance. 

In summary, this study has delineated the 
effects of ideal applicants, expected typical 
applicants, and favorability and orderings of 
information on rater’s impressions. It has re- 
viewed the importance of yet another source 
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of variance, the expected typical app! unt, in 
interviewer's decisions. 
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COMPARABILITY OF INTERVIEWER, GROUP, AND 
INDIVIDUAL INTERVIEW RATINGS 


LARRY F. MOORE,! ann ALEC J. LEE 


Faculty of Commerce and Business Administration 
University oj British Columbia 


Interview trait ratings made by individual and groups of managers following 
exposure to videotaped interview playbacks were compared to ratings made by 
experienced interviewers participating in 34 live interviews couched in a 
typical entry-level managerial employment setting. A convergent and dis- 
criminant validity analysis was made to assess the adequacy of the interview 
rating procedure. Mean ratings made by groups of managers were similar to 
mean ratings made by the interviewers. Agreement between the two dis- 
similar rater types using direct (face to face) and indirect (videotaped) 
stimuli indicated that perceptual distortion was not a strong factor. Individual 
managers’ mean ratings suggested a leniency effect, 


Individual differences among interviewers 
have been shown to account for some of the 
overall differences in interview judgments 
and sources of interviewer-decision bias that 
have been documented (Ulrich & Trumbo, 
1965; Wright, 1969). A common strategy for 
countering the effect of individual differences 
in decision making is to combine individuals 
into groups. Group versus individual decision 
making and problem solving has been studied 
in a wide variety of settings and has been 
extensively reviewed (Lorge, Fox, Davitz, & 
Brenner, 1958; Maier, 1967; Hall & Wil- 
ams, 1970). With some notable exceptions 

Bouchard & Hare, 1970; Dunnette, Camp- 
boll, & Jaastad, 1963; Campbell, 1968) most 
the research supports group over individual 
judgment. : 

though interviewing by groups (panel 
interviewing) has been used in military and 
industrial settings (Office of Strategic Ser- 
vices, 1948; Taft, 1959) the effectiveness of 
group compared to individual decision making 
has not been examined in the context of the 
employment interview. One feasible method 
for achieving group decision making in the 
employment interview is to record the inter- 
view on videotape for later playback to a 
group of judges who make a collective deci- 
sion. The purpose of the present study was to 
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compare ratings made by individuals and 
groups of managers following exposure to 
videotape recorded interview playbacks with 
ratings made by experienced interviewers par- 
ticipating in live interviews in a typical en- 
try-level managerial employment setting. 


METHOD 


Research Site and Subjects 


The interviews forming the basis of the study 
were conducted at the western headquarters of a 
large Canadian commercial bank. Thirty-four gradu- 
ating students of a university business school served 
as interview subjects. The median age of the sub- 
jects (all male) was 23 years and their mean grade 
level was B with a range from C to A—. The mean 
number of previous employment interviews experi- 
enced by the subjects was 5.5 and all had undergone 
at least one interview with another company. 

Each subject participated in a 25-30 minute semi- 
structured employment interview (initial screening) 
with an interviewer from the bank’s staff. Six inter- 
viewers each interviewed five or six different sub- 
jects. All interviewers had been conducting college 
employment interviews as a major part of their 
work roles during at least the previous six months 
and all had received either a one- or a two-week 
formal course on interview methods. Thus, the inter- 
viewers were considered to be experienced and 
trained in a manner typical of many business firms. 
The interviewers followed the bank’s usual interview 
format designed to elicit attitudinal, behavioral, and 
factual information not attainable elsewhere in the 
selection process and to provide the interviewees 
with information designed to enhance the bank’s 
image. All 34 interviews were recorded on one-half 
inch videotape using split screen staging to allow a 
head and upper body playback presentation of the 
interviewer and interviewee simultaneously. 
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TABLE 1 
MEans, STANDARD DEVIATIONS, AND SIGNIFICANT DIFFERENCES FOR 
INTERVIEWERS’, GROUPS’, AND INDIVIDUALS’ RATINGS 
= SS — — 
| Ratings 
Trait Interviewers Groups Inc 

| F | <3 ry 

CCC vs. 
Attitude 3.20 | .69 | i | | 3.53 2.05* 
Appearance 2.94 49 3.00 42 3.27 2.85% 
Interest 2.85 61 | 3.00 78 3.12 
Intelligence 3.12 33 2.87 3.40 50 3.47 3.41 
Leadership 3.12 4 3.15 70 | 3.27 
Maturity 3.20 48 3.27 62 2.29˙•3.62 6 2.97% 
Motivation 3.15 50 3.12 81 3.27 5 
Persuasiveness 3.00 43 3.15 -66 | 3.15 6 
Self-confidence 3.17 82 2.02* 3.44 56 3.4161 
Self- expression 3.17 58 3.21 59 3.50 6 2.15* 
Sociability 522 3 | 3.44 | 6c 
Potential 2.97 58 2.85 86 3.09 71 
Overall 3.00 43 3.12 60 3.27 57 18* 

3 ise (two-tailed) is for significant difference, Abbreviations: I = interviewers, G = 8 and I’ x ir ls 

“p <0. 

The subjects were rated on S. point graphic scales ing their judgments. Thus, in realistic yn, the 
with equally spaced intervals labeled excellent, su- study examined ratings based on interviews plus 
perior, average, marginal, and unsatisfactory. Judg- ancillary data. An investigation of the mpleted 
ments were made on 12 dimensions (traits): atti- Standardized application blanks did not reveal any 


tude, appearance, interest, intelligence, leadership, 
maturity, motivation, Persuasiveness, self-confidence, 
self-expression, sociability, and potential. Each trait 
_ to be rated was amplified by definitional phrases or 
Sentences. A separate overall rating was made using 
the 5-point graphic scale. 


discussion following videotaped 
-four line mana- 


systematic differences in interviewee bac kgrounds 
likely to bias the rater comparisons. To test for 
intrasample bias, one-way analyses of variance were 
calculated for the ratings made by each interviewer, 
group and individual in each of the five or six suc- 
cessive interviews rated (overall trait rating only). 
No significant intrasample differences were found 
(interviewers F = 74; groups F = 1.67; individuals 
F= 66, df =5/28). 

Mean and standard deviations were computed for 
the 12 interview traits and for the overall trait 
Scores derived from each rating method, then the 
means were compared. A convergent and discrimi- 
nant validity analysis (Campbell & Fiske, 1959) 
Was made to compare more fully the three interview 
rating methods and to examine the adequacy of the 
general interview technique used by the bank host- 
ing the study. 


RESULTS 


Table 1 presents means and standard devi- 
ations of the interviewer, group, and indi- 
vidual ratings of the interview subjects. Mean 


trait ratings by interviewers following live 


Interviews agreed closely with the mean rat- 
Ings of groups following videotape playback 
and discussion, The mean interview ratings 
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TABLE 2 
CONVERGENT AND Discknmnanr Vauinrnies or INTERVIEWERS Geovrs, 


INTERVIEWERS-INDHVIDUALS, 


Heteromethost Nock 


Interviewers individuals 


AND INpivipvaLs-Gaours 


Individuals grown 


were significantly higher than the group rat- 
ings for intelligence and self-confidence, Indi- 
vidual managers differed from the interview- 
ers to a greater extent. The mean individual 
ratings were significantly higher than the 
mean interviewer ratings for six traits. In- 
spection of the mean interviewer ratings show 
them to be uniformly lower than the mean 
individual ratings and lower than all of the 
Corresponding mean group ratings except mo- 
tivation and potential. This pattern may re- 
flect a tendency reported by Rowe (1963) 
for experienced interviewers to be more criti- 
cal. 

Table 2 summarizes the analysis of con- 
vergent and discriminant validity following 
the Campbell and Fiske (1959) procedure. 
Essentially, the procedure enables a detailed 
analysis of intercorrelations of trait scores 
among and between traits within and among 
methods when more than one method is used 
to measure more than one trait. The Camp- 
bell-Fiske design provides one indication of 
convergent validity and three indications of 
discriminant validity. 

Convergent validity is indicated when two 
or more independent measures tend to agree 
in the measurement of a given variable 
(trait). In Table 2 the correlations between 
methods are shown in columns 1, 4, and 7. 


Interviewers-groupe 

— whe — j l — 
| No. Ne. | l Na 

Vatidity ahat aeg | Validhy | Hishest aasee vnde, ge, NO nie 

coefficient ba os values at Po val | conffictent —_ valses 
= bers f higher» 
A ae S6 7 me | s 6 as s 7 
* 40" W 0 11 4 Bl ene * 5 
! ss 47 0 * 4 1 i wv ts 16 
h 25 48 n 0 M æ | o | a w 
f oor 40 o P 4 1 8 7 
M 32 “ 9 10 s7 ‘ 11 e 
M i 16 54 18 26 AA ea 2% | 8 * 
P es | 32 53 11 0 eee. E w | a | n 
8 we | 45 4 0 1s w 14 s a j 7 
Sel on sie wo 1 “ 1 6 o 8 | a 
So 4 0 7 * 4$ 2 oO ói 24 
P. 42° 42 0 w 57 10 sie ót 1 
O 10 $3 2 * 80 s uf as 10 
Median 42 48 26 45 26 é | 
ber of heteromethod off-diagonal coefficients in corresponding row and column higher than validity coefficient ; maximum 


Minimal requirements for convergent validity 
were met for 10 of 13 traits in the interviewer- 
group comparison (7 significantly different 
from 0); however, the median validity co- 
efficient was low. In the interviewer-indi- 
vidual and individual-group comparisons less 
than half the traits satisfied the criterion for 
convergent validity. 

The first test for discriminant validity re- 
quires that the validity coefficient for a trait 
measure by two different methods should be 
higher than the correlations between that 
trait and other traits measured by different 
methods (i.¢., having neither method nor 
trait in common). This criterion is met by 
five traits in the interviewer-group compari- 
son but is not met by any trait in the inter- 
viewer-individual or individual-group com- 
parisons. 

The second test for discriminant validity 
requires that measures of a given trait made 
with independent methods correlate higher 
than correlations between the given trait and 
other traits when measured by a common 
method. This test indicates the strength of a 
trait across methods as well as its indepen- 
dence from other traits. All heteromethod 
trait intercorrelations were exceeded by at 
least 50% of the intertrait correlations within 
methods except for appearance, interest, and 
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leadership, all within the interviewer-group 
heteromethod block. These were exceeded by 
8%, 25%, and 11% respectively. Clearly, 
the second test for discriminant validity was 
not met. Intercorrelations between traits 
within the three monomethod blocks were 
quite high (for interviewers, median r = 40; 
for groups, median r = .53; for individuals, 
median r = .45), indicating that most of the 
traits themselves may have lacked definitional 
clarity. 

An examination of the patterns of trait 
intercorrelations within and between rating 
methods for similarity constitutes the third 
test for discriminant validity. No pattern 
similarity could be identified, thus placing 
further doubt on the definitional clarity of 
the traits used by the bank. 


Discussion 


The results indicated that mean trait rat- 
ings made by groups of managers after ex- 
posure to videotape playbacks of candidates’ 
interviews are highly similar to those of inter- 
viewers having the typical level of training 
and experience. Perception biases often asso- 
ciated with interviewer judgment such as 
stereotyping, early closure, or data seeking 
operating singly or cumulatively did not ap- 
pear to produce a differentiating effect. Hakel, 
Ohnesorge, and Dunnette (1970) have found 
that the effect of perception bias may be 
negligible in some instances. 

The tendency for individual managers to 
rate relatively high suggests a leniency effect. 
Untrained raters have been shown to be less 
critical than experienced interviewers (Rowe, 
1963). Although the individual managers used 
as raters in this study had been involved in 
interview decisions in the past, none had pre- 
viously received formalized training in inter- 
viewing or decision making, Further research 
is needed to determine the precise way in 
which group discussion influences decision 
making in interview rating. Holloman and 
Hendricks (1971) have demonstrated the 
error-correcting propensity of social interac- 
tion in group decision making in other set- 
tings. Group discussion may influence un- 
trained raters to be more critical in evaluating 
interviewees. 
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Interview reliability and validity ysis 
is facilitated via videotape becaus the 
replay feature. Although design lin ations 
precluded investigation of intrarater abil- 
ity in this study, tapes of three oi ank's 
screening interviews with associat: plica- 
tion blanks were shown to group three 
(n = 27) and individual (n = 104 ident 
judges at two viewing sessions se; d by 
a one-week interval. Intrarater « tency 
differed considerably across the 13 rview 
variables but correlation coefficient high: 
as .70 to .80 were reached alth the 
medians were .54 and .59 for group indi- 
viduals respectively.“ 

The convergent and discriminar lidity 
analysis raised serious questions the 
interview process itself. The conver: ilid- 
ity coefficients were quite low throu in- 
dicating large amounts of unexpla vari- 
ance between rating methods on m raits. 
The analysis for discriminant va re- 
vealed generally high intertrait cor. ations 
within methods, but no common pa! ex- 
isted among methods. Suspicion was t s cast 
on the traits themselves; their defi: ‘ional 
clarity, degree of overlap or commonal and 
their realism and relevance to the aters. 
Other researchers (Howell & Vincent, ‘970; 


Hakel & Schuh, 1971) have begun to define 
interviewee attributes that are suitable for 
interview assessment. Further research is 
needed to determine the relative effects of all 
sources of undesirable influence on the inter- 
view and on the interview process. Some are 
apparently more critical than others. 


? Further details of this reliability investigation 
will be supplied by the authors on request. 
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PARTICIPATION EFFECTS IN APPRAISAL INTERVIE\ 
CONDUCTED IN A TRAINING SITUATION 
JOSEPH M. HILLERY anno KENNETH N. WEXLEY 2 


University of Akron 


Participation, order, and interviewee personality effects were studied in ap- 
praisal interviews conducted as a part of a training program. Results showed 
that nonparticipative appraisal interviews brought about greater behavior 
change and interviewee satisfaction than participative interviews. Order of 
presentation did not effect interview outcome, and personality variables were 
not found to moderate the effectiveness of type of interview. The importance 
of the training situation was discussed in relation to the results of the study. 


The desirability of allowing subordinates 
to participate in their own performance ap- 
praisal interviews has been voiced for a num- 
ber of years (eig., Kindall & Gatza, 1963; 
Likert, 1959; McGregor, 1957). McGregor 
(1957) was the first to outline an entire per- 
formance appraisal approach in which the 
subordinate takes an active part. Studies 
which have experimentally manipulated the 
amount of participation in appraisal inter- 
views (French, Kay, & Meyer, 1966; Wexley, 
Singh, & Yukl, 1973) have supported the 
notion of the beneficial aspects of participa- 
tion. French et al. (1966) found that higher 
levels of participation produced improve- 
ments in the supervisor-subordinate relations. 
Wexley et al. (1973) showed that participa- 
tive appraisal interviews resulted both in 
greater subordinates satisfaction with the in- 
terview and greater intention to improve fu- 
ture job performance. 

While Maier (1973) has generally sup- 
ported the participative type of appraisal in- 
terview, he believes there are situations in 
which a nonparticipative approach is more 
appropriate. Maier (1958) contended that 
the tell and sell method (a nonparticipative 
approach) has its greatest potential with new 
and inexperienced workers. Furthermore, 
Maier (1973) held that criticism, typically 
given in the nonparticipative but not in the 

+ This study is based on a dissertation submitted 
by the first author to the Department of Psychol- 
ogy, University of Akron, in partial fulfillment of 
the requirement for the PhD degree. 

The authors wish to thank Isobel Pfeiffer for her 
cooperation and assistance, 

Requests for reprints should be sent to Kenneth 


N. Wexley, Department of Psychology, University 
of Akron, Akron, Ohio 44325. 


participative interview, is most li} to be 
accepted by the subordinate when is still 
learning the job. Acceptance of c` cism is 
important since most workers re defen- 
sively to criticism, resulting in J. ort to 
improve performance (Kay, Meye: rench, 
1965). The major purpose of this ly was 
to see if the beneficial aspects of p: pation 
in appraisal interviews found wii xperi- 
enced workers also holds for appr: inter- 
views with trainees. Interview ef- iveness 
was measured by performance imp vement 
and interviewee satisfaction with ‘\- inter- 


view. 
Two additional variables were examined in 


the appraisal process. First, persona fac- 
tors of self-esteem, authoritarian and 
need for independence have been found to 


moderate the effectiveness of various styles of 
leadership (Kay et al., 1965; Vroom, 1959). 
However, not all studies investigating these 
variables have obtained similar results (Tosi, 
1970; Wexley et al., 1973). For example, 
Wesley et al. (1973) found that it was de- 
sirable to allow a subordinate to participate 
in the appraisal process regardless of his 
personality structure. In the present study, 
first, all three personality variables and their 
relationship with interview effectiveness were 
examined. Second, order effects in the non- 
participative interviews were examined to see 
if it made any difference if the ineffective 
items were presented first, last, or sandwiched 
in between effective items. 


METHOD 
Procedure 


Appraisal interviews were conducted as part of a 
teacher-training program known as microteaching. 
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 Microteaching is similar to an actual classroom situ- 
“ation except that it is scaled-down in terms of 
teaching time and size of class. It is a widely used 
(Allen & Ryan, 1969) and effective (Kallenbach & 
Gall, 1969) method of teacher training. 
Four different ways of presenting feedback in 
the appraisal interview comprised the four experi- 
mental conditions. The four conditions included one 
participative and three nonparticipative types of 
appraisal interviews. The three nonparticipative 
interviews differed in the order in which the two 
effective and two ineffective behaviors were pre- 
sented: (a) the effective behaviors presented first, 
then the ineffective (EI); (b) the ineffective then 
> effective (IE); and (c) one effective, two ineffec- 
"tive, then the remaining effective (EIE or com- 
monly called the “sandwich technique”). 
Each of 16 teachers was exposed to all four ex- 


balanced order. 
total of 64 teach-interview-reteach sequences (ien, 
1 sequences for each of the 16 

teaching session (teach) consisted of 
teaching three high school students 4 10-minute 
lesson aimed at the completion of one task. After 
the teaching Session, two supervisors were given 10 
minutes to confer and decide which two effective and 
two ineffective behaviors were to be fed back to 
the teacher. The appraisal interview consisted of a 
10-minute discussion between one supervisor and 
the teacher using one of the four types of inter- 
views. The teacher was then given a half hour to 
prepare for his reteach sequence. The reteach se- 
quence consisted of the teacher reteaching the same 
10-minute lesson, but to a different group of stu- 


dents. 


supervisors filled out 
task. The checklists were 
E—I/E+I, where the Es are 
haviors checked and the Is are the 
haviors checked. After each appraisal interview, the 
teacher filled out a graphic rating scale indicating 
how satisfied he was with the interview. 


Subjects 


Three groups of subjects were used: (a) 16 
undergraduate education majors who taught the les- 
sons (teachers); (b) 25 graduate students who ob- 
served the performance of the teachers and who 
conducted the interviews (supervisors) ; (c) 12 
high school students who were taught the lessons 
(students). 
The teachers we 
_ jors enrolled in an undergra 
visors were experienced elemen 
teachers enrolled in a graduate course in 
The microteaching experience was included in the 
training of the teachers and supervisors. The stu- 
dents being taught the lessons were 12 high school 
Students paid to participate in the project. 


Participation EFFECTS IN APPRAISAL INTERVIEWS 


Materials 


Self-esteem, The instrument used consisted of a 
set of 13 bipolar adjectives. Each teacher twig 
filled out a 13-bipolar-adjective questionnaire, 


Levinson, and Sanford (1950) and subsequently 
adapted and used by Vroom (1959). Adorno et 
report reliability studies of this instrument 
average reliability coefficients of .90. 
i . The teachers’ need 
independence was measured by 
questionnaire used by Vroom (1959) and 
from a larger instrument employed by Tann T 
and Allport (1956). Vroom reported that the teste 
retest reliability over a seven-month period of a 
shortened form of the questionnaire was 61 for 90 
supervisors. 

Checklists. Five separate checklists were devel- 
of the five teaching tasks. Each 


tive and 10 ineffective. The items were obtain 
from 20 experienced teachers familiar with each 
the five teaching tasks. Additional behavioral items 
were adapted from the evaluation guides published 
with the teaching tasks (Davis, Gregory, Kyselka, — 
Morse, & Smoot, 1970). To estimate the reliability 
of the checklists, a ratio was computed between the 
number of instances in which two raters agreed ; 


(both raters checked or failed to check an item) to 
the total number of agreements plus disagreements. 
The percentage of agreement for the overall checklist 
was 80%, while the percentage for the checklists 
scored only for the four fed-back items was 76%. 
Teaching tasks. The five tasks, taken from Davis t 
et al. (1970), included clarity of instructional ob- 
jectives, presentation, interaction, questioning, 2 
c refocusing. These tasks were considered to 

be important dimensions of effective teacher pere 
formance. 
Satisfaction. The teacher's satisfaction with the 
appraisal interview was measured by averaging his 
responses to the following three questions: (a) How 
effective was the conference in covering the im- 
portant aspects of your “teach” behavior? (b) If 
you were to be the supervisor in an ideal con- 
ference, how similar would it be to the conference 
you just had? (c) How satisfied were you with the 


conference? 
RESULTS 


Table 1 presents the means and standard 
deviations for the participative and nonpar- 
ticipative interviews on each of the following 
three dependent variables: (a) the checklist 
scored for all behaviors checked (overall 
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TABLE 1 


MEANS AND STANDARO DEVIATIONS ror Participative 
AND NONPARTICIPATIVE INTERVIEWS ON THREE 
DEPENDENT VARIABLES 


Dependent variable x SD 
Overall checklist 

participative | 04 27 

nonparticipative 29 | 10 
Fed-back checklist 

participative 17 2 

nonparticipative 48 A7 
Satisfaction 

participative | 3.60 42 

4.21 | 43 


nonparticipative | 


checklist), (b) the checklist scored only for 
those behaviors which were fed back to the 
teacher (fed-back checklist), and (c) the 
overall measure of the interviewee’s satisfac- 
tion with his interview. The scores for the 
overall checklist and fed-back checklist rep- 


resent gain scores from pretest (teach) to 


posttest (reteach). The nonparticipative in- 
terview scores are the gain scores for all 
teachers averaged across the three nonpar- 
ticipative interviews, Analyses of variance 
revealed that the nonparticipative interviews 
brought about significantly greater perform- 
ance increases on the overall checklist OS 
9.53, df = 1/15, p < .01) and the fed-back 
checklist (F= 7.59, df= 1/15, p< 05) 
than did the participative interviews. Also, 
the interviewees were significantly more satis- 
fied (F = 12.72, df=1/15, p<.01) with 


r- the nonparticipative interviews than with the 


participative ones. 

Results of the comparisons examining order 
effects indicated that no one order of presen- 
tation was found more effective or satisfying 
than any other presentation order. This find- 
ing is contrary to the advice often given in 
business journals that one order of presenta- 
tion is superior to another (eg., Leskovec, 
1967). 

Differences in teacher self-esteem and au- 
thoritarianism showed no main effects and no 
interactions with the amount of participation 
or the order of presentation. On one depen- 
dent variable (the overall checklist) there 
was a main effect for teachers’ need for inde- 
pendence (% < .05). Those persons who had 
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a low need for independence sho signifi- 
cantly more improvement than di: e per- 
sons who had a high need for in dence. 
In no case, however, did the ne: inde- 
pendence interact with the amou f par- 
ticipation or the order of present 
Discussion 

From the results, it is clear tha e non- 
participative interviews were mor: fective 
than participative interviews in lucing 
improved teaching as well as re ng in 
higher teacher satisfaction with the view. 
These results do not support th: vious 
experimental studies examining ap al in- 
terviews (French et al., 1966; Wes! et al., 
1973) that have consistently show e su- 
periority of participative approac The 
most notable difference between two 
studies and the present one is that two 
previous studies dealt with real-life -© prais- 
als conducted with experienced rkers 
(French et al., 1966) or with stude role 
playing experienced workers (Wexley -t al., 
1973), while the present study involved in- 
experienced people in a training setti The 
teachers in the present study were desirous of 


being evaluated by the supervisor and were 
noticeably disappointed when asked to par- 
ticipate in their own evaluation. On several 
occasions, the teachers asked the supervisors 
for direct feedback after a participative inter- 
view had been conducted. This was indicative 
of the teachers’ dissatisfaction and frustra- 
tion with the participative approach in the 
training situation. The results from this 
study lend empirical support to Maier's 
(1958) notion that a tell and sell appraisal 
interview (i.e, nonparticipative) has its 
greatest effect with young and inexperienced 
workers. 

The personality variables of self-esteem, 
authoritarianism and need for independence 
were not found to moderate the effectiveness 
of the participative and nonparticipative ap- 
proaches. These results conflict with the find- 
ings of Vroom (1959) and Kay et al. (1965), 
but support the recent findings of Tosi (1970) 
and Wexley et al. (1973). The difference in 
results might be explained by either the dif- 
ferent instruments used to measure the per- 


ariables or the totally different situ- 
which the studies were conducted. 
more research is needed to delineate 
tions under which personality fac- 
important in determining appraisal 
outcomes, 
mitation of the present study should 
|. The performance i t was 
d over a one-hour time period. Studies 
emonstrated that the beneficial effects 
ing may not persist over a long period 
© (Fleishman, 1953; Ivancevich, 1972). 
nal research studying the appraisal 


be 

m 

ha 

of 

0 

A 

ir iew in training using follow-up mea- 

« 2, however, did afford a well-controlled 
£ 5 ion in which actual behavior changes 


is clearly desirable. The microteaching 
0 be evaluated in a pretest-posttest de- 
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EXTENSIONS OF A PATH-GOAL THEORY OF 
MOTIVATION ' 


MARTIN 


Faculty of Management Studie: 


This article extends and replicates the Evans's hypothesis concerning the way 
in which the behavior of the superior affects the subordinate’s perceptions of 


expectancies and instrumentalities in 


the three moderators (the subordinate’s locus of control, the subordinate’s 
position in a web of role relationships, and the supervisor’s upward influence) 
hypothesized, only the first was found to moderate the superior/subordinate 


relationship as predicted. Results for 


cal. Additional implications of the path-goal model were explored, that is, 
the role of motivation as: (a) an intervening variable between supervisory 
behavior and subordinate behavior and (b) as a moderator in the behavior 


satisfaction relationship. 


The seminal article of Georgopoulos, Ma- 
honey and Jones (1957) extended the expec- 
tancy theories of motivation (Lewin, 1938: 
Tolman, 1932) to the organizational scene. 
The model proposed was formalized and 


further extended by Vroom (1964) who made 


a more extensive analysis of the path-goal 
issue. He introduced two basic probability 
concepts: (a) Instrumentality, which provides 
a probability link between two outcomes, 
represents an individual’s beliefs that one 
outcome is associated with another, for ex- 
ample, that being a high performer results in 


_ higher pay. (b) Expectancy, which provides 
a probability link between behavior and an 
_ outcome, represents an individual’s beliefs 


that an outcome is associated with his be- 
havior, for example, that increased effort 
results in higher performance. Vroom sug- 
gested that motivation to perform is made up 
of two components combined multiplicatively: 
(a) the valency of high performance, the 
attractiveness of the outcome (which in turn 
is comprised of the sum, over goals, of the 
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the path-goal theory of motivation. Oi 


the other two moderators were equivo 


product of the importance of each goal and 
the instrumentality of high performance for 
the goal) and (b) the path-goal expectancies 
that effort or working hard leads to high per- 
formance. Furthermore, following Georgop- 
oulos et al. (1957), work behavior is a func- 
tion of this motivational force, subject to a 
number of constraints such as the individual's 
ability and the freedom he has to vary his 
behavior.“ 

Variants and extensions of Vroom's model 


have been tested and confirmed in a variety — 


of studies—experimentally (Graen, 1969) 
cross-sectionally (Gavin, 1970; Porter & 
Lawler, 1968; Schuster, Clark, & Rogers, 
1971); longitudinally (Lawler, 1968). In 
addition, Porter and Lawler (1968), Graen 
(1969) and Evans (1970b) have further ex- 
tended the model to show the relationship 
with job satisfaction. The path-goal theory, 


provides a rather complete framework for 


understanding motivation in organizational 
settings. 

Given that the path-goal framework has 
some support as a model of individual moti- 
vation in organizations, how and when do 
organizational factors affect motivation? Re- 
cent literature has suggested that organiza- 


While Georgopoulos et al. (1957) found their 
hypothesized relationships were enhanced when the 
individual had freedom from barriers to vary pro- 
duction, research is needed to explore whether or not 
these barriers are implicitly taken into account by 
the individual when he reports his expectancies or 
instrumentalities. 
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tic tors may affect motivation through 
the ict that these factors have on in- 
st alities and Compensa- 
tio ‘romotion, and tion (Gal- 


br 1968), supervisory behavior (Evans, 
1 task complexity (House, 1971), and 
v up behavior (Graen, 1969) have all 
sorted to have an effect on these prob- 

In addition, interactions of some of 

‘actors with one another and with the 
iual’s own characteristics may also in- 
motivation. For example, House 

) suggested complex interactions be- 
supervisory behavior, task structure 

he individual’s tolerance for ambiguity. 

ans (1970a, 1970b) has used the path- 
approach to link leader behavior with 
loyee motivation and job satisfaction, and 
proposed that two conditions were neces- 

y for the supervisor to create a situation 
which subordinates perceived strong links 
ween high-performance paths and their 


‘bordinate (thus showing tion), and 
») that he made certain that the distribu- 


erformance 
linkage model (following the path-goal 
theory) further suggests that when the goals 
were important to the subordinate and he 
saw a strong link 


former. W 
former, and there was an actual link between 
performance and reward, then i 
attains his goals and experiences job satisfac- 
tion. The model implies that the independent 
variables of the leader behaviors only relate 
to the dependent variables, performance 
satisfaction, if (a) the independent variable 
(leader behaviors) relates to a set of inter- 
vening variables (path-goal links) and (5) 
the intervening variables (path-goal links) 
relate to the dependent variables (perform- 
ance and satisfaction). Research in two or- 
ganizations tested this model (Evans, 1970b) 
and provided some 
However, there were 
the two organizations studied, and Evans 
(1970a) suggested three factors might mod- 
erate the supervisory pehavior/subordinate 


a "oS Ff 


motivation relationship: the upward $ 5 
in the hierarchy exhibited by the supervi 
(Pelz, 1952); the role conflict ex n 
by the subordinate; and the extent to 
the subordinate was internal or external 
locus of control orientation (Rotter, 196 
or the generalized expectancies of the subordi= 
nate that reward and punishment were b 

on his own behavior rather than on ra 
occurrences. 

The present study was designed to 
these speculations as well as to replicate 
extend the earlier research (Evans, 19700 
The following hypotheses were made concert: 
ing the relationship between supervisor be: 
havior and motivation: Hypothesis II. 


paths. The greater the upward influence of 
the supervisor, the more likely that he coul 
offer and deliver many rewards (considera. 
tion) to the employees, thus influencing 

and increasing their motivatio 
Hypothesis 2—The greater the initiation ol 
structure, the higher the motivation is for 
high performance paths. The higher the ole 


wards contingent on performance (initiatin 
structure). The more internally controlled the 


activity 
vation is a moderator of the behavior /s 
faction relationship. 


METHOD 
Subjects 
Subjects were 86 young managers enrolled in 


first year of a part-time master’s of business 
the University | 


a variety of functional specialities. Most subj 
(57%) had been brought up in Southern On 
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TABLE 1 


CORRELATIONS BETWEEN INITIATION OF STRUCTURE, CONSIDERATION, UPWARD INFLUENCE, | 


Coxrucr, Roux Ammourry AND ÎNTZENAL-EXTERNAL ÖRIENTATION 


— * 2 — === 
item Í 1 2 
z i 
g 1, Initiation i 
` 2. Consideration 8 
AJ. Upward influence ae S0 
4. Role ambiguity* i 12 9 
5, Role conflicts J4 agere 
6. Internal-external® —.26°* —,24° 
* 105.70 105.84 
SD 14.88 18.39 


BE High scores — — 
Rie wore tm a more external orfemation. 


Ps oO. 
p= p. 
P= wi. 


fe of them in large urban centers (i.e, the To- 
o/Hamilton, Ontario, conurbation). 


ip Behavior. The subordinate’s percep- 
of leadership behavior were measured with 
on XII of the Leader Behavior Description 
tire (Stogdill, 1968). “Upward influence” 
as measured with the “superior orientation” sub- 
ie; “consideration” was measured by the sum of 
consideration, tolerance for freedom, and inte- 
ation subscales; initiation was measured by the 
of the structure, role assumption and produc- 
tion emphasis subscales. 
Role confiict. This was measured with the paper 
nd pencil questionnaire developed by Rizzo, House 
d Lirtzman (1970). 
> This was measured with Rotter’s 
) Internal-External Control scale, which has 
extensive arg validation (Rotter, 1966; 
onald, 1969). 
lob performance. Scli-rated measures of the fol- 
ME were obtained: effort, performance, fre- 
of making suggestions to the superior, fre- 
z : following the superior’s instructions, fre- 
ency o and frequency of 
helping fellow workers, requency 


_ Goal attainment. This was measured with a ver- 

sion of Porter's (1961) instrument, tapped the 
facets of self-actualization, self-esteem, autonomy, 
social, security, pay, supervision, promotion, and 
family. The family facet was included to try to 
explore a non-work-related goal that might conflict 
ena with the work-related goals. In addition, a 
‘Managerial version of the Job Descriptive Index 
(JDI) (Warr & Routledge, 1969) was used to mea- 


3 4 5 6 

17 

2 spm 
— .26°* —.24° —.25° a 
36.16 3435 39,36 36 
6.30 5.69 7.98 8 


sure satisfaction with firm, job, pay, promo 
portunities, superior, peers, and subordin 

Overall satisfaction. This was measured with the 
Brayfield-Rothe (1951) instrument. 

One major limitation of this study lies in the fact 


that the respondents were the source of all cata. It 
is possible that individuals may have been lenient in 
rating their behavior, which restricts the range of the 
dependent variables. f 


RESULTS 
Relationship Between Initiation of Structure 
and Consideration with Motivational and 
Performance Variables 


It was hypothesized (Hypotheses 1 and 2) 
that high consideration and high initiation is 
associated with high motivation. However, a 
problem emerged in testing these hypotheses. 
For this sample, perceived initiation and 
consideration were highly correlated (r = .54, 3 
p< .001); see Table 1; hence, the original 
intention of analyzing the data through a 
two-way analysis of variance (ANOVA) was 
inappropriate. Accordingly, the technique of 
multiple regression was employed—motiva- 
tion and performance measures‘ were the 
dependent yariables, while initiation, consid- _ 
eration, superior orientation, I-E control, role 
ambiguity, and role conflict were employed as 
predictors. The results are presented in Table 
VS 


Additionally, the product of instrumentalities 
and expectancies (EPI)—a measure of motivation 
unweighted by the importance of goals—was in- 
cluded (for each activity) as a dependent variable. 


* 


pendent variable 1 

i 1 
ch performance 
vation-high performance 

rain subordinates 


: 
i 
| 
vation train subordinates | 
clp follows | 

| 


vation help follows 
suggestions to boss 
vation suggestions to bess 
lo what boss says 
tivation do what boss says | 
tort 
lo what boss says f 
icip fellows | 
[i 
i 


go 


erformance 
Train subordinates 
Suggestions to boss 


| ERE bE eobeResees| 


cternals; third, in some cases 
vas higher when role conflict or ambiguity 
was higher. 

in order to test Hypotheses 1 (second 
part), 2 (second and third parts), the regres- 
sion was run after splitting the sample on the 
three control variables: superior's upward in- 
luence, internal-external control, and role 
conflict. The results for these are 
below in Table 3.7 


sures are not 
contain a component of 
formance work. 

© It should be noted that, contrary to the findings 
of Graen (1969) and Evans (1970a, 1970b), the 
correlations between the variables and 
motivation are slightly higher 
between the independent n 
expectancies for four out of five behaviors. 
The tables on which these comments are based 
can be obtained from the author. 
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(D, Urwane wewee (UN, Dseraaean Dia reanens, Cresa HD, 
Rose Coxruer (RC), Rote Ke e (RA) 


„ Variance accounted bor by nenn warkelden 


c 1 * Ke RA ul 
* ~64 
1 0 
2 20 -48 
mo SA] -40 
N. -i sJ 
mi ~F2 #0 
mS -29 — 11 
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lated (r = .59, 9 < 001), is inconsistent, 
major effect of controlling for 

ence is that the relationship with 
tion diminishes; what relations do remain are 


TABLE 4 


Variable 
Variable Motiva- | Perform- 
tion? EPI ance 
in = 10) | œ = 6 
No control ire 2 
High-upward influence 4 1 
Medium upward influence 2 1 
Low-upward influence 3 | 1 
| 
High-role conflict ee 
Medium role conflict | 80. 4 oe 
Low-tole conflict 5 1 
Internals 10 | 1 
Externals 1 | iy 


Note. EPI = product of instrumentalities and expectancies, 
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l TABLE 4 


AND PERFORMANCE, UDING PARTIALLING Morti- 
VATION OUT OF THE CONSIDERATION-PERFORM- 


CONSIDERATION, MOTIVATION 


ANCE RELATIONSHIPS 


1. Effort | — 

2. Consideration 34 

3. EPI high performance 43 54 

4. Motivation high performance | 44 "Soret 98 


E with (EPI AS 
: . / motivation 13 
~ 
Do what boss says 1 | 2 | 3 4 
| — U— 
Do what boss says a 
Consideration 20 
„ EPI do what boss says 245% 4% 
„ Motivation do what boss says 21“ AGF ORFF 
DBS,.C/EPI -22* 
DBS,C/motivation 24 
Help fellows 1 | 3 | 2 | 4 
Help fellows — 
2 Consideration .20* = 
El help fellows 49“ 32 — 
4. Motivation help fellows 21 S eee, — 
ECV EPI 12 
HF. C/ motivation 11 
Train subordinates 1 | 2 | 3 | 4 
Train subordinates A 
Consideration 3 — 
_ 3. EPI train subordinates 33˙ A553 — 
J. Motivation train subordinates | 37% Saxe girt  __ 
IS. EPI 18 
IS. C/ motivation 14 
Suggest to boss 1 | 2 | 3 | 4 
1. Suggest to boss — 
. Consideration 30% — 
. EPI suggest to boss 14 J — 
4. Motivation suggest to boss 4, SS ogee 
3 SB. EPI 27* 
2 SB. C/ motivation 27 


Note. EPI = product of instrumentalit 


8 H> 2 u ties and expectancies; 
k HF bel el = consideration; DBS = do cals Hostage: 
D fellows; TS = train subordinates; SB = suggest to 
*p <05. 
BF LON 
* < 001. 


distributed Over all three levels of upward 
influence. There is, therefore, little support 
for Hypothesis 1 (first part), 

Role conflict as a moderator. With the mo- 
tivation dependent Variables, a bifurcation 
takes place; note that consideration and lack 

of role conflict are positively related Gs 
48, p < .001). For high-role Conflict, the 
consideration/motivation relationships remain 
high; however, for low-role conflict about 
half of the consideration / motivation relation- 
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ships remain. For the behavior < sions, 
those with moderate conflict ha onger 
consideration/behavior relations, 7 idings 
for high conflict are contrary to thesis 
2 (first part). 

Internal external control as a erator. 
Here the results for the motivation ‘ata are 
much clearer. The relationships betwen con- 
sideration and motivation remain hiv’. for the 
internals; they become weak for . exter- 
nals; so, in addition to internals hayin» higher 
motivation than do externals, they have 
a stronger relationship between su: visory 
consideration and motivation than mals. 
There is little impact on the consi¢»ation/ 
performance relationship. Hypothes (sec- 
ond part) receives support. 

These results suggest that inter have 
higher motivation and that their mo ‘vation 
is more predictable from leader havior 
(consideration), that is, they per: and 
respond to environmental contingen: more 
consistently than do externals. The at of 
upward influences and role conflict is less 
clear-cut. However, when role conflict ‘s high, 
supervisory consideration and motivation re- 


lations remain strong, the consideration /effort 
relationship is enhanced. 

A further question can be raised concerning 
the leadership/motivation/behavior links: 
Can it be demonstrated that motivation oc- 
cupies an intervening position between lead- 
ership and behavior as suggested in Hypothe- 
sis 3? One way to test this is to partial out 
motivation from the leadership/behavior re- 
lationship. The results for such a test are 
presented in Table 4. For the behaviors of 
effort, helping fellows and training subordi- 
nates, it appears that the partials drop sub- 
stantially, suggesting that this linkage is ap- 
propriate; however, for the behaviors of do- 
ing what the boss tells or making suggestions 
to the boss, the consideration/behavior rela- 
tionship remains just as strong after motiva- 
tion has been partialled out. 

The data for testing Hypothesis 4 (the 
relations between motivation and behavior) 
are presented in Table 4. For all behaviors 
except giving suggestions to boss, the hy- 
pothesis is supported (at p < .05). However, 
correlations are generally weak and motiva- 


ely explains more than 10% of the 

e in behavior. 
lypothesis 5 it is suggested that when 
tion is higher, there is a stronger re- 
between behavior and satisfaction. In 
this hypothesis we noted an effect 
onfounded the contention that higher 
ation results in a stronger relation be- 
behavior and motivation. As motiva- 
nereases, both behavior and satisfaction 
ase concomitantly. The correlation data, 
n alone, indicate that there is little rela- 
hip between effort and satisfaction under 
jitions of both high (—.13) and low 
56) motivation, while the relation is 
ngest for the moderate motivation condi- 
(51, p < .01). However, the lack of re- 
onship in the low- and high-motivation 
ndition arises from different causes. An 
mination of the scatter diagrams indi- 
tes: (a) for low motivation there exists a 
istribution of effort and satisfaction over 
he whole of both scales; however, there is 
o relationship between the variables; (b) 
‘or high motivation the range of both scales 
restricted; the cases cluster at the top of 

oth. 
Discussion 


These results provide an extension of the 
path-goal theories. Wahba and House ( 1974) 
have noted the strong rationality assumptions 
underlying the path-goal model. Their cri- 
tique is given some indirect support in this 
study by noting the differences between in- 
ternal subjects and external subjects in the 
applicability of the model to their behavior. 
Internal subjects who perceive their environ- 
ment as meaningful and consistent and who 
feel able to control it are more likely to be 
rational decision makers in the sense proposed 
by economists and game theorists; while ex- 
ternal subjects who see the environment as 4 
cluster of random forces which they cannot 
control (even if it is benign) are more likely 
to make decisions tha 


t violate the rational 
assumptions of transiti 


vity, independence of 
irrelevant alternatives, and dominance of al- 
ternative strategies. The results indicate 
stronger support for the model among the 
internal subjects than among the external 


subjects; this finding is consistent with the 


a 
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model's basis in rationality assumptie 
Path-goal theorists, in trying to recreate 4 
“goal-directed man” model of behavior, have 
to meet the rationality assumptions require 
Their data have not yet done this. 
There is a further problem which path- 
theorists have not yet adequately dealt wi 
that is, the question of conflicting path-g 
instrumentalities. If an activity leads toward 
one cluster of goals and away from an equally” 
important second cluster then, by our com 
putations, the valence of that activity is neu 
tral—neither desired nor not desi n 
fact, the same as if the goals were unimpo 
tant or the activity led nowhere. However, 
Dalton (1947) has demonstrated that 
group who were neither rate-busters ‘ 
quota-restricters were conflicted in the val- 
ence that productivity had for them; P 
were ambivalent toward it—they had inter- 
mediate performance and, it appears, 
suffered from psychosomatic disorders. 
path-goal theory can not yet predict th 
kinds of outcomes. 
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in the black and the hard-core samples than in the white t 
class groups. Results partially supported Hypotheses 1 and 2; partially re- 
versed Hypothesis 3; and did not support Hypothesis 4. Data were interpreted 
as supporting the concept of white working-class alienation. 


ihe general instrumentality-expectancy 

del (Fishbein, 1967; Graen, 1969; Porter 

Lawler, 1968; Vroom, 1964) is generally 

plied to job performance and satisfaction, 

well as to the broader issue of the attitude- 
havior relationship. It can also be a fruit- 

i source of hypotheses about the job-seek- 
ing behavior of different race and social class 
groups. 

For example, Graen (1969) has shown 
that his model holds only in a “reciprocating 
mate“ where performance is regularly 
evaluated, good performance rewarded, and 
contingencies clearly communicated. The job- 
seeking environment of the hard-core unem- 
ployed, especially blacks, can be considered 
nonreciprocating because of a lack of job- 
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relevant skills and, in the case of b E 
racial discrimination. Thus, it can be pre 
dicted that the instrumentality model will n 
apply to these populations (for a more co 
plete discussion see Feldman, 1972b). E 
Furthermore, the hard core cannot attain 
many socially desirable goals. Under these 
conditions, Cohen (1955) predicted the de- 
velopment of norms contradictory to those o 
the larger society. The work-related norms of 


the study of job-seeking behavior. Fishbein 
(1967, chap. 51) has proposed and success- — 
fully demonstrated (Ajzen & Fishbein, 1970) 
that the overall attitude (defined in terms of 
pure affect or evaluation) toward the object 
(o) of an act (Ao) generally does not di- 
rectly predict a specific behavior toward that 
object. Rather, the attitude toward the spe- 
cific act (A act) s combined with normative 
beliefs relevant to the action, predicts 
individual’s behavioral intention with re 
to the particular act. These intentions medi- 
ate between the model and measures of o 

3 The evaluation of the act (A act) is defined 
the subjective expected utility of the act, a 
tion of the act’s expected consequences and thei 
evaluation, 


179 


consideration of Fishbein’s (1967) 
aad Green's (1969, p. 22) revieed 
ings to mind the question of whether 


“It should be noted that the definition of 
fet (A act) are based on Graen (1969), though the 
r arare Aon 
the Grace and 


f 
f 
i 


The second is the sum of products o! o 
of the same variables for a norm s to 
each behavior. These are called NP ral 
and NP specific, respectively, (c) ti. 


mate of A act (or “internal pressure he 
sum of products of the evaluation he 
intrinsic outcomes of each job-seei be. 
havior times the expectancy that the me 
is received is the estimate of A act The 
individual's directly measured evalu of 
working times the expectancy that a cu. 
lar behavior leads to a job is symbo by 


(VwPw). 
In regression equation form, this | +: 


BI = (Aube, 
+ (Ac) w. + (NP general) 
+ (NP specific)ws ＋ (A act)» 
+ (Vwi a 


The following hypotheses relat se 
variables to the environments of at 
socioeconomic groups. 


Hypotheses 


1. The hard core of both races pe ive 
weaker pressure from friends, family, an. self 
to get a job if unemployed than do the k- 
ing class. Perception of general norm 
tion is also weaker for the hard core. 

2. Both general and specific normative 
terms predict behavioral intentions 
2 in the hard core than in the work ng- 


3. Following Fishbein, neither AOp: not 
AOp independently predict behavioral inten- 
tions; A act (or “internal pressure”) inde- 
pendently predicts BI. 

4. Multiple correlations of the variables 
with BI in the black and the hard-core 
groups are lower than those in the white and 
the working-class samples. 


MetHop 
Subjects 
Subjects were 100 black and 100 white males, be- 
tween the ages of 18 and SO, living in the St. Louis, 


industrial firms (in the case of the working class). 
Core subjects were defined as those having 
Unemployed for at least three months prior to 
Contact and being defined by a social-service agency 


ig 
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TABLE 1 ality does not reside in race or soci s$, 
PERCEIVED NORMATIVE PRESSURE FROM Four Sources but in the experiences one has as a of 
To Loox ror Work race or class, such data are potentia ite 


Economic class 


„„ | Proportion 
my j p z | of 
18 Working | variance 
Norm | class 
source — E 
Race jiz 
“| I 
Black | White | Black | White 8 
r per Ts * TE AE 
Friends 242 | 2:27 | 2.50 ah | == 
Family 2.92 | 2.90 | 2.98 2 
People in 
general’ | 1.94 | 2.10 | 2.23 06 
Self 2.90 | 2.79 | 3.00 2 


Note. Ratings r tange from 1 Gow) to 3 (high). All means are 
to second decimal. Proportion of variance (univariate 

a) 53 to . omega-squared (Hays, 1963), 

— a effect, E = cconomic-class effect, I = interaction 


> 8 univariate interaction effect (p < .0003). 
»Slentfcant univariate Seni ston tp < 004), 


andis, 1972; for examples of practice items, see 
Feldman, 1972a). 

Subjects were interviewed in a standard order: 

evaluation, instrumentality, expectancy, norms, be- 
havioral intentions, employment history, and demo- 
graphic data, Within a topic, order of methods was 
counterbalanced (for details of the multitrait, multi- 
method validation of evaluation and expectancy 
measures, see Feldman, 1972a). Subjects who could 
not comprehend a task (after re-instruction) at any 
point were excused with a small payment. Less than 
9% of the black hard-core sample and less than 5% 
of the other samples had to be replaced due to 
inability to perform the tasks. No other subject 
attrition occurred. 
Interviewers used a neutral tone and manner de- 
veloped through role playing. To minimize biasing 
factors, interviewers were never told the specific 
hypotheses under investigation and were always 
of the subjects own race. In addition, 10% of the 
subjects were called at random and questioned about 
the interview to ensure the quality of the data. 


RESULTS 
Validation oj Sampling Strategy 


Analyses of a series of demographic and 
job-history questions revealed no unusual 
between-sample differences that might con- 
fuse interpretation of the data.? Since caus- 


The multivariate analysis of variance, which 
provides a simultaneous test of significance for all 
dependent variables, was used to analyze this data. 


useful. 


Tests of Hypotheses 


Hypothesis 1, that the hard cor: th 
races perceive weaker pressure to fin zif 
unemployed and have weaker norm ion 
perceptions as well, was partially st ed. 
A significant multivariate economic ef- 
fect (F = 2.53, p < .05) was found at- 
ings of perceived pressure to look í tk, 
As Table 1 shows, this was due to 4 ifi- 
cant univariate effect, in the predict: ec- 
tion, for pressure from one’s self. 

A significant multivariate interac! vas 
also present (F = 3.79, p < .006). vas 
due to a univariate interaction on r: of 
pressure from people in general. Th ick 
working class reported higher pressu om 
this source than any other group. 

Table 2 illustrates similar results rat- 
ings of norm direction. A significan lti- 


variate economic-class effect (F = 3.1 p< 
.02) was due to the hard-core samp! 
dency to make fewer “should do” reso ses 
for ratings of personal normative belie 
those of “people in general.” 

A significant multivariate interaction 2 
occurred (F = 6.10, p < .0002). Univari 
tests show that it was based on the 
hard core’s tendency to perceive v 
norm-direction from “people in general” than 
that of the white hard core, while the oppo- 
site was true in the working class. 

Hypothesis 2, that both general and spe- 
cific normative terms (NP) have lower cor- 


Univariate analyses of each dependent variable were 
also conducted, to more precisely identify the source 
of any observed overall differences. Univariate 
effects not associated with significant multivariate 
effects were not reported, to minimize the possibil- 
ity of a Type 1 error. Jones (in Cattell, 1966) pro- 
vided a discussion of this technique. 

Since race and socioeconomic class are correlated 
in the population, a purely random sampling design 
reflects this correlation. In order to meet the as- 
sumptions of analysis of variance, these variables 
must be made independent (unconfounded). The 
analyses of job-history and demographic variables 
show that this goal was, in fact, achieved (for a 
brief summary, see Feldman, 1973; full details are 
reported in Feldman, 1972a). 


with behavioral intentions in the 
samples, was not supported. As 
shows, the correlations of general 
fic NP with behavioral intentions 
er significant for the black hard 
were significant 7 of 10 possible 
the white hard core. Many behavior- 
sle-specific effects were also present. 
nesis 3, that attitude toward working 
es not predict behavioral intentions, 
iude toward the specific act (A act) 
dently predicts behavioral intentions, 
i supported. Table 4 presents the 
lized regression weights and multiple 
ion for each behavior in each sample, 
ever significantly predicted behavioral 
ns, while Ao, based either on the Job 
ption Index or population-elicited out- 
was significant 4 out of a possible 10 
for the black working class. Once again, 
sample- and behavior-specific effects 


d. 

ble 4 also illustrates a failure to support 
othesis 4, that the black and the hard- 

samples’ intentions are less predictable 
those of the white and the working 
s. The white hard-core and the black 
-orking-class samples were the most predicta- 
on the basis of the number of significant 
vultiple correlations. Two-way (Race X Eco- 
nomic Class) analyses of variance (Jones, 
1968) were performed on the multiple corre- 


TABLE 2 


PERCEIVED NORM DIRECTION FROM Four SOURCES 
TO Loox ror WORK 


Economic dass 
— — — 
Hard core | Working class | of variance 


SS 


Black White | Black KR EI 

Friends 2.17 2.79 aa fet 
. — 3.00 | 3.00 — 1 — 
'eople in — | 03 0 
= | o2|— 
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or better), 
E Significant univariate interaction effect (P < 0000. 
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TABLE 3 


CORELLADON or GENERAL AND Serumi Noam 
Peeves witt Five Benaviosa:t ene, 


Norm presare 
Behavior — b 
Sample ! 
Agency | uC | -0 
awe 17 
| WHC | eg 
Í wwe N 
Applications l RHC -H 
BWC 17 
wie arn 
wwe As 
Training BHC m 
BWC | 9 
WHC | a 
| wwe | sr 
Ads | BHC =A 
| BWC as 
WHC 1 
j wwe 2 
Friends | BHC | w : 
BWC | —.06 | 0 a 
unc e 
wwe | —.19 1 19 
Note. All rounded to — 
bard core = Men woking chas, WHC = white 
hard core; WWE = white working clase. For BHC, « = 4 for 
BWC, w = S2; for WHC, # = S1; for WWC, e = 48. 
p< 0S. 
p< 02 
p< 01. 
s.p < 0 


lations for each behavior. Only a very mar- 
ginal (x? = 2.95, C 10) interaction on in- 
tention to go to an employment agency and 
a somewhat stronger (x= 4.44, P< 05) 
interaction on intention to read want ads 
were found. Neither was in the predicted 
direction. 
Discussion 


Limitations on Internal and External 
Validity 

The results were, for the most part, not in 
the predicted direction. It should be noted 
that response-bias effects are probably not 

nsible. Means and standard deviations 

for all variables in all samples were syste- 
matically compared, and no consistent pat- 
tern of extremely high or low means Co! i 
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TABLE 4 
Premcroon or Five BEHAVIORAL INTENTIONS IN Four SAMPLES BY THE INSTRUMEN ty Mop. 
Standardized regression weights 
Behavior — — 
Sample Aozpi A= Nye NP. ne A act VwPw R 
Agency BHC | —.25 05 =13 04 | -03 | —.06 29 
è BWC 13 25 ‘08 30 —09 10 38 
WHC OF 22 22 48% 02 | ~.14 | ohe 
| Wwe 0 01 30 05 | —.16 0s | 32 
i 1 
Applications | BHC 16 10 —.08 | —.09 31 4 at 
e s 585 19 03 07 | 48 
WHC) Ob aaO 7 20 01 29 | {Seem 
P WWCS) Sato fo “ok 00 32 00 67 40 
Training | BHC | =.12 | —.06 S 17 12 2s 
BWC | -0 41 44 .02 —.02 00 Sone 
| WHC | -25 AA Dae 07 | —.02 ji 32 
WWG | -—0l | —12 Zi 06 8 12 33 
Ads | BHC | oo | —0s ~.29 27 20 12 38 
| BWC | 325 25 AS 13 02 61ee* 
| WHG | -07 | 20 4 19 05 ror ber 
| wwe | —.23 | —.09 33" — 03 —19 15 | 46 
Friends | BHC 05 01 O1 .21 3 22 E 
| BWC 08 08 —10 30 1 
| WHC (ae (Ser 32* 09 o | i | .50* 
| WWC | —08 | —0 —.15 .08 —.06 — 45 


5 Note. All figures rounded to third decimal, Ao = attitud 
‘Bormative 


of interest; Vw! = the individual's directly measured evaluation 
vior leads to a job. For BHC, n = 48; for BWC, „ = 52; for WHC x = 
*p <05. 
<o 
“ep <.01 
sss. > <.001 


with low standard deviations was found. 

Thus, response bias in one sample or the 

Other seems unlikely. It is simpler to assume 

that the observed differences are valid than 

to propose response biases which operate in 
different directions for different variables, and 
sometimes not at all. 

Interviewer bias or demand-characteristic 
effects were also possible. However, since 
interviewers were never told the exact hy- 
potheses, and since some involve combina- 
tions of variables and the prediction of cor- 
relations rather than mean differences, it 
seems unlikely that the obtained results 

were due to bias. Furthermore, the interview 
was highly standardized, more like an oral 
questionnaire; interviewers were trained and 
tested in the use of neutral tones of voice 
and expression; they were also limited in the 
_ interpretations they could provide. The im- 


eure: Agon = outcomes from the Job Descriptive Index; A 


oward the act Ne = 
Ao = outcomes elicited from members of the popr 
of working times the expectancy that a particular 
52; for WWC, » = 48, 


le toward the object; A act = attitude 


portance of unbiased data was stressed re- 
peatedly, and informal observation and dis- 
cussion indicated strong motivation to per- 
form properly. The above precautions do not 
rule out biased data, but such a possibility is 
less likely than it would be otherwise. 7 


Race and Economic-Class Differences 


It seems true that the hard core of both 
races feel less normative pressure to look for 
work, at least in terms of their “own” norm. È 
The black working class appears to feel yi 
normative pressure from self and society 
most strongly. The differences were not large, 
but this may have been due to a lack of 
response options. 

However, only in the black hard-core sam- 
ple was there no relationship between norms 
and behavioral intentions. This finding con- 
trasts sharply with the white hard core to 


{ 


Š 


norms seem important. It is tempting 

lude that ideas of what one “should 

not strong determinants of the black 

re's behavior, but such a conclusion 

ature. Besides the inability to infer 

from correlational data and the 

r cross-validation in any case, it may 

t the norm sources used were not 

it to the black hard core. The hypothe- 

reasonable, however, and deserves ex- 
ental investigation. 

e failure to support Hypothesis 3 may 

ue important group differences. Fishbein 

>) has stated that attitude toward the 

nd /or normative beliefs, but not attitude 

d the object, should independently pre- 

behavioral intentions. This is usually 

ound to hold true. Job seeking, however, 

be a special case, since the loss of a job 

be a personal disaster. Any behavior 

ich might lead to a job, whatever the prob- 

lity, may be performed. Thus, the more 

ie values work, the more one should look for 

job, regardless of the evaluation of the 

chaviors involved. 

This attitude toward work and job seeking 

cms to be the case in the black working 

class. Why it is not in either hard-core sam- 

le is suggested by correlational evidence 

reported more fully in Feldman, 1972b). In 

the combined hard-core sample, evaluation of 

work was significantly negatively correlated 

with the skill level of the last job held and 

whether nonpublic aid income was received. 

Two behavioral intentions were also nega- 

tively correlated with the receipt of non- 

public aid money.“ The black working class 

may regard work as a source of outcomes 


beside money, and the hard-core samples may 
source of income 


predicted behavioral 
hard core (with one exception), while the 

black hard core were not predictable at all. 
The white working class, 
were also unpredictable, may 
ferent orientation. If they 
as a way to support themselves, 
8 The correlations of these and other job-history 
with evaluation and in- 


and demographic variables 
tentions measures are reported fully in Feldman 


(1972a, 197 2b). 
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1 
Bleck White | Bisi | Wie R |N 
Leck for works 4% ae eee ee i 
Go vo an emmo) i 12 
mem agwncys | 2.77 | L99 | 204 za |10 
Paon maay | | i | 
konee aa an| am] zala 
R wast ad | 18 | KU sa Ate = 
Try te a l 1 
y p 1] i i i 
11 neee 
Hang f | j | 
with frjemde 140 | 3.08 w | 8 |= 
Note, Mult) „i once .. Aac 


ivariste effects: 
cia (F = 5.25, p < 001), interaction d n 
are toundrd to | 


ance ( te 
(Hay, 1905). R = raoe K = moman i 
1 = im 

+ Univariate race effect (p < 08 or better), q 
* Univariate ip < DS ot beret), 

+ Univariate imetaction Beet (p < 02). 1 


other money-making skills, a regular job is 
vital. Thus, evaluation of work does not 
relate to intentions to seek it, because they 
perceive no Feldman (1973) has 
shown that black working-class men believe 
they can get along well if unemployed, but 
white working-class men do not. Therefore, 
the blacks’ evaluation of work should corre- 
late with job seeking, since they perceive a 


choice. 

The fact that not all behaviors were pre 
dictable also calls for some comment. It may 
have been that some behaviors had very high 
or low-base rates in some samples and thes 
reduced the obtained correlations. This pos 
sibility is partially supported by Table 
which shows that the only unpredictable be 
havior for the white hard core has an e 
tremely low intended frequency. It does ne 
completely explain the failure to predict thre 
of five behaviors in the black working clas 
since “want ads” has a fairly high intende 
frequency. Also interesting is the finding th: 
training programs appear much more saliel 
for black than for white subjects. 

The fact that the attitudinal terms pr 
dicted behavioral intentions for the bla 
working class and normative for the whi 
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hard core, while the white working class and 
the black hard core were not predictable 
at all, suggests that recent social and eco- 
nomic reforms have not yet reached the black 
hard core and have negatively influenced the 
White working class. Changing these percep- 
_ tions and attitudes is not a psychological, but 
_ a social problem. They will change when op- 
_ portunities for rewarding employment change, 
and not before. 
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were collected for 12 consecutive 


arch on goal-setting theory (Locke, 
has been criticized for its lack of em- 
on investigating employee behavior in 
ural work environment (Heneman & 
ib, 1972). Data supporting the theory 
ased largely on laboratory experiments 
hich the dependent variable is typically 
sk requiring simple addition. As Camp- 
Dunnette, Lawler, and Weick (1970) 
» stated, the differences between college 
lents solving addition problems and the 
‘avior of workers in industrial settings 
st be considered. 
‘titicism on the theory’s lack of external 
dity is not completely justified. Latham 
d Ronan (1970) factor analyzed data ob- 
‘ned from a random sample of 292 pulp- 
wood producers. The results showed that goal 
citing and supervision loaded on the same 
factor as two measures of production (posi- 
tive) and one measure of injuries (nega- 
tive). Goal setting without supervision loaded 
on the only factor containing measures of 
turnover (positive). Supervision that did not 
include goal setting loaded on a third factor 
that did not include any criterion variables. 


This study is based in part on a technical report 
written by the authors for the American Pulpwood 
Association, Harvesting Research 
opinions expressed herein are those 


Kraft Company. 

The authors are grateful to Edwin A. Locke and 
Gary A. Yukl for reading and commenting on an 
earlier draft of this manuscript. 

2 Requests for reprints should be sent to Gary P. 
Latham, who is now at the nt of Human 
Resource Planning and Research, Weyerhaeuser 


Company, Tacoma, Washington 98401. 
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TRAINING IN GOAL SETTING* 
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of goslactting theory 10 
operations were matched 


randomly assigned to either a one-day training program in goal setting or a 
control group. Measures of production, 


alwenteeism, and injuries 


zoal setting can lead to an increase in production and a decrease in absentertsm. 


analysis of variance on the man-hour produc- — 
tion. The results indicated that producers 
who supervise their employees and set pro- 
duction goals have higher productivity than 

who supervise their men but do 
not set production goals. In summarizing the 
results of these two studies, Ronan, Latham, 
and Kinne (1973) interpreted their findings 
as supporting Locke's (1966) contention that 
supervision leads to superior performance 
only insofar as it results in the establishment 
of specific performance goals. However, 
supervision, as defined by staying on the job 
with the crew, is required to insure that the 
worker accepts these goals. Goal setting with- 
out the presence of supervision may lead to 
an increase in labor turnover. 

A limitation of the above studies is chat 
both were correlational in nature. In no in- 
stance was the independent variable manipu- 
lated by the authors. The purpose of the 
present study was to determine the effects of 
a one-day training program on goal setting 
on the job performance of pulpwood workers. 
The primary criteria for measuring the ef- 
fectiveness of the program were cords- per- 
sawhand-hour and cords-per-crew-hour pro- 
duction. Additional criteria included 1 
sures of turnover, absenteeism, and injuries. 
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Muro 
Sample 


Twenty-six producers and their crews were se- 
fected and matched on the following variables: (a) 
crew size, (Ò) cords-per-man-hour production, (c) 
| geographic location, (d) logging system, (e) delivery 
point, (/) sawhand function, (g) delivery system, 
(a) number of hours the producer spent on the job 
site with the crew, and (i) “did not previously set 
a daily, weekly, or monthly production goal.” One 
cem from each pair was randomly assigned to the 
the remaining subjects constituted a 

The method of payment for both 
was their normal piece-rate system. No 
_ monetary or nonmonetary bonus other than verbal 


Procedure 
Seven company foresters who were aware of the 
aims and objectives of the producers job, who 
_ frequently observed the producer on the job site 
and who were well-known by the producers, were 
en instruction in training. A three-hour program 
conducted in which the purpose of the experi- 
went and the assumptions underlying it were ex- 


_ plained. This program was followed by an intensive 


in a research program, that the 
would last three months, that they were under 
ligation to enter the program, and that they 

leave the program at any time. The essential 
difference in the instructions to the two groups was 
mat the experimental subjects were told that the 
3 of the study was to give them a one-day 


equations that had been developed from actual field 
measurements to predict cords per man-hour and 
z number of trees cut per sawhand-hour, production 
5 tables were constructed to systematically establish 


ahs 
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production goals.? These tables took ir unt 
factors that could affect production du 
ences in harvesting stand conditions 

On the first working day of each wee! n 
visited the experimental and control ; ‘ 
the job site. The average diameter b e 
based on 30 to 40 trees to the neare 
determined for the harvesting area. B 
table which represented the sawhand 
stand condition, a production goal in 
hand hour was determined for the 
and/or week. This goal was explained 
ducers in the experimental group w 
explained it to their sawyers. This goa! 
as a minimum standard of acceptable 
however, no penalties were provided 
attain the goal. The production goa! 
known to producers in the control gn 

Tally meters were provided for th: 
the experimental group so that they 
stantly monitor or evaluate their perio 
assumption at this point was that e: 
accumulated on the tally meter provid 
tion feedback to the employee and thus 
ing for the individual. The possible 
effects of goal feedback with goal settin: 
sidered minimal, since previous research 
that knowledge of score alone does not lea 
performance unless it is used to set go 
Cartledge, & Knerr, 1970; Locke, Car ‘se, & 
Koeppel, 1968). 

Producers in both the experimental anc 
groups were given a form similar to that 
Latham (1971) to record production, tur: 
senteeism, and injuries. The only differen- 
the form given to the experimental group 
given to the control group was that th 
contained an item requesting information concerns 
ing the production goal. The trainer had à ques- 
tionnaire on which the production goal íc 
control group was recorded. These forms s § 
same purpose as the tally meter for the awer, 
namely, knowledge of results to the producer. 

Data were collected for 14 consecutive weeks. The 
three-month study period was employed in order 
to determine the effects of the one-day training 
program over a considerable length of time. The 
results of only 12 weeks were retained, inasmuch as 
mill shutdowns occurred in 2 of these weeks, and 
hence the majority of the sample were unable to 
deliver timber. 


RESULTS E 
The results of this study were based on SIX 
performance scores. The two primary Ctl 


These equations were developed by Georgia — 
Kraft Company prior to this study. Time studies 
were made of more than 300 logging operations On 
varying harvesting conditions. The resulting produc- a 
tion goal may be characterized as moderate in difi- 
culty in that average to above average production 
is necessary in order to achieve the goal. Informa- 
tion concerning these tables can be obtained from 
the second author. J 
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ter rds per sawhand-hour and cords per 
cr r, were based on the arithmetic dif- 


5 turnover, absenteeism, and injuries, 
1 r was defined by two measures, 
n the number of men who quit and 
n of men who were fired. 
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on the production of the individual 
sawhands. The difference in production 

tween the group of sawyers who received 
ining in goal setting and the control group 
was in the expected direction but the differ- 
ence was only marginally significant (F= 
3.43, df= 1/9, p < .10). The performance 
means and standard deviations of the indi- 


TABLE 1 
SUMMARY OF DIFFERENCES BETWEEN GOAL AND ACTUAL 


PERFORMANCE FOR SAWHAND IN 
Corps PER SAWHAND-HOUR 
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his equipment., The results showed that the 

ormance of the experimental crews was 
significantly higher than that of the control 
group (F = 7.44, dj= 1/9, p< .05), thus” 
supporting the assumption that the sawyer 
has a key position in the producer’s work 
force. Again, the effects of weeks and the 
Weeks X Goal Setting interaction were not 
significant. Thus, it may be concluded that 
goal setting leads to a significant increase in 
the productivity of the group as well as 
individual. Table 2 shows the means 
standard deviations for each crew. 

The finding that the difference betwee 
the experimental and control crews was at 
higher level of significance than that bet 
experimental and control sawyers is no 
worthy, This result underscores the im-“ 
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portance of a well-balanced harvesting sys- 
tem in terms of manpower and equipment. 
Any improvement in performance in one func- 
tion in harvesting, even if only slight, can 
have an appreciable effect on the total per- 
formance of the operation when its effects are 


distributed (and hence magnified) over a well- 


balanced operation. Conversely, if perform- 
ance in one function is greatly improved, its 
effects on the entire system are nullified to 
the extent that the operation is unbalanced. 

Two-way analyses of variance (tables are 


not shown) were conducted to determine the 


effects of a month (four weeks) and goal 


" setting on turnover, absenteeism, and injuries, 
No significant differences were found between 


months. Moreover, goal setting had no sig- 
nificant effect on the two measures of turn- 


over or injuries. The number of people who 
quit or were fired or injured was very low in 
both groups. Absenteeism, however, was sig- 


nificantly higher (p<.05) in the control 
group than in the experimental group. 


Discussion 
The present study has shown that training 


in goal setting can lead to an increase in pro- 


duction and a decrease in absenteeism, The 


theoretical advantage of approaching the 


problem of worker motivation through goal 
setting is that it is not dependent on the use 
of mythological terms such as id, ego, and 


Superego; nor does goal setting postulate per- 
sonality mechanisms, or drive states, or sepa- 


Tate and distinct factors independent of each 
other that contribute differentially to short- 


term and long-term performance. However, 


goal Setting, as a theory of motivation, does 
focus on variables intrinsic to the job, namely, 
the job task(s) itself. It is based on a 


learning model in that the worker must be 


taught to set a task objective and he should 
be given information feedback concerning his 
performance, Knowledge of results provides 
meaning to a task. It is probable that tree 
counting in the present study enabled the 


_ Sawyer to obtain a sense of achievement, The 
number of trees cut assumed meaning in that 


the worker was able to determine the extent 
to which his performance was above average, 
It may have been this “meaningful” dimen- 
sion that contributed to the lower absenteeism 
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in the goal-training group, that is set- 
ting led to effective performance ive 
performance led to job satisfactio job 
satisfaction led to a reduction in a ism. 

The use of the production table sys- 
tematic method for setting product vals 
is of theoretical importance. Wor! vels 


of aspiration has shown that if ndi- 


vidual sets a performance goal be arry- 
ing out a task, he tends to raise il if 
he is successful, since he increases pec- 
tations of himself (Lewin, 1951). , to 
improve one’s performance one mus st as- 
pire but to aspire one must see th cess 
is possible, that is, clear evidence t be 
available that others under similar í ions 
are succeeding. The production ta! vere 
based on data collected from a larg: iber 
of sawyers and hence provided this nce. 
The sawyer knows that the goal is a able 


and that it is not based on the w! 
demanding supervisor or on the ur 
expectations that he himself has. 

While there is an obvious need fi 
tional research on the validity of goa! 
theory, the situation is by no means 
ous as previous researchers have lec 
believe. As a theory of motivation, ¢ 
ting appears to be effective in an ind 
setting, and its effects appear to hold over 
time. A possible limitation of training in goal 
setting is that it is too simplistic for 2 com- 
plex situation, that is, it leads to a significant 
improvement in the performance of the aver- 
age or above average worker, but it does lit- 
tle for the worker who is badly in need of a 
comprehensive training program. Neverthe- 
less, even in such instances the supervisor 
who sets a specific task goal for his subordi- 
nates is in effect making it clear what it is 
they are supposed to do (Campbell et al., 
1970). Thus, in the process of discussing with 
the worker the nature of these goals, the 
worker may acquire specific knowledge con- 
cerning his job tasks, their priorities, and the 
most effective methods that lead to their 
attainment. 
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ASPECTS OF DRIVING PERFORMANCE AS A 
FUNCTION OF FIELD DEPENDENCE 


PAUL L. OLSON + 3 


General Motors Research Laboratories, Warren, Michi ga- 3 


Three studies were conducted to investigate the relationship between field 
dependence and performance in selected driving situations. The first study 
compared the field dependent and field independent groups in terms of control 


in a skidding situation, Overall performance levels were not st 


different although the field independent subjects showed a performance 
improvement after the first trial and the field dependent subjects did not. 
The second and third studies compared performance in a platoon car-following 


situation. Results indicated that field dependent subjects may not use 
tion from vehicles ahead of the lead vehicle as well as do field inc 


subjects. 


This study reports the results of three 
studies relating field dependence (e.g., Wit- 
kin, Dyk, Faterson, Goodenough, & Karp, 
1962) to selected measures of driving per- 
formance. Although the literature concern- 
ing field dependence is substantial, relatively 
few references relate to the driving situation. 
A comprehensive program of research in 
the application of field dependence to driver 
capability was proposed by Michael (1968), 
but no experimental work was reported. Bar- 
rett and Thornton (1968) discuss the results 
of a study in which subjects were exposed 
to an emergency situation (pedestrian step- 
ping in front of a car) in a driving simula- 
tor. They found a measure of field depen- 
dence, rod-and-frame test, significantly corre- 
3 lated with reaction time, deceleration rate, 
_ hit-miss criterion and incidence of simulator 
= sickness. Field independent subjects per- 


The author wishes to express his gratitude to a 
number of individuals who assisted at one phase or 
another of the research reported in this paper. D. N. 
Goodenough and P. K. Oltman were most helpful 
at all phases of the project, supplying ideas and 
making suggestions. R. Herman and R. W. Rothery 
assisted in the design of the car-following studies 
and were also quite helpful in other ways. K. Migda 
assembled and maintained the instrumentation 
throughout and also assisted in collecting data. 
Sincere appreciation is also due the two reviewers 
who made significant contributions to the revision 
of this paper. 

Requests for reprints should be sent to Paul Olson, 
who is now at the Highway Safety Research Insti- 
tute, Human Factors Department, University of 
Michigan, Huron Parkway at Baxter Road, Ann 
Arbor, Michigan 48105, 
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tically 


forma 
lependent 


formed better in the tests but were more 
prone to simulator sickness. A later study 
(Barrett, Thornton, & Cabe, 1969) investi- 
gated an alternative measure of field de- 
pendence, the Embedded Figures Test, as 
well as the rod-and-frame test, using the 
same driving task and replicating their find- — 
ings with respect to prediction of emergency ~ 
behavior. i 

Thornton, Barrett, and Davis (1968) found 
that field independent subjects performed 
better in identifying targets in aerial photo- 
graphs. Similarly, Moses (1970) found that 
his feld independent subjects performed bet- 
ter in a complex vigilance task and Ton 
(1972) found that field independent subjects 
responded more quickly in the detection of 
motion in depth. These results suggest that 
field independent subjects may also perform 
better in identifying pertinent stimuli in a 
complex street scene. : 

The only published study dealing with field 
dependence and actual accident involvement 
is that reported by Harano (1969), who 
found that field dependent drivers associated 
with a higher frequency of traffic mishaps — 
than field independent drivers. 

The work reported in this study examined ` 
the relationship of field dependence and i 
measures of driving performance, specifically, 
skid control and car following capability. 


Subjects 


This study was conducted in three phases. 
For the first two phases a sample was con- 


structe screening 93 employees of the 
Gener tors (GM) Proving Ground in 
Milfor higan and by using the portable 
rod-an re test described by Oltman 
(1968 jects scoring at the extremes were 
compa in terms of personal data in an 
effort nd 10 each field independent sub- 
jects ield dependent subjects that were 


matc! s well as possible in terms of age, 


Sex, iriving experience. Rod-and- frame 
test for the 10 field dependent sub- 
jects d from 50 to 198; those scores for 
the d independent subjects ranged from 
4 to 

F ase 3 another sample was generated 
that drawn from a group of 72 employ- 
ees se General Motors Research Labora- 
toric’ Similar matching procedures were em- 
plo: Scores for the 10 field dependent sub- 
jec this sample ranged from 75 to 179; 
tho ores for the 10 field independent sub- 
jecis ranged from 3 to 23. 


PHASE 1: SKID CONTROL 


Control of a car on a slippery surface is one 
of the most challenging situations the average 
driver faces, requiring precise control inputs 
and exact timing. It seemed promising to de- 
termine whether performance differences in a 
skid control task might be related to field 
dependence. 


Method 


A straight lane, 7 feet wide and 400 feet long was 
marked out with traffic cones set at 20-foot intervals 
on a special low-friction surface at General Motors’ 
Milford Proving Ground. The front brakes on the 
test vehicle were disabled, making it prone to spin 
out. Subjects were instructed to approach the lane 
at 30 mph and begin braking as they came abreast 
of the first pair of cones, trying to stop in the 
shortest distance possible while keeping the car 
within the lane of cones. All 20 of the proving 
ground subjects described earlier participated in 
this study. Each made 10 runs through the course. 
No practice trials were allowed. 


Results 


Analysis of variance for mean stopping 
distance indicated that only the interaction 
between subject classifications and trial was 
significant (F = 2.22, df =9, p < 05). An 
analysis of simple effects revealed significant 
variance (F = 2.61, df = 9, p < 01) among 


trials for only the field independent group. A 
Newman-Keuls test on these data ` 


poorer than all other trials (p< .01). 
analysis of variance conducted on the co 
data revealed no significant differences. 


Discussion 


Differences between the groups of subj 
in performance on this task of skid control 
could not be established. It is interesting to 
note the superior ability of the field indes“ 
pendent subjects to improve their perform- 
ance after a brief exposure to such a task. 
This finding is consistent with the results of 
a number of studies (see Witkin et al., 1962) 
that indicate a greater capability on the part 
of field independent individuals to alter their | 
behavior in the face of changed conditions, 


PHASE 2: Car FOLLOWING 


Since many drivers log a significant per- 
centage of their miles in congested traffic, 
where they are close enough to a lead car to 
be immediately influenced by what it does, 
and many drivers maintain headways of less 
than a second on fast-moving freeways, it 
would seem essential that they be well in- 


— 


formed about what is happening ahead of the 
car directly in front of them. It has been 
suggested by two sources (D. R. Goodenough, 
personal communication, May 1968; Michael, 
1968) that the feld dependent driver could 
have his consciousness so dominated by the : 
F 
: 


Phases 2 and 3. 
Method 


Subjects drove the last car of a three-car pla- 
toon, Each subject drove under two conditions: 
(a) open—allowing the normal visibility of both 
lead cars, and (6) closed—making an effort to 
screen the view of the lead car by blocking the rear 
window of the middle car and by shifting the 
lateral position of both lead cars to make it more 
difficult for the subject to see around the middle s 


car. 

All 20 of the Proving Ground subjects described 
earlier participated in this study. All took the condi- 
tions in the same order in preference to the usual 
balanced design, since it was felt probable that 
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CLOSED 


10 


2.0 2.5 SECONDS 


DIFFERENCE 


0.5 0 
DECREASED R.T. 


0.5 1.0 SECONDS 
INCREASED R.T. 


Fictre I. Reaction times for individual subjects to onset of accelerations 
by the lead car. 


Subjects who took the closed condition first became 
‘more attentive than usual in the open condition 
because the purpose of the study had become ap- 
parent. 

The following dependent variables were employed: 

Reaction time. The driver of the lead car went 
through a series of programmed mild accelerations 
and brake applications in a speed range from 20 to 
60 mph, and the subject’s reaction time to the 
onset of these mancuvers was measured. It was 
expected that the more a driver used information 
gained from cars ahead of the lead vehicle, the 
more his reaction time should be affected by deny- 
ing him that information. 

Intervehicular separation. The subject’s car was 
connected to the middle car by a “yo-yo” device 
which provided a continuous analog record of spac- 
ing. Reaction time is affected by intervebicular 
separation and if the two groups differed in this 
respect, it could have biased the results. 

The study was run on a straight road facility 2.6 
miles long. Subjects made one complete lap, 5.2 
miles, under each condition (open—closed). During 
each lap they experienced eight accelerations and 
seven brake applications. The same driving schedule 
was followed on each lap. 

The subjects were read the following instructions: 


Imagine yourself driving home on a crowded 
urban freeway. The density of traffic is such 
that you cannot pass or change lanes and you 
can only follow along behind the vehicle in 


front of you. In this test we would like you 
to drive as you think you would under such 
circumstances. Please bear in mind that in a 
real situation such as I have described, if you 
tried to make things easy for yourself by drop- 
ping far back, someone would surely cut in 
front of you. So maintain a reasonable following 
distance. 


Subjects were also informed that their car was 
connected to the vehicle in front by a fine wire 
that measured the distance from their car to the 
car in front. Questions were then answered and the 
test began. 


Results and Discussion 


An examination of the data made it clear 
that despite the precautions taken, about 
three quarters of the subjects were able to 
see the lead car at least some of the time 
during the closed condition. As a result 
there was no difference in reaction time to 
brake applications between the groups of 
subjects. However, reaction time to accelera- 
tions did show a significant difference. These 
data are plotted in Figure 1. 

The top part of Figure 1 shows the mean 
reaction time for each subject under the open 
condition. The second line shows the mean 
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OPEN 


CLOSED 


DECREASED R.T. 


Ficure 2. Reaction times for individual subjec 
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4 6 
INCREASED R.T. SECONDS 


ts to brake applications by the 


lead car. 


reaction times associated with the closed con- 
dition. Note that the field dependent sub- 
jects occupy much the same position on the 
scale as they did formerly, while the field 
independent subjects have moved up scale 
(longer reaction times). The bottom line 
shows a plot of each subject’s reaction time 
under the closed condition after having sub- 
tracted his times under the open condition. 
Note that half of the field dependent sub- 
jects showed increased reaction time under 
the closed condition, half a decreased reac- 
tion time. On the other hand, all of the field 
independent subjects showed increased reac- 
tion times under the closed condition. The 
Separation between the groups Was signifi- 
cant (p < .01) as determined by the Mann- 
Whitney U test. 

There was a significant difference in mean 
headways (p < 01) between groups as de- 
termined by the Mann-Whitney U test, with 
the field dependent subiects tending to fol- 
low more closely than did the field indepen- 
dent subjects. 


Prase 3: CAR FOLLOWING REPLICATION 


Because the Phase 2 study permitted in- 
complete obscuring of the lead car, a replica- 
tion was conducted to determine whether 
undesirable artifacts may have resulted. 


Method 


This study was conducted on the General Motors 
Technical Center site, using five each field inde- 
pendent and field dependent subjects drawn from 
the Research Laboratory pool described earlier. The 
procedure was the same as for Phase 2 with four 
exceptions: (a) greater care was taken in the 
lateral positioning of the vehicles to more effectively 
screen the lead car; (b) the outboard pair of the 
four lamps on the rear of the lead car was disabled 
during the closed trials; (c) a four-lane divided 
facility with .7 mile straightaways was used and the 
subjects made two complete laps of the track under 
each condition, always starting in the open condi- 
tion, next going to the closed condition and finishing 
with the open condition again; and (d) instruc- 
tions were changed to the extent that the last two 
sentences (“Please bear in mind. *) were de- 
leted. 

As in the first study, Phase 1, all measures were 
taken on the straight portion of the track. There 
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were nine acceleration trials and eight braking trials 
per condition. The dependent variables were the 
same as in Phase 2. 


Results and Discussion 


3 
$ 
: In a preliminary analysis no significant 
3 difference (p > .05) was found in mean reac- 
= tion time to acceleration or braking under 
the two open conditions. The subsequent 
analysis dealt only with the first open condi- 
tion contrasted with the closed condition. 
© The acceleration reaction time data from 
this phase compared favorably with those 
from Phase 2, in that the field independent 
subjects showed more of a reaction to the 
loss of lead information than did the field 
dependent subjects. However, the difference 
© was not significant. 
r Figure 2 shows the data for response to 
braking. The first line shows the response 
under open conditions. Removal of the lead 
information was associated with a lengthening 
of the reaction time for both groups, but es- 
pecially for the field independent subjects, 
© such that all subjects were clustered at a 
_ relatively narrow range on the scale. Sub- 
_ tracting one condition from the other reveals 
that most subjects suffered an increase in 
_feaction time under the closed condition, but 
the effect on the field independent subjects 
was sufficiently pronounced in that there was 
a total separation between the groups. 
= There was no difference in mean headways 
between groups in Phase 3. 


Discussion 


The results of the preceding two studies, 
Phases 2 and 3, provide some evidence that 
individuals classified as field dependent are 
less capable of or less willing to avail them- 
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selves of lead information in a | 


n car- 
following situation. Unless the fic! endent 
driver is able to compensate for Some 
way, he may be significantly han ped in 
this common driving circumst« These 
results lead to the expectation ti eld de- 
pendent drivers are over involved ear-end | 
collisions. It appears that a rela large- | 
scale investigation of actual colli nvolve- 7 
ment by different perceptual ty; uld be 
rewarding. l 
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priate for the new situation. 


Behavioral Expectation Scale (BES) 

ias been praised as a significant im- 
ent over other rating scale formats 
to measure job performance (e.g. 
oll, Dunnette, Lawler, & Weick, 1970; 
ie, 1966). Although procedures have 
somewhat, Smith and Kendall’s (1963) 
| suggestions for developing behavioral 
ation scales are generally followed 
ely. Briefly, those procedures are as fol- 
(a) A sample of raters contributes be- 
havioral examples representing low, average, 
and high performance on the job in question. 
The investigator attempts to exhaust the job- 
performance domain by sampling the content 
of the job as completely as possible in terms 
of these concrete observables. (0) The behav- 
ioral examples are clustered by content, and 
dimensions of performance are named and 
defined. (c) Each member of a separate 
sample of raters scales the performance ex- 
amples (or anchors) and sorts them into the 
dimension categories. Means, standard devia- 
tions, and dimension frequencies are tabu- 
lated for each anchor. (d) The investigator 
decides which anchors are to be included for 
each dimension based on the criteria of low 
standard deviations for the scaling task and 
rater agreement on the sorting task. When 


lows: 


1 Requests for reprints should be sent to Walter 
C. Borman III, Personnel Decisions, Inc, 2515 
Foshay Tower, Minneapolis, Minnesota 55402. 


VIEW OF WHAT CAN HAPPEN WHEN BEHAVIORAL 
EXPECTATION SCALES ARE DEVELOPED IN ONE 
SETTING AND USED IN ANOTHER 


Supervisors rated their subordinate nurses using both Smith and Kendall's 
behavioral expectation scales and a numerically anchored format. A counter- 
balanced research design was used to explore the differences in the operational 
effectiveness of the two formats. Results indicated that in terms of interrater 
reliability and confidence in ratings, 
slightly superior. However, when the simpler scale was used there was signifi- 
cantly less leniency effect and raters were better able to discriminate among 
different ratees in terms of performance. 
behavioral expectation scale is transplanted from one setting to another, the 
effectiveness of the scaled-expectations format may suffer because the raters 
do not participate in scale development and/or certain anchors are inappro- 


the Behavior Expectation Scale was 
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It was concluded that when a 


two or more anchors have means very near 
each other and otherwise meet the criteria of 
membership for a single dimension, the o 
with the lowest standard deviation is gen- 
erally chosen. In addition, the retranslation 
step outlined in (c) and (d) above provides 
a further check on the meaningfulness of the 
formance dimensions chosen in step (6) 
above. (e) The investigator retains those di- 
mensions which he feels have a reasonable 
number of behavioral anchors meeting the 
criteria stated in (d) above. | 
Since the first Behavioral Expectation Scale 
was published (Smith & Kendall, 1963, for 
nurses), a number of other behavioral ex- 
pectation scales have been developed for a 
variety of jobs. The purported advantages of 
the Behavioral Expectation Scale format- 
over other rating scale formats include the 
following: (a) During the evaluation process, 
behavioral anchors force raters to attend 
more to each ratee’s actual behavior than to 
a global overall feeling about the ratee. (b) 
Behavioral expectation scales are more mean- 
ingful for raters because they uniquely un- 
derstand the jargon and situations used in 
the content of behavioral anchors; thus, ac- 
cording to the “meaningfulness hypothesis” 
(O' Donovan, 1965) behavioral expectation 
scale ratings should be less susceptible to the 
central tendency error. (c) Behavioral expec- 
tation scales provide relatively less ambiguity 
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£ in rater understanding of each dimension’s to do with scale effectiveness or, case 

! meaning because of the careful definitions of (d), actually may lower the al of 

prepared by samples of raters as well as the the scale to define effectively i erion 

4 added definitions supplied by the behavioral space. Even (a) above may hav: ng to 

> anchors on each dimension. do with operational scale effec s. 1 
Despite considerable praise for the Behav- merely deals with the potential o‘ le to 

' ioral Expectation Scale, to the best of the be less ambiguous. Thus, scale re y ing 
~ authors’ knowledge, only one study compar- dices for behavioral expectation s e not 

ing its effectiveness with that of a simpler needed nearly as much as data bear 


format has been published (Campbell, Dun- on the operational effectiveness oi ales, 
nette, Arvey, & Hellervik, 1973). Since the There have been a number of r scale- 
time and effort necessary to construct a be- format comparison studies (for a 1 w, see 
havioral expectation scale is great compared Peters & McCormick, 1966). In ge l, re- 
to other formats commonly used, studies sults suggest that formats which incl. both 
which examine the relative strength of be- dimension definitions and some kind“ verbal 
havioral expectation scale formats with sim- anchors are superior operationally t mats 
_ pler formats attempting to measure the same that do not possess these characte . OF 
criterion space seem to be in order. Sechrest’s particular interest to the authors a e Te 
_ (1963) incremental validity notion is rele- ported findings of Campbell et al. 737% 
vant here. Before the Behavioral Expectation These investigators compared ratir na 
Scale is declared superior to other formats, behavioral expectation scale format hose 
investigators should compare behavioral ex- done on a “summated” rating scale nat. 
bectation scales with less complex rating Store managers and their assistan each 
‘methods using dependent variables appropri- rated their subordinate department mə: gers 
ate to the determination of rating-scale ef- using a newly prepared behavioral cta- 
fectiveness. tion scale format and the summated © ales. 
Instead, scale reliabilities have often been Results of the Campbell et al. (1973 udy ~ 
Submitted as evidence supporting behavioral favored the scaled expectations format. “fore 
expectation scales (eg., Fogli, Hulin, & specifically, the behaviorally anchored scales 
Blood, 1971; Landy & Guion, 1970; Smith & yielded less leniency error, greater disc:imi- 
Kendall, 1963). Scale reliabilities for a given nant validity, and a clearer factor-anaiytic 
behavioral expectation scale dimension are solution than did the summated rating format. 
generally computed by splitting the sample The main purpose of the present study was 
of individuals Who scaled the anchors into to investigate the relative effectiveness of a 
= two equal sized groups. Next, for each group behavioral expectation rating scale applied in 
che mean scale values are computed for each a setting different from the one in which it 
anchor on this dimension. Finally, the two was developed. To accomplish this, the per- 
“sets of mean scale values are correlated (n of formance of Smith and Kendall’s original 
dhe correlation = number of anchors). As Behavioral Expectation Scale was compared 
Others have pointed out (e. g., Borman, 1972; to the performance of a less complex numeri- 
Fogli et al., 1971; Zedeck & Baker, 1972), cally anchored rating format using five de- 
= scale reliabilities are inadequate for predict- pendent variables seen as important for judg- 
ins a format’s operational effectiveness. In ing rating scale effectiveness. The dimension 
4 fact, this kind of internal reliability can be names and definitions were the same for 
raised by: (a) choosing anchors with small both formats, but the simpler scale did not 
standard deviations; (5) choosing anchors contain the behavioral anchors found on the 


g whose means . relatively far apart; (c) Behavioral Expectation Scale format. 
having more raters assign values to the an- 
; : METHOD 
cChors during the retranslation process; and $ f in two 
(d) including fewer anchors for each dimen- VV 


i . 8 large Northern California hospitals participated in 
men influences, although im- the study. The number of ratees in each subgroup 


proving scale reliability, either have nothing varied from 7 to 29, The median was 16. 


)EVELOPMENT AND USE oF BEHAVIORAL EXPECTATION SCALES 


Subgrouy cach hospital were randomly 
assigned to i two categories. Rater pairs from 
12 of the performed ratings using the Be- 
havioral |! tion Seale (e.g., both raters from 
the grou; as number one used the Behav- 
ioral Exp Scale at this point), Raters from 
the other ‘ups accomplished their performance 
evaluatio: z the simpler scale. After approxi- 
mately tl ceks, those who had used the Be- 
havioral ition Scale first evaluated the same 
individu: the simpler scale. Raters who had 
been in ple scale group first switched to the 
Behavior ectation Scale for the second ad- 
ministra r every rating made, each rater was 
also ask record confidence estimates from 1 
(not at wnfident) to 9 (extremely confident) 
accordin ow sure she felt about each evalua- 
tion m 

Since ater reliability can be influenced pro- 
foundly using raters who observe ratees under 
differen‘ litions (Borman, 1972; Campbell et al., 
1970; & Baker, 1972), the investigators 
outline requirements for the participation of 
the ra irs in the study. First, supervisors at 
both is were asked to provide rater pairs 
whose nembers possessed similar working rela- 
tionshi; h nurse ratees. Second, the investigators 
request at the sample contain (for each sub- 
group of raters who observed ratees for 
approxi eln the same amount of time per day. 
These t requirements were best satisfied by having 
head nurses and assistant head nurses from each 


Subgroi icipate as the rater pairs. It was felt 
that for organizations, satisfying the two stip- 
ulations for rater-pair participation discussed previ- 
ously made interrater reliability a legitimate cri- 


terion of scale effectiveness. The other four de- 
pendent variables used were means of each rater’s 
evaluations, standard deviations of the evaluations, 
mean interdimension (halo) correlations for ea 
rater, and mean confidence ratings. 

Ten interrater reliability coefficients were avail- 
able for each participating subgroup, 5 for each 
pair’s first. administration on each dimension, and 
5 for the second administration on the same di 
mensions. Each interrater reliability coefficient was 
transformed to a z score using the Fisher r to £ 
transformation. These z scores were subsequently 
used in an analysis of variance to be discussed later. 
Data points for the dependent variable, means, con- 
sisted of each rater’s mean rating level across ratees 
on a single dimension and administration. Since the 
scale points differed for the two formats (0-2 for 
the Behavioral Expectation Scale and 1-9 for the 
simpler scale), each mean behavioral expectation 
scale rating for each rater was transformed by mul- 
tiplying by 4 and adding 1. Standard deviations’ 
data points were individual rater’s standard devia- 
tions of their evaluations across ratees on a single 
dimension and administration. Standard deviations 
of each rater's evaluations on the Behavioral Ex- 
Pectation Scale ratings were multiplied by 4 to 
arrive at indices equivalent to those for the sim- 


oe) ee Sree ee ee ae 


TABLE 1 


Benavion Exrrcration Scare (BES) axe mE 
burn ScaLe (SS) Means on THe Five Derexorst 


VARIANLES 

Behav- 

ior ex- 

Dependent variables pecta- 
tion 

scale 
Interrater reliability j ól 
Mean level of ratings (Er 
Standard deviations of ratings 1.02 
Confidence level 6.50 
Dimension intercorrelation (halo) 73 


s Means and standard deviations of ratings are adjosted 
differences in the number of scale points on the havior 
pectation Scale and the Simpler Scale, 


pler scales. “Halo” data points were established by 
taking the mean (using Fisher r to = transf 


scale ratings. Finally, each raters mean con 
rating for each administration on each dimension 
was taken as a data point for the confidence level 
dependent variable. 

The research design used for four of the five 
dependent 


variables was a three-way (Format 
Dimension X Administration) analysis of variance 
(ANOVA). That is, four separate univariate 8 X 2 
X2 ANOVAs were done with interrater reliability, 
means, standard deviations, and confidence level 


as dependent variables. A two-way (Format X 
Administration) ANOVA was used with the de- 
pendent variable, “halo.” 


RESULTS 


Table 1 shows the means on the five de- 
pendent variables in the present study for 
each of the two formats. Table 2 is an analy- 
sis of variance (ANOVA) summary for the 
same five dependent variables. 

Although mean interrater reliability was 
higher for the Behavioral Expectation Scale, 
the difierence was not significant. In addi- 
tion, mean dimension intercorrelations (halo) 
were virtually equivalent for the two formats. 
Differences between format performance in 
terms of means and standard deviations ac- 
tually favored the simpler scale. Adjusted 
mean ratings on the Behavioral Expectation 
Scale were significantly higher (5 < 01) than 
on the simpler scale (Table 2). That is, the 
simpler scale showed less leniency effect that 
did the Behavioral Expectation Scale. Stan 
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TABLE 2 


ANALYSIS OF VARIANCE SUMMARIES FOR Tite DEPENDENT VARIABLES: INTERRATER RELIABILIT 
LEVEL, STANDARD DEVIATION Or RATINGS, CONFIDENCE LEVEL, 
AND ĪNTERDIMENSION CORRELATIONS 


j 5 dimen- 
Interrater Mean level Standard Confidence orrela- 
reliability ofratings | deviations level ons 
Sources of variance df 2 ol ratings ohe 
4 us | F | r s F | oF 
. — — — — — — — 
Formats (A) 11 29 | 9.43 8.05 | 1.10 | 5.00% | 1.50 
b - l] ji 
Dimensions (B) 4 37 | 3.31 | N19 | 1.20 1 
‘ Administrations (C) | 1 49 | | 15 1 27 | | 8.66 7.20* 
AXB 1171 19 04 | .03 
AX ¢ 1 | 48 | 8.50 8.18 | 2B | 1.66 | 
BXC 4 |0| | «ll .02 30 | 
AXBXC 4 55 | 42 | -09 | 17 
f 1 | 
Note. The first author will supply further details of the ANOVA on request. Only significant Fs are given, 
* The ANOVA with “halo” as the dependent variable was a two-way, 2 X 2 (Formats X Administrations 
„ <05. 
<0 
dard deviations of ratings on the simpler specific behavior orientation of the ies, di- 
7 scale were significantly (p< .05) greater mension intercorrelations were approximately 
the same for both formats. In short, under 


4 than the adjusted standard deviations of 
evaluations done on the Behavioral Expecta- 
tion Scale. Thus, raters tended to spread out 
their ratings more on the simpler scale. Al- 
though this is not necessarily a virtue, scales 
that encourage raters to fairly discriminate 

among the performance of different ratees to 
a greater extent than do other scales are 
generally considered superior. Finally, raters 
Were more confident in their evaluations us- 
ing the Behavioral Expectation Scale than 
they were using the simpler scale, but not 
significantly so. 


DISCUSSION 


In a situation where interrater agreement 
seemed to be a fair benchmark for judging 
rating-scale effectiveness, no significant dif- 
ference was found between the Behavioral 
_ Expectation Scale and a much simpler for- 
mat on this criterion. In addition, because of 

the high mean “halo” for both formats, it 

appeared that most ratings were a reaction 
to overall ratee favorableness without any 
careful distinction made of performance on 
different dimensions. Although behavioral ex- 
pectation scales should be especially good at 
lowering “halo” because of the relatively 


j 


these conditions, the Behavioral Expectation: 
Scale exhibited no superior operation char- 
acteristics. 

One possible explanation for the lack of 
superiority shown by the Behavioral Expec- 
tation Scale in this study is that nurse raters 
may have had trouble relating to certain out- 
of-date behavioral examples contained within: 
the behavioral expectation scale used in this 
study. A supervisor nurse did suggest to the 
second investigator that a few of the anchors 
were no longer appropriate as performance 
examples for nurses in 1972. In addition, 
raters did not have the opportunity to pre- 
pare their own dimensions and anchors. Rater 
participation in scale preparation may be an 
important variable for determining rating for- 
mat effectiveness. Greater participation in the 4 
performance-evaluation process may increase 
the raters’ motivation to evaluate individuals 4 
carefully. As Campbell et al. (1973) have in- 
dicated, development of a behavioral expecta- 
tion scale instrument seems to be a valuable 
learning experience for the raters. Participa- 
tion forces the raters to define carefully ef- 
fective performance and assess ratees accord- 
ingly. 


nnen, 


` 
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4 2 * 
JEVELOPMENT AND USE or BEHAVIORAL EXPECTATION Scares 


Perhaps learest message communicated 
by the re f this investigation is that a 
practition: 10 wishes to use a behavioral 
expectatic Je which has been developed 
in a diffe setting cannot assume that the 
scales gi any better criterion informa- 
tion tha: a much simpler format. That 
is, the p metric superiority of the Behav- 
ioral Ex tion Seale method under condi- 
tions in ch the raters themselves develop 
the rati ales has not been directly chal- 
lenged ‘he results in this study. Instead, 
the fin suggest that more research is 
needed determine the conditions under 
which havior expectation rating system 
might psychometrically superior to sim- 
pler pe /ormance-evaluation formats. 

REFERENCES 

Boraca C. The rating of individuals in or- 
ganiz An alternative approach. Unpub- 
lis. loctoral dissertation, University of Cali- 
fornis, Berkeley, 1972. 

Came J. P., Duxnerte, M. D., Arvey, R. D., 
Kei, L. W. The development and evalu- 
ation of behaviorally based rating scales. Journal 
of Applied Psychology, 1973, 57, 15-22. 

CAMPEELL, J. P., DUNNETTE, M. D., LAWLER, E. Ey 


a N 


I, & Wzcx, K. E., Ja. Managerial 

performance, and effectiveness. New York: 
Graw-Hill, 1970. 

Dunnerte, M. D. Personnel selection and 
ment. Belmont, California: Wadsworth, 1960, 
Foret, L., Hunts, C., & Brooo, M. lop 5, 
first level behavioral job criteria. Journal of 

plied Psychology, 1971, 55, 3-8. 3 

Lanpy, F. J., & Guios, R. M. Development 
scales for the measurement of work mo tio 
Organizational Behavior and Human Per} 
1970, 5, 93-103. 

O'Donovan, 
meaningfulness? Psycholo; 
358-372. 

Perters, D. L., 


multitrait-multirater „Org 
havior and Human Performance, 


(Received January 15, 1973) 


S 1974, Vol. $9, No. 2, 202-706 


3 
F 
A 


E. of Applied Prychology 


| WHAT DOES THE LEAST PREFERRED CO-WORKI 
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The Least Preferred Co-Worker Scale has been variously interpreted as a 
measure of leadership style or as a measure of the cognitive complexity of 
the kader. In this study, using 112 subjects including business students, 
managers and systems analysts, the Least Preferred Co-Worker scale scores 
were correlated with two other measures of differentiation and with three 
cognitive measures in an attempt to assess the interpretation of the Least 
Preferred Co-Worker Scale as a cognitive measure. Results show that while 


with cognitive simplicity, 


unequivocably cognitively complex. 


CFD 


Despite extensive research over the past 20 
years, interpretation of Fiedler’s (1958, 1967) 
r Least Preferred Co-Worker Scale (LPC) re- 
mains elusive. Some have suggested that a 
manager's LPC score can be interpreted as a 
direct indicator of his leadership style (Fied- 
ler, 1967, 1971). Others, relying on the fact 
‘that LPC scores have been found to corre- 
late with several measures of cognitive com- 
| plexity (Steiner & McDiarmid, 1957; Van- 
f noy, 1965; Weissenberg & Gruenfeld, 1966), 
have advanced cognitive-based leadership 
bene explaining the relationship of LPC 

cores to leader behavior (Foa, Mitchell, & 
Fiedler, 1971; Mitchell, 1970). vet, despite 
> these recent attempts to raise LPC-based 


leadership studies to a higher theoretical 
plane, there 


rounding the 
LPC scale. 


= Interpreting LPC scores as measures of the 
ability to discriminate links them to cognitive 
complexity theory. The scale consists of 16 
items presented in semantic differential for- 
mat. Respondents are asked to describe their 


is still considerable doubt sur- 
cognitive interpretation of the 


The authors wish to acknowledge the helpful 
comments of U. G. Foa, D. T. Hall, R. Mansfeld, 
T. R. Mitchell, R. I. Simon, and H. C. Triandis on 
an earlier draft of this paper. 

Requests for reprints should be sent to Martin G. 
Evans, who is now at the Faculty of Management 
Studies, University of Toronto, Toronto MSS IVa, 
Canada. 


the low end of the Least Preferred Co-Worker Scale Seems to be associated 
the high-least preferred co-worker subject was not 


least preferred co-worker by indica here 
he should be positioned along an « inter- 
val continuum anchored by the pol jec- 
tives. These responses measure a res ent's 
ability to differentiate among the c ter- 
istics of an individual as a pers eg., 
items such as cold-warm; frie» un- 
friendly) and also as a co-worker (eg. ems 
such as enthusiastic-unenthusiastic: ef= ent- 
inefficient). Since the ability to mako fine 


discriminations characterizes cognitively com- 
plex individuals ( Schroder, Driver, & Streu- 
fert, 1967) the LPC scale has been inter- 
preted to be a measure of cognitive com- 
plexity. 

A problem arises with this interpretation, 
however, when one attempts to define the na- 
ture of the differentiation process captured 
by the LPC scale. Initially, differentiation 
Was measured by an assumed similarity be- 
tween opposites measure, computed as the 
square of the difference between the mean 
rating of one’s most preferred co-worker and 
one’s least preferred co-worker. Later, it was 
found that scoring on the most preferred co- 
worker scale was consistently high and that 
the assumed similarity between opposites 
measure and LPC scores correlated consis- 
tently; therefore, only the LPC score was 
retained (Fiedler, 1967). Thus, the average 
score on the scale items describing a least 
preferred co-worker became the LPC score. 
Low scores, resulting when a respondent 
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stereot) ı poor co-worker as also being 
an un ble person, were interpreted as 
indicat tendency to dichotomize people 
into g r bad categories, a tendency at- 
tribute conceptually simple individuals. 

Stei ind McDiarmid (1957) demon- 
strate t Fiedler’s assumed similarity be- 
tween sites score was jointly determined 
by ar vation (mean score) effect and a 
profil similarity effect, the latter mea- 
sures he square of the difference between 
each preferred co-worker- and least 
pref co-worker-item score summed over 
all | tems of the scale. High scores on 
their file-dissimilarity measure were in- 
terp | as reflecting a high degree of dif- 
fere ion. Furthermore, based on the fnd- 
ing elevation and profile-dissimilarity ef- 
fect vere independent of each other, it was 
sug d that their two measures tapped dif- 
fer bes or styles of differentiation. 


F .ontly, Mitchell (1970) and Foa et al. 
(4% have suggested that the 16-item LPC 
scale | Fiedler, 1967) can be partitioned into 
thre distinct subscales: (a) tapping a 
(3 cms), (5) an interpersonal (11 items) 
as wcll as a task/interpersonal, or (c) mixed 
dimension (2 items). The mean scores on 
these three dimensions were used to investi- 
gate the ability to discriminate which was 
measured in terms of dimensional differentia- 
tion. That is, the relationship among the 
scores on these dimensions was interpreted to 
be the indicator of the extent to which an 
individual stereotypes or differentiates his 
least preferred co-worker. 

Thus, data from the LPC instrument have 
been used to produce three different indexes 
of differentiation: (a) elevation, (6) dimen- 
sional differentiation, and (c) profile dissimi- 
larity. Each of these indexes reflects a dif- 
ferent type or degree of differentiation and 
hence each can be considered to be a corre- 
late of cognitive complexity. But leadership 
studies have generally relied only on LPC 
(le., mean or elevation) scores as the mea- 
sure of the ability to differentiate. 

This study is designed to contribute to the 
understanding of the LPC score as a measure 
of differentiation and hence as a correlate of 
cognitive complexity by determining (a) the 
extent to which differentiation measured by 


LPC is also related to dimensional 
nation and profile dissimilarity and (6) tb 
extent to which LPC relates to other measu 

of cognitive complexity. 


METHOD 

Sample 
Subjects included 112 night school students 
business (both masters of business admi i 
(MBA) graduates and undergraduates), m. 
and systems analysts. About $0% of the 
had a university degree, and the median age 
30-35. 


Instruments 


plexity. The short form (Trodahl & Powell, 1965) 

the scale developed by Rokeach (19) was used 

measure authoritarianism and closed-minded 
(dogmatism). To measure intolerance of am 
the scale developed by Budner (1962) was used 
and Brim's (1955) scale was used to measure Gesine 
for certainty. Individuals who are cognitively com 
plex are expected to be nondogmatic and to rant 
and comfortable 


uncertainty (high scores). 
RESULTS 
LPC as a Measure of Differentiation 


subsample, the task and mixed dimensi 
correlated significantly (p< .001, n= 62), 
as did the mixed and interpersonal dimensions 
(p <..05). In the high LPC subsample, how- 
ever, no interdimensional correlation was sig- 
nificant at the Os level (n = 50). On the 
whole, these results support the position 
Foa et al. (1971) that the high-LPC indi- 
vidual discriminates by using different cogni- 
tive dimensions. Thus, it appears that : 
scale does measure dimensional discrimina- 


2As age and education are often related to cogni- 
tive dimensions (see Ondrack’s (1971) work on dog- 
matism). For all the analyses, partial correlations 
were computed controlling for these variables. : 
did not affect the pattern of jnterrelationships re- 
ported in this article. . 


tion. It should be noted, however, that this 
finding only holds for Fiedler’s (1967) ver- 
© sion of the scale. Work with an earlier ver- 
i sion of the scale showed that moderate least 
Preferred co-worker subjects (as measured 
by this early scale) were the high differenti- 
ators (Evans, 1973). 
x To measure profile discrimination, a stan- 
dard deviation was computed for each re- 
T spondent’s scores on the 11 interpersonal 
items of the LPC scale. The correlation of 
T this measure with LPC scores for the whole 
> sample was .29 (p < 01, n= 112), while the 
© correlations with the high and low LPC sam- 
T ples were —.14 (ns, n= 50) and 31 (p< 
05, n = 62), respectively. Thus, although the 
significant overall correlation indicates that 
LPC measures profile differentiation, the alge- 
braic signs of the subsamples indicate that 
the relationship is nonlinear. 
However, the correlations between LPC 
scores and the above-mentioned type of dis- 
~ crimination measure may be artifactual. Mea- 
sures of profile discrimination based on the 
sum of squared absolute scores or of squared 
deviations from the mean may be a subtle 
function of the LPC scale (ie,, mean score) 
because the extent of profile discrimination 
Possible is constrained by the mean score. 
For example, respondents with means in the 
á -5 range (on an 8-point scale) can employ 
 fatings of 1 or 8 as long as they are evenly 
distributed, while those with very high or 
very low scores cannot use as great a range 
of discrimination and still produce the same 
LPC score. Thus, the maximum possible pro- 
file variation is 4 symmetrical curvilinear 
function of LPC scale with a maximum score 
of 4.5. Hence, this may account for the curvi- 
linear relationship found. 

To correct this bias, the measure of profile 
discrimination was converted from an abso- 
lute to a relative Measure by dividing each 
standard deviation by the maximum possible 
standard deviation that could be achieved 
for that LPC score. Correlations were then 
computed between this new measure and LPC 
score, both for the sample as a whole as well 
as the high and the low subsamples. These 
correlations were in the Tange of —.6 to 3 
and were not significant, Thus, it appears that 
when measures of profile discrimination are 
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corrected for attenuation in t nge of 
possible variation, they are in ent of 
LPC scores. Hence, LPC and pro scrimi- 
nation measures appear to tap ifferent 
types of discrimination. 

Based on these results, it aj that 
while the LPC scale may reflect nsionaly 
discrimination, it is not related ile dif- 
ferentiation. Thus, it is not a robu dicator 
of the ability to discriminate one ist pre- 
ferred co-worker. 

LPC Scale as a Correlate oj Cog 
Complexity 

Intercorrelations between the scores 
and the cognitive measures reve.’ everal 
interesting relationships. It was that 
high dogmatics tended to score } n the 
LPC scale (r= 21, p< .05, 112), 
while neither the intolerance for iguity 
nor the tolerance for certainty measi:es cor- 


related significantly (r = —.04 and re- 


spectively). Among the cognitive m:sasures 
themselves, dogmatism and ambiguity corre- 
lated significantly ( < 001) while un- 
certainty measure did not correlate signifi- 
cantly with either. 

Although the uncertainty measure un- 


related to the LPC scores, it did ap 
moderate the LPC-dogmatism rela ship 
slightly. The pattern of means (low dogma- 
tism-high tolerance for uncertainty = 3.84; 
low dogmatism—low tolerance for uncertainty 
= 3.88; high dogmatism-high tolerance for 
uncertainty = 3.85; high dogmatism-—low tol- 
erance for uncertainty = 3.48—the latter cell 
showed a trend at the p = .10 level of being 
different from the other three cell means) 
Suggests that the high least preferred co- 
worker person is high in tolerance for uncer- 
tain situations, coupled with high or low dog- 
matism; in addition, he may be low in dog- 
matism coupled with low tolerance for un- 
certainty. Thus, the high least preferred co- 
worker person can be either cognitively com- 
plex (low dogmatism, high tolerance for un- 
certainty) or only moderately so. On the 
other hand, the low least preferred co-worker 
individual tends to be characterized by cog- 
nitive simplicity-high dogmatism and an 
intolerance of uncertainty. Thus, it appears 
that the LPC scale is not a strong correlate 


to 


of 0 
appt 
scale 
to di 
defin 


ive complexity. In particular, it 

at high and low scores on the LPC 

not differ in degree but may relate 

at cognitive attributes—the low end 

‘ly being a measure of low complexity 

and eotyping, the high end being some- 
what ‘ndefinable. 

1 plore these possibilities further the 

ıs again split at the mean LPC score 

LPC scores for each subsample were 


cor with the three cognitive variables. 
Th sults indicated that the significant 
ove jegative correlation of LPC scores 
wil matism is contributed solely by a 
sigi nt negative correlation between these 
va (p <.05) for the low LPC sub- 
sam, The high LPC subsample correlation 
wa- >t significant. Thus, it appears that very 
lo ast preferred co-worker subjects are 
cle dogmatic and that high least preferred 
co~ ker subjects may either be low or high 
dc itics, 


relations of the LPC scores with am- 
bis and uncertainty computed for high 
and LC subsamples were not significant 
anc sence did not clarify the relationship of 
LPC scores to these measures. On the whole, 
thersiore, this evidence suggests that the low 
least preferred co-worker respondent lacks 
complexity and the ability to differentiate. 
However, the high least preferred co-worker 
subject may vary along the complexity di- 
mensions. 


Discussion 


These findings do not provide favorable 
support to the interpretation of the LPC 
scale as a measure of cognitive complexity. 
The results suggest that the low least pre- 
ferred co-worker subject is a poor differenti- 
ator and that the high least preferred co- 
worker subject may be poor or good at dii- 
ferentiating. The implications of this finding 
may shed some light on the criticism by 
Graen, Orris, and Martella (1970) that the 
correlations in the octants of Fiedler’s con- 
tingency model (Fiedler, 1967) often do not 
cluster appreciably. While Graen et al. 
(1970) and Mitchell et al. (1970) suggest 
that this lack of clustering may reflect the 
lack of a good measurement of the situational 
characteristics (leader power, Jeader-member 


cent study by Graen, Orris, and Alvares 
(1971) seems sufficiently well controlled to 
ensure that cells on the continuum were 
appropriately filled, and again discrepancies 
between the correlations in each cell were 
found. 


The research reported in this study sug- — 
gests an alternative explanation for these dif- 
ferences. It has been found that a low LPC ~ 
score was consistently an indicator of cogni- 
tive simplicity in that it tended to be associ- 
ated with the combination of high dogmatism ~ 
and high intolerance for uncertainty. How- 
ever, the high least preferred co-worker indi- 
vidual could be one of several types: (a) 
cognitively complex—undogmatic and com- 
fortable with uncertainty; and (6) cogni- 
tively mixed—undogmatic but uncomfortable 
with uncertainty, and dogmatic but com- 
fortable with uncertainty. The prediction? 
made by Foa et al. (1971) about the 
strengths of the correlations (between LPC 
scores and performance) in the various oc- 
tants therefore only holds when the high 
least preferred cc-worker individual is cogni- 
tively complex—undogmatic and tolerant of 
uncertainty. The presence (in any experi- 
ment, in any octant) of high least preferred 
co-worker leaders of lower complexity (cog- 
nitively mixed) weakens the predicted rela- 
tionship (makes a predicted negative relation- — 
ship less negative or makes a predicted posi- 
tive relationship less positive) and might, 
therefore, explain the range of correlations 
found by Fiedler (1967) in each octant. 
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AN G. WEINSTEIN? 


rnegie-~-Mellon University 


predicted by 


compensation. After adjusting for 


Mo: believe that graduate education in 
mana. ent is instrumental in upgrading the 
skills -í our managerial population. Perhaps 
the est evidence of this is the high de- 


man master of business administration 
(MP) graduates by leading business firms. 
Suc! belief, however, has recently been 
chal d in a study by Livingston (1971) 
whic), asserted that formal managerial educa- 
tion cid little to prepare students for suc- 
cessful managerial careers since grades in 


neither undergraduate nor graduate school 
correlate with success in management. 

This critique deserves analysis. First of 
all, correlation analysis determines only the 
differential effect of academic performance 
on business success. If graduate training 
were to have a more or less uniform effect on 
all graduates, then the correlation is nearly 
zero and it is erroneous to conclude that 
graduate training has no effect on managerial 
performance. Second, part of the evidence 
used by Livingston came from a comparison 
of senior executives attending an advanced 
management program at Harvard with Har- 
vard MBA graduates having comparable 
years of work experience. The former group 
earned about a third more, on the average, 
than the latter which, according to Living- 

1An abstract of this study was presented at the 
79th Annual Convention of the American Psycho- 
logical Association, Washington, D.C., 1971. 

2 Requests for reprints should be sent to Alan G. 
Weinstein, Graduate School of Industrial Admin- 
istration, Carnegie-Mellon University, Schenley Park, 
Pittsburgh, Pennsylvania 15213. 
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DICTING MANAGERIAL SUCCESS OF MASTER OF 
USINESS ADMINISTRATION (MBA) GRADUATES ` 


Annual compensation of 136 alumni of a graduate management program Was 
using years of work experience, graduate grades and data 
obtained prior to graduate school admission. Stepwise multiple regression 
analyses revealed that work experience was more predictive of staff than line 
the effect of work experience, graduate 
grades were significantly correlated to both line and staff carnings though the 
incremental predictive validity was much higher for line managers. Several 
pregraduate academic and nonacademic variables were also significant pre- 
dictors. The results support the ability to predict managerial compensation 
prior to career and graduate school entry. 
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ston, was sufficient evidence to question the 
value of the MBA program. Comparing the 
earnings of the high performers of one group ; 
with those of the mean of another did not 
deter Livingston from making his point. To 
make a valid assessment of the treatment 
effect of MBA training, a comparison of 4 
sample of MBA graduates with a representa- 
tive sample of non-MBA graduates with simi- — 
lar backgrounds is needed. E 
The authors could find no published re- 4 
search on the treatment effect of MBA train- 
ing on subsequent managerial success. In ad- 
dition, chere has been relatively little pub- 
lished research focusing on the differential 
relationship of formal managerial education 
with career success. Significant relationships 
have been reported between managerial com- 
pensation and grades in elective MBA courses 
(Marshall, 1964; Williams & Harrell, 1964) 
as well as second year MBA grades (Harrell, 
1972). In addition, Campbell, Dunnette, Law- 
ler, and Weick (1970) cite research in which 
undergraduate academic performance related 
to several criteria of managerial success. 
The purpose of the present study is to (a) 3 
examine the differential effect of MBA per- 
formance on career success, and (b) develop 
an early identification model to aid in the 
selection of students for an MBA program. 
Specifically, the relationship was explored be- 
tween undergraduate performance, extracur- 
ricular activities and other nonacademic 
variables, and subsequent managerial suc- 
cess. To evaluate the differential effect of p 
7 


E 


graduate school training, business school 
graduate grade point average (GPA) was 
used in conjunction with the preadmission 
variables in predicting managerial compensa- 
tion. 


METHOD 
Criterion 
From an alumni survey conducted at the Gradu- 
ate School of Industrial Administration, of Car- 
negie-Mellon University in 1969, several criteria of 
managerial success were obtained such as current 
annual compensation and organizational level at- 
tained. Srinivasan, Shocker, and Weinstein (1973) 
had three groups of senior executives compare pro- 
files of managers on these criteria and make overall 
judgments of relative success. A composite cri- 
- terion that was developed, using these judgments, 
revealed that current compensation was regarded as 
the single most important criterion. Laurent (1970) 
also arrived at the same conclusion after analyzing 
several criterion variables by factor analysis. In 
addition to this empirical support, annual compen- 
Aion has “face validity“ as a criterion of success 
to managers and to faculty in business schools. 
Therefore, annual compensation was used as the 
“measure of managerial success in this study. 


Alumni graduating between 1960-1966 were 
chosen for the study. The rationale for eliminating 
1967-1969 alumni from the sample was based on 
4 the desire to reduce spurious effects of grades on 
current compensation. Graduate grades are corre- 
lated with initial earnings perhaps because faculty 
use these grades to guide their recommendations, 
and companies use these recommendations and 
transcripts to guide their selection and compensa- 
tion. After a few years of experience, however, this 
effect should diminish and performance on the job 
Would be expected to have a greater effect on cur- 
dent compensation. For each year of the sample 
investigated, correlational analyses revealed no sig- 
nificant relationships between initial and current 
compensation.“ 
The criteria for inclusion in the sample were: 
(a) employment in a managerial position, (b) em- 
ployment in the United States, and (c) existence of 
data on preadmission variables, graduate grades and 
= managerial compensation. A preliminary analysis 
showed differential compensation histories for line 


Other studies by Brenner and Lockwood (1963) 

and Haire, Ghiselli, and Gordon (1967) have found 

a moderate to high correlation between initial and 

later compensation for the span of years included in 

this study. In contrast to the present investigation 

both of these studies analyzed salary relationships 

within single firms, and the high correlations found 

may partly be due to the effect of a common 
compensation system. 
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and staff managers. In addition, they differed in 
nature of the tasks they performed, thus suggesting 
separate analyses. 
Line and staff designations were obtained by have 
ing each questionnaire respondent determine to 
which classification he belonged. The broad defni- 
tions given were: (a) staff No formal managerial 
responsibility.“ Typical job titles were 
accountants, consultants, and specialists. 
“Formal managerial responsibility.” Job 
cluded various types of supervisors, 0 
gers, and company officers. The sample ine 136 


persons; 74 in staff and 62 in line jobs. 


Predictors 
The predictor variables used in this study were 
as follows: * a 


Predictor Variables 


Administration Test for Graduate Study in 
Business (ATGSB) scores (Verbal, Quantita- 
tive, Total) 

Candidate Excellence by School! indexes (Ver- 
bal, Quantitative, Total) ` 

Undergraduate grade point average by year 
(freshman, sophomore, junior)“ 

Undergraduate grade point average by subject 
area (mathematics and statistics, science, en- 
gineering, economics, humanities, and social 
sciences excluding economics, management, 
computer science) f 

Undergraduate college major“ (same areas as 
above, excluding computer science) 

Rating of scholastic recognition 

Rating of offices held 

Rating of involvement in social organizations 

Rating of involvement in sports activities 

Rating of letters of recommendation k 

Weighted graduate study (by length and rele- 
vance to management) 

Age 

Marital status (see Footnote 6) 

Graduate (MBA) grade point average 


Most of these variables were taken directly fro 
the transcripts and applications. However, relevan 
information about nonacademic variables existed 
either narrative or descriptive form so that meas! 
of these descriptions needed to be developed. 
Extracurricular activities. Four independent scales 
were developed: (a) a 9-point scale of scholastic 
recognition; this included awarded scholarshi 
membership in honorary societies, Dean’s List, ete: 


This index was taken from tables compiled b; 
the Educational Testing Service on the average 
ATGSB score for all students taking the test from i 
the candidate’s undergraduate school. 

5 Undergraduate grade point average (senior) was 
not available in most cases, since admission decision 
were made prior to completion of the senior ye 
11 predictors were treated as dummy va 

es. 


. 
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(b) a cale of offices held in organizations; 
a highe was given to those who held kader- 
ship p in a broad range of organizations ; 
tc) a cale based on the extent of involve- 
ment dth of social-organizational activi- 
ties; ar point scale of involvement in sports 
activiti zing from casual playing of athletic 
games “ge level participation. 


Leti recommendation. This rating, on a 
9. poir 


sisten 
menti certain qualities such as leadership po- 
tenti motivation. Two graduate students 
train dentify these qualities rated a sample of 
30 « dependently. The interrater reliability for 
these was r = .90. 

Gr All undergraduate grade point averages 


(GP cre computed 
A -3, C=2, D=1, and Failure =0 for 
cac? >. Graduate (MBA) grade point average 
was puted on a 9-point scale with A+ =9 

D to assign courses to content areas, emphasis 
was ed on the skills used and the nature of 
the nt covered in a course. Course descriptions 
wer d to guide judgments whenever possible. 


e ing Compensation for Effect of 
wW Experience 


current annual compensation was used as 
the terion, the number of years of work experi- 
en is expected to have an important effect, thus 
nece--itating some form of co . 


to current compensation. A multiple regres- 
s on current annual compensation (Y) with 
ycars of work experience before graduation (WEBG) 
and after graduation (WEAG) as independent varia- 
bles gave the following result: 


Y = 10,990 + (1,120 X WEBG) + (873 X WEAG). 


Consequently, adjusted current annual compensation 
(Viet) can be written as: 
Yaa V= (1,120 WEBG) — (873 X WEAG).- 


In the case of line managers, the plot of current 
compensation against the number of years since 
graduation approximated à linear relationship UP to 
an experience level of 6 yeats; but this relationship 
tailed off for alumni with seven to nine years o 
experience, Studying this effect more exactly would 
have involved a multiple regression with seven 
(dummy) variables, each corresponding to & different 
level of experience (i.e. three; four. . nine years). 7 


See Cohen (1968) for a discussion of dummy 
variable regression models. A dummy variable D is 
a binary variable indicating presence (D=1) or 
lack of presence (D =0) of an attribute. This tech- 


lated 


Since some of the graduating chase bad too 
alumni, they were placed into four groupe © 
sponding to (a) cight to nine years, (6) * 
seven years, (c) four to five years, and (d) 
years of experience. A multiple regression with U 
current compensation (Y) as the dependent 

and these four dummy variables (Di, Da D. Da 
as independent variables gave the following result 


Y = 20,511 + 1,875D. + 3 
1,838D, — 1,041 Da — 2. 4 


an F test indicated that the above equation 
counted for significantly more variance (p< 2 
than a lincar regression on the same data. In 
case of line managers, it was not necessary tO 
just for years of work experience before duation, 
since there was no significant effect. Consequeni ys 
the adjusted compensation (V.) for line managers 
was computed as: 


v. u = Y- 1873D — 18D, + 104, + 2% 


RESULTS 


The results of (double) cross-valid: 
stepwise multiple regression analyses are te 
ported in Tables 1 and 2 for staff and li 
managers, respectively. The columns desi 
nated as (1), (2), and (3) summarize th 


(1) predicting unadjusted compensation using 
preadmission, graduate grade point average 
and years of experience variables, (2) pre- 
dicting adjusted compensation using pread- 
mission variables and graduate grade point 
average, and (3) predicting adjusted com- 
pensation using preadmission variables alone. 
Column (3) represents an early identification — 
model of career success at the time of entry 
into graduate school. The difference in the 
explained variance between columns (2) and 
(3) represents the incremental validity of 
graduate school performance. ; 

The number of years of work experience 
had à substantially greater effect on staff 
managers than line managers. This is dem- 
onstrated by the differences in standard devi- 
ations of adjusted and unadjusted compensa- 
tion for staff (2,683 versus 3,722) and line 
(5412 versus 5,707). Also, as demonstrated 
in Table 2, a substantial improvement in pre. 
nique allows for estimating nonlinear effects of a 
variable. x 

s For a manager with six years experience, D., 
D:=1, D. = O, and D. 2 O; for a manager ith 
three years experience, D. o, Da=0; D. O, and 
Die 


ee 
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= spectively 


sample 2, 


| tween predicted and actual criterion scores for the enti: 
ures. 


dictability of staff compensation occurs when 
work experience is included as a predictor. 
For line managers, there is only marginal 
improvement. 
In the population studied, there were clear 
findings to support the ability to identify 
ig successful managers prior to career entry, 
Staff managers differed considerably from line 
managers in the prediction of success. Gradu- 
ate grades were significantly correlated with 
adjusted compensation for both Bus (7 = 
49, P<.01) and staff (r= 24, p< 05) 
managers, though the incremental predictive 
validity was much higher for line managers, 
For staff managers, involvement in social or- 
ganizational activities was predictive (p < 
05) whereas for line managers, involvement 


TABLE 1 
RecresstoN ResuLTs on (Starr) MANAGERIAL COMPENSATION 
= — — = = — — — — 
(1) (2) (3 
| 1 bia * LTO 
f Beta Weights (on entire sample) der 
| | oe clation 
N 8 sb hal With 1969 Same as ( ie 
riables 2 $ a d 8 as (2 
ob With 1969 compensa | bat withou npen= 
eee i Le i using tion 
tion ass 1 graduate entite 
dependent | for years GPA as nple) 
variable or work predictor 
experience 
Years of work experience after | 
MBA 5.37 1.95 48˙˙ — - 
Years of work experience 
prior to MBA 1.51 1.40 39 — — a 
Graduate GPA 6.27 73 13 18 - 775 
Ratings of involvement in 4.85 2.03 14 19 .24°* 22 
social organizations 3 
Engineering major 54 50 3 23˙ 23 15 
Mathematics and statistics 
major 10 30 .15* .21* 22* 12 
Current compensation ($) 17,363" 3,722" 7 
df = — 67 69 70 2 
Re — = 75 38 34% 4 
(Double) Cross-Validated Re — — 8e phere 26** 
Note. Columns designated as (1), (2), and (3) summarize the stepwise regressions, MBA = master of busines- nistra= 
tion; GPA = grade point average, i 
* After adjusting for the number of years of work experience, the mean and standard deviation were 10,989 a 83, re 
ively. 


* Adjusted for degrees of freedom, using a standard correction formula (Nunnally, 1967, 
total sample was divided by placing odd numbered managers into subsample 1 an ji 

Multiple regression models 1 and 2 were developed from the corresponding subsa " d 

| Obtained for subsample 1 using Model 2 and for subsample 2 using Model 1. The double cross-validated R is the 


nto suba 
es were 


in sports activities was predictive (p < 0505 
Several other variables predicted adjusted 

compensation: junior grades (line) (p< 

01), mathematics and statistics major (line) 
(P< .05) and (staff) (p< .10), science 
grade point average (negatively for line) ($ 

< 05), and engineering major (staff) (p< 
10). 


Discussion 


The significant correlation of graduate 
grade point average and adjusted compensa- 
tion for both line and staff jobs directly 
refutes the claim made by Livingston that 
there is no association between performance 
in a graduate management program and 
managerial success. Graduate grades have 4 
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m 
TABLE 2 
REGRESSION RESULTS ON (LINE) MANAGERIAL COMPENSATION 4 
= — f 2 ——.. ee 
i 00 | Q) W | 
| er Ea —| Zero- 
Beta Weights (on entire sample) | order 
| ern 
* f with 
ables x SD S With 10% | Same as (2) | adjusted 
With 1969 cap te but without | compen= 
compensa using sation 
tion as adjusted graduat tire 
dependent | for years uate | (on en 
ie wick GPA as sample) 
4 predictor 
Dumm ible (8-9 years 
Work ence) 2h 70 214 — — — 
Dumm ible (6-7 years 
work ience) AS 67 274 -3 E = 
Dumn iable (4-5 years 00 
wor rience) 57 =. 114 =- — — 
Grad PA 6.14 72 gee be — ager 
Unde te (GPA) science | 2.69 Sl re 
Matt s and statistics 
ma 08 28 20˙ are ** ase 
Ratir olvement in 
8900 vities 1.19 1.74 14 15 e 21° 
Under ate GPA 
(Jus 2.88 58 16 16 ore 16 
Curre mpensation ($) 21,16% 5,707" — = 72 = 
df ee = 53 56 57 i 80 
A 2 2 ose 10 < 
(Double) Cross-validated R- — 5 * 


+. adjusting for the number of years of 
a and standard deviation were 15240 and S412, 
under Table 1. 
under Table 1. 


c is the negative of the sum 


p are after graduation from GSIA). 


greater effect on line jobs than staff jobs 
which may, in part, be due to the reduced 
variability of adjusted compensation in staff 
compared to line jobs. The coefficient of vari- 
ation of adjusted compensation was .24 and 
36 for staff jobs and line jobs, respectively. 

A related finding was the pronounced effect 
of work experience on staff jobs compared to 
line jobs. The reduction in standard devia- 
tion due to adjusting compensation for work 
experience was $1,039.00 and $295.00 for staff 
jobs and line jobs, respectively. One expla- 
nation for this difference is that staff work is 
more difficult. to evaluate with respect to 
organizational performance, allowing tenure 


Now. Co of the stepwise regrestions. GPA = grade point a d 
lumns designated (1), (2), and (3) summarize the ese the regression coefficients for the dummy ee the 
respecti: ye 


that the coefiici 
timation Se he a te dors in mice 3 coefficients ee tine dummy variables (the years of work exp 


ent for the dummy variable for 3 years work 1 
perience i 


to play a more important role in the reward 
system for staff jobs compared to line jobs. 
The significance of the early identification 
models based on academic and extracurricu- 
lar activity variables support the ability 10 
predict managerial success prior to entry into 
graduate school. With the exception of the 
mathematics and statistics major, the varia- 


“ples that enter the staff model differ from 


those that enter the line model. The reasons 4 
for these differences may be a function of 
differential abilities necessary for success and/ 
or self-selection, that is, different types of 
persons entering these two career patterns. 


More research needs to be conducted to test 
these hypotheses. 
` Although in a previous study, Srinivasan 
and Weinstein (1973) found seven significant 
5 predictors of graduate grades at the Graduate 
School of Industrial Administration of Car- 
negie-Nellon University, the present study 
3 indicated that only two of the predictors of 
adjusted compensation (junior grades and 
: engineering major) predict graduate grades. 
This finding suggests that schools that solely 
use academic criteria for selection may be 
ignoring variables which predict future ca- 
reer performance. Management schools are 
"justified in searching for bright graduate 
students, but if they are at all interested in 
their ability to produce high-earning (suc- 
cessful) graduates, they should consider al- 
_ ternative selection strategies. 
The results reported in this study were 
somewhat restricted to business school gradu- 
ates from analytically-oriented programs 
similar to that of the Graduate School of 
Industrial Administration of Carnegie-Mellon 
‘University. In order to generalize these re- 
“sults to the whole population of MBA gradu- 
dates, it is necessary to study a broader sam- 
ple including students from other business 
_ schools. Furthermore, to directly answer the 
question of the relevance of business educa- 
tion to managerial success, one should also 
include a control group of managers who did 
not have a formal management education, 
Finally, the results of the present study are 
based on students who were admitted and 
Who subsequently enrolled in the MBA pro- 
gram. The predicted relationship may not be 
valid for the applicant population as a Whole. 
The results must be corrected for “curtail- 
= ment” or “restriction of range” (Thorndike, 
1949) before they can be used in a valid 
model for selecting students from an entire 
population of applicants. 
Three major conclusions can be derived 
from this study: (a) Graduate grade point 
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average is significantly correlate: 


man- 
agement compensation 3-10 year grad- 
uation. (b) The effect of work e nce on 
annual compensation is considera onger 
for staff managers than for lin nagers 
3-10 years after graduation. (: ff and 
line management compensation cai differ- 
entially predicted from pregrad e and 
graduate data. 
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\ study was conducted to 
in an industrial environment. Subjects 


for directory assistance operation. 


in the mean pupil diameter between 
conditions. Results indicated that the 
relative fatigue levels. 


Objective measurement of fatigue has long 
ded precise specification (Bartley, 1957). 
e factor which has contributed to the 
has been the notable absence of an 
le definition of fatigue that has a 
E: scientific meaning (McFarland, 
1971). Fatigue has traditionally been in- 
| ferred by subjective reports of tiredness and/ 
or by measurement of the presumed conse- 
quences of fatigue such as performance decre- 
ment (Przystanowicz, 1970). 
» Attempts to identify objective measures of 
fatigue have included a variety of psycho- 
~ logical and physiological variables that range 
_ from reaction time, blink rate, Critical Flicker 
Frequency (CFF), and visual acuity to bio- 
chemical, circulatory, and psychophysiologi- 
cal analyses of bodily processes. The major 
difficulty with many of these techniques is 
that the human organism is able to tem- 
é 1 Porarily overcome effects of fatigue—except 
in cases of extreme exhaustion—during a test 
situation, thereby offsetting all or some of 
_ the dependent variables (Cameron, 1971; 
_ Dearnaley, 1958). 


An area of investigation which has been 
rement of fa- 


N 


even more elusive is the measu 


4 1 Certain aspects of the technique in this study 
have been previously reported at ia on the 
pupil held by the German Ophthalmological Soci- 
| Cty, Bad Nauheim, Germany, March 1972 and the 
American Psychological ‘Association, Honolulu, Ha- 
ioe wal, September 1972. 
“Ss 2 Requests for reprints should be sent to W. Scott 
Peavler, P.O. Box 2020, Bell Telephone Laboratories, 
New Brunswick, New Jersey 08903. 


PUPLLLOGRAPHY IN INDUSTRIAL FATIGUE 
ASSESSMENT ' 


TAMARA GEACINTOV aso W. SCOTT PEAVLER * 
Bell Telephone Laboratories, New Brunswick, New Jersey 


participating in a trial of an automated retrieval 


‘iter working on the traditional telephone book as well as on the mechanical 
device. The testing consisted of photographing the subjects’ pupil at a rate 


of two frames per second for three minutes. 
pre- and postwork measures across work 


technique may 


BAs. 


be useful for determining 


tigue accruing as a result of normal woi 
activity at jobs that do not require an 
ordinate amount of physical activity. 
mental/psychological fatigue is presumably a 
normal result of repetitive and/or mono 
nous tasks and may be partially caused 
boredom. i 
One physiological system which under con- 
trolled conditions seems to yield a rather 
sensitive indication of central and/or mental 
activity patterns is the pupillary apparatus 
(Goldwater, 1972). In a series of studies that 
investigated pupillary dilation patterns as- 
sociated with psychological processes (Kahne- 
man & Peavler, 1969; Kahneman, Peavler, 
& Onuska, 1968; Peavler, 1969; Peavler * 
McLaughlin, 1967), a consistent observation 
was a reliable decrease in baseline diameter 
between the onset and termination of eo 
experimental session, regardless of the nature” 
of the task, which suggested that systematic 
diameter changes may be reflective of not- 
mal cognitive effort associated with typical 
job activities. 2 
Other investigators of the pupillary system 
(Lowenstein & Loewenfeld, 1951) have indi- 
cated a close relationship between pupillary 
patterns and fatigue levels. They state that 
pupillary waves of dilation and contraction 
accompany arousal and drowsiness in the 
tired individual and that the waves are ideal 
indicators of fatigue because of their objec- 
tive, quantitative nature and sensitivity 
(Lowenstein, Feinberg, & Loewenfeld, 1963). 
Under normal light conditions, pupil diameter 
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is maximum in a well-rested and alert sub- 
ject and diminishes as fatigue increases— 
finally reaching a minimum diameter just 
prior to the onset of sleep (Lowenstein & 
Loewenfeld, 1964). 

A review by Bell Laboratories of the direc- 
tory assistance operation, in order to identify 
labor or timesaving methods and devices, re- 
sulted in a trial examination of new systems 
for storage and retrieval of customer listings. 
One of the proposed systems consisted of a 
microfilm viewer that enabled the operator to 
access the telephone directory listings using 
an electronic keyboard. Customer listings 
were displayed on a screen prior to a final 
search. Because the operator works many 
hours per day at this task, concern was ex- 
pressed that continuous usage of display de- 
vices could possibly induce undue fatigue on 
the part of the operator. 

As a result of the reviewed evidence relat- 
ing pupil size to fatigue, a pilot study em- 
ploying 10 subjects was executed. Presumed 
fatigue was induced by exposing the subject 
to three Hollywood films, all shown on one 
day, with a total viewing time of six hours 

interrupted by a lunch period and a short 
_ afternoon break. Promising results of that 
study (reliable decreases in baseline diameter 

were observed) resulted in plans to obtain 
relative measures from operators participating 

in the microfilm trial while working at both 
_ the traditional book positions as well as the 
microfilm positions, 

Previous work by Lowenstein and his as- 
sociates that related pupil size to fatigue 
level was confined primarily to clinical evalu- 
ations of alert subjects and chronically tired 
individuals, whereas the purpose of the pres- 
ent study was to determine if the pupillary 
response sensitivity permits discrimination be- 
tween two similar job activities. 


METHOD 
Subjects and. Apparatus 


Seventeen operators participated in the study. 
Measurements on two of the subjects were not com- 
pleted due to operator schedule changes. The data 
of four additional subjects were discarded because 
the pupils were unmeasurable for various reasons, 
As a result, the data were based on 11 subjects, 5 
of whom normally used corrective lenses that were 
not used during data collection periods. 
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tech- 
paratus 


The pupillary data were obtained 
nique developed by Hess (1972) 


consisted of a viewing cabinet with a d cam- 
era and red source light that pern nofrared 
photographic records of pupil mov to be 
recorded. Measurements were taken ¢ room 
in darkness except for light from a amp on 
the rearview screen of the viewing box subjects 
looked straight ahead at a set of three | (ABC) 
after having been instructed to focus o middle 
letter (B) located about one third o listance 
from the top of the rearview screen. 
Procedure 

Each subject’s pupil was allowed to to the 
red light for one minute, after which i photo- 
graphed for three minutes at a rate o frames 


per second during each of the four is, and 


resulted in 360 measures at each sessic h sub- 
ject was measured before and after day’s 
work on the book and before and aft i] day's 
work on the microfilm reader. The obse is were 
taken on two consecutive days and res in four 
sets of measures for each subject. Hai ve sub- 
jects were initially measured before and work- 
ing on the book position, while the ot if were 
measured initially before and after work on the 
machine positions. Each frame of the pr d film 
was projected on a screen 89 centimete: m the 
front of the lens of a Dunning Anim single- 


frame projector, and the pupil image, m- ied 15 
times, was measured to the nearest millin er. 


RESULTS AND Discussion 


The pupil is sensitive to many in{izences 
other than fatigue. Other variables which 
affect pupil size include psychic stimulation, 
mental concentration, anxiety, attention, mo- 
tivation, emotional excitement, etc. In order 
to isolate the effect of fatigue, an attempt 
was made to minimize influences of extrane- 
ous variables by photographing the subject’s 
pupil in a dark, quiet room, with only the 
experimenter present. Also, because cognitive 
relaxation is necessary in order to obtain 
data uncontaminated by the factors men- 
tioned, data analysis was limited to the final 
minute of the recording period. It should be 
added that continuation of the session in a 
partially sensory-deprived environment prob- 
ably may have resulted in some subjects g0- 
ing to sleep. 

The mean pupil sizes for all subjects be- 
fore and after work on the book position and 
the microfilm position are shown in Figure 1. 
The graph encompasses the entire three- 
minute recording period. For easier graphic 
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present data points have been averaged 
for eve! econds. 

The mean pupil size for all the sub- 
jects a beginning of the workday was 
essentia e same at the book and machine 
positio: )k—4.6 millimeters, microfilm— 
4.5 mil rs. Furthermore, individual pupil 
sizes í ed during the morning session 
were | rrelated with diameter differences 
betwee rning and afternoon sessions. 

The n difference in pupil size (.32 mil- 
limete tween the morning and the evening 
measu r all 11 subjects across work con- 
dition s statistically significant (f= 
2.78, 01). Further analysis of the data 
within h work condition indicated that the 
mean ease (.43 millimeter) was statisti- 
cally nificant. (t= 2.64, p <.05) when 
the cts worked on the microfilm device 
but nen they worked on the traditional 
book tions. 

TL- unpose of the study within which the 
fatig neasures were carried out was to com- 
pare i> performance of operators who worked 
on to microfilm machine with that of op- 
erato: cho used the paper telephone record. 
This principal study showed that using the 
mechanized device resulted in improved per- 
form: by operators because of the inher- 


reater access speed of the mechanized 
im device. Therefore, performance 
es traditionally correlated with fatigue 
confounded with the differences in 
treatment conditions. 

While subjective reports of fatigue from 
the specific operators participating in the 
study were not identifiable because of guar- 
antees of anonymity, the population from 
which they were selected reported significantly 
more physical symptoms of fatigue such as 
neckaches, backaches, eyestrain, and head- 
aches when working on the microfilm device 
than when working at the paper positions. 
These complaints were somewhat contra- 
dicted, however, by an overwhelming pref- 
erence for the microfilm device. 

A difference in relative fatigue indexes was 
not unanticipated, because several plausible 
contributing factors associated with the micro- 
film device had been identified both prior to 
and during the study. The major factors, 
identified both by operator comment and 


were 
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Ficure 1. Mean pupil-diameter changes for 11 
subjects across time before and after working on 
book and microfilm positions. 


by environmental analysis, included poor 
overhead lighting that created glare, exces- 
sive contrasts in the visual field, specular 
reflections, inappropriate chair design that 
did not provide proper support, and an in- 
adequate physical design of the microfilm 
device that precluded shifts into restful posi- 
tions. It should be noted that operator ac- 
ceptance of the device, as evaluated, was gen- 
erally very positive and enthusiastic in spite 
of the noted problems. 

The interpretation of fatigue in the present 
study rests primarily with the subjective re- 
ports, the environmental factors that were 
identified as selectively affecting the micro- 
film operation, and the existing literature. 
Reliable performance differences, even with- 
out the previously mentioned confounding 
between tasks, are extremely difficult to ob- 
tain in the live directory assistance environ- 
ment because of the nature of the task. Since 
the customer, who is not under experimental 
control, is the most potent determiner of 
speed and accuracy in the transaction, data 
collection must necessarily continue over 
weeks or even months before reliable indi- 
vidual differences in operator performance 
may be identifed. 


In the present study, we were able only to 
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speak of statistical significance when com- 
_ paring mean pupil size at the end of a day’s 
Work performed on two somewhat similar 
tasks, both of which are assumed to be nor- 
mally fatiguing. In order to apply this tech- 
nique to practical problems involving groups 
of subjects, it is necessary to correlate the 
degree of fatigue, as measured by perform- 
ance decrement, with changes in pupil size 
and to develop quantitative norms that re- 
q flect individual differences. The investigators 
feel, however, from the results of the present 
study and the literature reviewed, that pupil- 
_ lary response shows some promise as a reli- 
able indicator for the more objective assess- 
ments of fatigue resulting from industrial 
work activity. 
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SHORT NOTES 
GETTING THE MAIL QUESTIONNAIRE RETURNED: 
SOME PRACTICAL RESEARCH CONSIDERATIONS 
JOHN F. VEIGA? 
University of Connecticut 
Methods used for returned mail questionnaires have not been systematically 
evaluated in terms of their effect on the rate of return and respondent bias. 
To evaluate the effectiveness of three most often used return methods, 100 
managers received a questionnaire which Was to be returned by either a 
stamped envelope, a business reply envelope or their firm’s interplant mail 
system. Results suggest that the interplant mail system provided the greatest 
return rate, at lowest cost, without respondent bias. The business reply 
envelope produced a very poor response rate. 
Eve uch the choice of a return vehicle for the mailers returned. It appears that the ques- 
a mai -stionnaire can have a substantial im- tionnaires returned by business reply somehow 
pact research, rarely have return vehicles received a lower priority in getting filled out and 
ever ` a tested as rigorously as the data which returned than either of the other methods. O 
they generate. As part of a major research get a closer look at the length of time required, 
proje . study was conducted to evaluate three weekly return rates were recorded for four weeks 
return methods: (a) the preaddressed stamped after the study began and are reported in 
enve (b) the postage guaranteed business Table 2. 2 
repl), lope, and (c) an interplant (in-house) The data in Table 2 further substantiate the 
mail em. The stamped envelope, while being ineffectiveness of the business reply mailer. More 
the most expensive in terms of postage cost, important is the large discrepancy noted in the 
requires the respondent to waste a postage stamp first week returns between the interplant system 
suggesting a possible advantage. The business and the stamped envelope. Two explanations for 
First it could 


reply envelope avoids the added expense of un- 
returned questionnaires but might be associated 
with junk mail, The interplant method has the 
advantage of saving postage cost but it could 
create substantial bias if the respondents feel 
their anonymity might be jeopardized. 


METHOD 
Return Rates 


One hundred randomly selected managers re- 
ceived a questionnaire. These managers were 
divided into three groups depending on the type 
of return mailer received. Table 1 presents the 
initial results. 


rate, it appears that the 
e business reply offers 
mail hypothesis. In 
t that the other two 


In observing the return 
poor response from th 
some support for the junk 
addition, the findings sugges 
methods are superior to the business reply and 
are about equal in return effectiveness. Also of 
interest was the length of time required to get 


Requests for reprints should be sent to John F. 
Veiga, Department of Industrial Administration, 
University of Connecticut, Storrs, Connecticut 06268. 


the discrepancy can be posited. ; 
be argued that the questionnaires returned by = 
the interplant system received attention faster 
because the return method demonstrated evi 
dence of company support. Second, it was noted 
that about 20% of the stamped envelopes were 
from distant geographic locations. Thus it ap- 
pears that stamped envelopes provided the re- 
spondents with an opportunity to take the ques- 
tionnaire home or on a business trip thereby — 


TABLE 1 
QUESTIONNAIRE RETURNS BY MAILER TYPE 


Questionnaire returns 


— — 


Mailer type Length of 


Return rate 


(%) period 
(in days) 
Interplant 82 13 
Stamped envelope 80 12 
Business reply 57 25 
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generated on the attitudinal questi 
have statistically similar distributions 
of the mailer used. The Mann-Whit) 
was employed to test the hypothesis 
were all nonsignificant at .01 level 
cance and, thus, it was concluded that 
plant system did not produce any 
respondent bias. 


SUMMARY 
The study supports the use of the 


pothesized, the interplant method di 
duce any respondent bias, In a co 
research environment, it provides a 
hicle that could save a considerable 
money in projects where the sample - 
high. Moreover, it is clear that the bv: 
mailer is not an effective return m. 
the stamped envelope could be a “ 
if an interplant or in-house system 


able. Certainly one could argue that 
factors affect mail returns such 
questionnaire, profile of individuals sam, 


Mainly, the system finally selected d: 
the trade offs the researcher makes. 


(Received November 8, 1972) 
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ECTS OF ALTERNATIVE FOLLOW-UP PROCEDURES ON 
MAIL SURVEY RESPONSE RATES * 


MICHAEL J. ETZEL” axe BRUCE J. WALKER 
University of Kentucky 


a large-scale mail survey of consumers, this esperiment compared the 
jects on response rate of (a) not sending a followup letter, (b) sadias A 
ollow-up letter with a duplicate questionnaire and return envelope, and ie) 
sama a follow-up letter without duplicates, The reals sagerst that the 
cst method k a follow-up without duplicates. 


problems in the use of mail quès- 
e of the most vexing is low re. 
Response rates between 10% and 
non (Boyd & Westfall, 1972; Luck, 
lor, 1970). * 


three categories, according to when 
under consideration is ted. 


example, pre- 
ine calls or letters (Ford, 1967; 
s; Kephart & Bressler, 1958; Myers 
69: Parsons & Medford, 1972; Staf- 
. (b) coincidental to mailing the 


ford 


quest c, for example, type of postage (Kep- 
hart § eser, 1958; Kernan, 1971; Watson, 
1965), h of address (Kernan, 1971), type of 
signature in cover letter (Kawash & Aleamoni. 
1971 lusion of incentive (Kephart & Bres- 
sler, 19:3; Watson, 1965; Wotruba, 1966), na- 
ture of cover letter (Andreasen, 1970; Simon, 
1967), and questionnaire format (Brown, 1965~ 


1966; Ford, 1968); and (e) after mailing the 
questionnaire, for example, follow-up interviews, 
phone calls, and correspondence (Francel, 1966; 
Hochstim & Athanasopoulos, 1970; Myers & 
Haug, 1969; Nichols & Meyer, 1966-1967 ; 
Watson, 1965). The present study is related to 
the third category. 

It is generally accepted that one of more 
follow-ups increase the rate of response (Fran- study, a subsample o 
= 1966). However, recent findings 3 male adults) was drawn from the 1,500 names. 
this factor are inconclusive. For example, in ‘ama in the study was 00 
study of “bank image,” a follow-up r used in the study 
week after the initial letter produced no greater dete. Only “check-type” answers were 
* return than no follow-up letter (Myers quired. The responses were anonymous. 
. The study was con 

The authors wish to acknowledge the financial sesearch conditions. The sample was large and 
9 providi by the Kentucky R oar representative of the general population in the 

nee i dditionally, the topic investiga 

R i uld be sent to Michael study area. Additionally, pi 
J. e Business Administration, in the survey (bank credit card usage) and the 


University of Kentucky, Lexington, Kentucky 40506. questionnaire itself were not atypical in any wax. 
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TABLE 1 


Mau Survey Response RATES FROM THREE 
DIFFERENT SAMPLE GROUPS 


Orig- | Re- y Net % 
Sample | inal vised 9 8 response 
base | bases rate 
| | 
Group control 
(no follow-up) | 300 299 116 38.8 
Group 2—experimental 
(follow-up without 
duplicate question- | 
naire and return f | 
envelope) 200 | 200 118 59.0 
Group 3—experimental | 8 
(follow-up with | 
duplicates) | 200 | 193 | 102 52.9 


* The revised base reflects adjustments due to questionnaires 
returned as undeliverable, 


Thus, the study possesses a high degree of ex- 
ternal validity; as pointed out (Luck, Wales, & 
Taylor, 1970, p. 121), the conclusion can prop- 
erly be projected, therefore, to the real-world 
problem of the decision maker. 

The subsample of 700 was divided into three 
groups: a control group of 300 (Group 1) and 
two experimental groups of 200 each (Groups 2 
and 3). Each sample member was sent the ques- 
tionnaire packet (an individually addressed, per- 
sonally signed cover letter explaining the credit 
survey; a questionnaire; and a postage-paid, self- 
addressed return envelope). 

Five days after the initial mailing, the ex- 
perimental groups were sent follow-up letters“ 
Group 2 received another individually addressed, 
personally signed letter requesting cooperation, 
Group 3 received the same follow-up letter and 
a duplicate questionnaire and return envelope. 


The letters were identical except for a necessary 


addition in the Group 3 letter: “In case you have 


_ misplaced either the questionnaire or the postage- 


paid envelope, duplicates are enclosed.“ 


RESULTS 


The cut-off date for examining response rate 


Vas 17 days after receipt of the first completed 


questionnaire. Only four questionnaires were 
received after this date. To that date, 336 com- 
pleted questionnaires had been received for an 
overall response rate of 48.6%. 


The response rates for the groups are shown in 
Table 1. The control group response rate of 


To stimulate response from those favorably 
disposed or neutral regarding the questionnaire, the 
follow-up, which represents a reminder, should be 
sent while the potential respondent still recalls the 
nature of the survey. Thus, a relatively short in- 
terim, five days, was used. 
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38.8% (116 returned from a re ase of 
299) was higher than normal for ques- 
tionnaire, particularly a four-page Perhaps 
this response rate reflected an ir in the 
subject matter among the respond nd the 
relative ease with which the instrum uld be 
completed, The response rates for experi- 
mental groups, 59.0% and 52.9¢ tively, 
were considerably higher than the group. 

The chi-square analysis used to he hy- 
pothesis indicated that both grou ceiving 
follow-up correspondence (Groups 3) ex- 
hibited significantly higher respons s than 
the group receiving only the ini nailing. 
Comparing Group 2 and the con group, 
x* = 19.6, p < .001; comparing Gro und the 
control group, x7 = 9.4, p < .001. 

The difference in response rat: etween 
Groups 2 and 3, those that received lterna- 
tive follow-ups, was not significa: = 1.53 
p> .05). Thus, including a duplic: 1estion- 
naire and return envelope in the foi up did 
not produce a greater proportion o ponses 
than did sending the follow-up wit dupli- 
cates. 

According to the results of this stu the re- 
searcher who uses a mail questic: and 
wishes to maximize response rate shou: consider 
employing a follow-up letter withow! plicates 
of the questionnaire and return enveloos. This 


procedure not only should improve the response 
rate to a greater extent than that generated by 
a single mailing but also should avoi e addi- 
tional printing, clerical, and perhaps postage ex- 
penses connected with including duplicates in a 
follow-up letter. 
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USING MONETARY INDUCEMENTS TO INCREASE RESPO` 
RATES FROM MAILED SURVEYS: 


A REPLICATION AND EXTENSION OF PREVIOUS RESEARCH 


SCHUYLER W. HUCK t axb EDWIN M. GLEASON 


University of Tennessee 


study indicates that (a) a quarter (25¢) is still an effective incentive, despit: 
its devaluation during the past 15 years, (b) the Kephart and Bressler study 
can be generalized beyond their unique target population, and (c) a follow-up 
mailing sending the quarter to nonresponders rather than to everyone on th. 
initial mailout saves about 50% of the cost of the financial incentive but does 
not significantly decrease the response rate. 


One need only glance through the professional 
journals of many disciplines to note that the 
mailed questionnaire continues to be a most 
popular data gathering tool in research investi- 
gations. Unfortunately, the response to mailed 
questionnaires is typically poor. Returns of less 
than 50% are common, while higher percentages 
_ are rare (Helmstadter, 1970). Since nonrespond- 
ers may be different from responders, a re- 
Searcher cannot be certain that generalizations 
from his data are valid when the response rate 
is low. For this reason, various attempts have 
been made to identify factors that increase the 
rate of return from mailed questionnaires, Re- 
Searchers have investigated the possible effect of 
using airmail stamps rather than regular postage 
on the return envelope (Orr & Neymay, 1965), 
ol having an actual signature on the cover letter 
(Kawash & Aleamoni, 1971), and of sending gifts 
to the recipients of the questionnaire (Pucel, 
Nelson, & Wheeler, 1971). 
As one might suspect, research has demon- 
4 strated that a monetary inducement can be used 
to improve response rates. In a marketing sur- 
__vey, Robinson and Agism (1951) found a quarter 
1 (25¢) to be effective in substantially increasing 
4 the return rate whereas lesser coins or stamps 
_. Were not; a dollar bill, however, was not more 
effective than the quarter. Kephart and Bressler 
(1958) applied these findings to the educational 
= setting and concluded that although the quarter 
= was effective in increasing the response rate 
(70% versus 52%), it was no more effective 


4 1 Requests for reprints should be sent to Schuyler 
. W. Huck, Department of Educational Psychology 
and Guidance, University of Tennessee, Knoxville, 
Tennessee 37916. 
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In a previous investigation, Kephart and Bressler demonstrated that the 
response rates to mailed questionnaires could be significantly increased by 
including a monetary incentive of 25¢. Using 200 college students, the present 


The present research investigation was unde: 


than a questionnaire with an ordi 
to nonresponders (68%). 

The Kephart and Bressler study ) raises) 
three important questions. First 


caused the value of a quarter to e since 
the late 1950s,? and it is question whether 
25¢ can still be used as an effectiy onetal 

incentive. Second, the target popula‘ urveyed 
by Kephart and Bressler was, by t wn ad- 


mission, quite unique, highly moti |, pre 
fessional female nurses. Assuming 

results could be replicated in spite o! 
value of a quarter, it is questionable whether the 
quarter could “work” in the more ge 

demic community. Third, and most im 

seems reasonable to ask whether it is 

send the quarter on a follow-up mailing 
responders as compared with sending i 
initial mailout to all subjects. In the Kep a À 
and Bressler study, many individuals receivingi 
the monetary incentive would have returned the 
questionnaire even if the quarter had not beem 
included. In fact, returns from the control group 
suggest that the quarter was wasted on approxi- 
mately 52% of the individuals to whom it was 
sent. Although it obviously represents substan 
tial monetary savings to send the quarter only 
to those individuals who do not initially 15 
spond, this procedure cannot be recommended 
unless it can be shown that the delayed use OF 
the monetary incentive results in at least as high 
a response rate as does an initial use of this in 
ducement, 


In terms of purchasing power, the Consumer 
Price Index shows that 25¢ in 1958 was worth only 
17.1¢ in October, 1972, the time of the present study. 


npt to answer all three of the 
ed above. Although answers to 
ions were considered interest- 
vemselves, the researchers were 
with the third question, More 
present study was designed to 
cr it is better to send a quarter 
ul mailout to all individuals, (6) 
to those individuals who do not 
initial mailout, or (e) in a sec- 
mailing to those individuals who 
d to either the initial mailout or 


taken in 
question 
the first 
ing in a 
most co 
specifica 
find out 
(e) in 

in a fo! 
respond 
ond fol 
do not 


the fir v-up mailing. 
METHOD 
Subjec: 

Duri: c fall academic term, 1972, 200 
subject -e randomly selected from approxi- 
mately male and female students living in 
reside: alls at the University of Tennessee. 
Once ned, these subjects were randomly 
assign: ne of four treatment groups. 

The ( nnaire and Accompanying Materials 

The onnaire used in this study was a 
variat the Rokeach Dogmatism Scale 
(Roke 1960). This instrument contained the 
40 stan ents that appeared in Form E of the 
origins’ Rokeach scale, plus an additional 10 
statenn that had been included for the pur- 
pose of « previous research study. The directions 
requested the subjects to indicate the degree to 
which they agreed or disagreed with each state- 
ment by means of a 6-point Likert-type rating. 


letter was signed 
graduate student. The preaddressed envelope did 
not require any postage as it was clearly marked 
for return through the university campus mail. 


Procedure 


The actual study involved four treatment 
groups (» = 50) and three mailouts. On the first 
mailout, all 200 subjects were sent the cover 
letter described previously, the questionnaire, 
and the return envelope. The material sent to all 
four groups was identical except that the cover 
letter for the first group of subjects (Group 1) 
had a quarter (25¢) taped to the bottom of the 
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TABLE 1 
Suuwary or me Restasce Desiox 
== ms ae = <= 2 — T 
Group First Second Thiri 
| mailul walli; t 
1 | Q+28 ln Ou. tone 
i Q Q+ ise | Q 
864 Q | Q 2 
Q i Q 


page. A short note next to the quarter thanked 
the recipient for taking time to Gill out the ques 
tionnaire and invited him to use the quarter to 
buy a cup of coffee and a doughnut. j 

The second mailout involved only those sub- 
jects who did not respond to the initial mailout. 
‘These nonresponders were sent a second copy of 
the Rokeach questionnaire and another envelope. 
A revised version of the cover letter indicated 
that no response had been received from that 
subject, and the recipient was asked to complete 
the enclosed questionnaire and return it in the 
accompanying envelope, For all subjects in the 
second treatment group (Group 2) that received 
this follow-up, a quarter was attached to the 
bottom of their cover sheet with the same note 
that had been put next to the quarter for sub- 
jects in the first treatment group on the first 
mailout. 

The third mailout involved only those subjects 
who did not respond to either the first or the 
second mailout. These nonresponders (holdouts) 
were sent a third copy of the Rokeach question- 
naire and another envelope. This time the cover 
letter indicated that the success of the study was 
contingent on a high percentage of returns and 
that the investigator would greatly appreciate 
the recipient taking a few minutes out of his 
schedule to complete and return the question- 
naire, For all subjects in the third treatment 
group (Group 3) that received this final follow- 
up, a quarter was attached to the bottom of the 
cover sheet with the same explanatory note as J 
had been used on the previous mailouts for 
treatment Groups 1 and 2. 

The design of this experimental research in- 
vestigation is summarized in Table 1. Of the four 
treatment groups, the first three may be thought 
of as experimental groups, Each of these groups 
received a quarter, and they differed only in 
terms of whether the quarter was included in 
the first, the second, or the third mailout. The 


. y 


: 
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fourth treatment group (Group 4) may be 
thought of as a control group, inasmuch as no 
monetary incentive was provided for this group 
at any point throughout the study, 


Resvuts 


On the basis of the first mailout, the Rokeach 
questionnaire was returned by 78% of the sub- 
jects in the first group (i e., the group receiving 
the quarter) as compared with 50% of the 
subjects in the remaining three groups. A chi- 
square test indicated that the difference between 
these two percentages was significant (x? = 10.8, 
dj = 1, p <.001). As expected, a comparison of 
the response rates for the three groups that did 
not receive the quarter yielded a nonsignificant 
value for chi-square (p> .10). These results 
demonstrate that (a) a quarter can still be used 
to increase the response rate, even though it is 
worth less than when it was initially shown to be 
effective as a monetary incentive in 1958, and 
(b) the conclusions drawn by Kephart and Bress- 
Jer (1958) can be generalized beyond their 


unique population of female nurses to a much 


larger segment of the general academic com- 
munity. 

The combined returns from the first two 
mailouts indicated that Group 1 (a quarter on 


_ the first mailout) had a response rate of 92%; 
_ that Group 2 (a quarter on the second mailout) 


had a response rate of 859%; and that Groups 3 
and 4 (no quarter) had an average response rate 


_ of 65%. Separate chi-square tests indicated that 
there was no significant difference between the 


percentage of returns from Groups 1 and 2, 
while the combined response rate from Groups 
3 and 4 was significantly lower than the response 
rate from either of the two groups receiving the 


quarter. In spite of the fact that twice as many 


quarters had been invested in Group 1 as com- 
pared with Group 2, the response rates from 
these groups, after two mailouts, were not sig- 
nificantly different. 

? The combined results from all three mailouts 
indicated that Groups 1 through 4 had total re- 
sponse rates of 94%, 92%, 78%, and 71%, re- 
spectively. Chi-square tests revealed that Groups 
1 and 2 did not differ in terms of these overall 
response rates, nor did Groups 3 and 4 differ 
from each other. Thus, the quarter, when delayed 
until the third mailout so as to minimize the 
researcher’s financial outlay, did not seem to 
work. To verify this apparent finding, Groups 
3 and 4 were compared in terms of the per- 
centage of returns from only the third mailout. 
No significant difference was found. 
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Discussion 


On the basis of their results, Kephart a 
Bressler (1958) recommended sending a fo 
up mailout to nonresponders rather than sei 
a quarter on the first mailout. n) 
tion was made because the 
although not significantly different in 
rate, was said to be more costly to the research 
The present authors maintain that neither 
these procedures should be adopted. 

According to Kerlinger (1964), a rëspo 
rate of 80%-90% or higher must be achiew 
before a researcher can feel confident that W 
results are valid. Sending a quarter on the fif 
mailout created a response rate of 70% in 
Kephart and Bressler study and 78% in W 
present study. On the other hand, the to à 
response rate from an original mailout plas 
follow-up mailing to nonresponders, with ni 
monetary incentive at all, was found to be 68 
in the Kephart and Bressler study and 65% Ñ 
the present study. Thus, the results from bol 
studies indicate that the “Kerlinger criterio 
cannot be achieved (a) in one mailout by 
ing a quarter to everyone, or (b) in two 
by sending a follow-up mailing to nonresponders 
with no monetary incentive given to anyone, 

The results of the present investigation dêl 
onstrate that the monetary incentive must | 
combined with a follow-up in order to achie 
an acceptable response rate. The resea ch 
seems to have two alternatives: (a) He can e 
the quarter to all subjects on an initial mailot 
with one or two follow-ups to nonresponders; | 
(b) he can send the quarter only to those int 
viduals who do not respond to the initial mal 
out, possibly with a second follow-up sent 
individuals who do not respond to either of t 
first two mailouts. These procedures do m 
significantly differ from one another in th 
effectiveness. and both produce a response f 
which closely approaches or exceeds the © 
terion of 80%-90%. However, since the fit 
procedure costs the researcher about twice 
much as the second, the present authors reco 
mend using the monetary incentive in the fr 
follow-up, not in the initial mailout. 
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This report reexamines previously collected 
experimental data to determine whether it is 


$ This investigation was supported by U.S. Public 
Health Service Grant #1 ROI HD 05600-01 to T. J. 
Bouchard, Jr. The report was written while the first 
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to werk on + 


the group sine (4-0) and procedures (various forms ol 
cematdeted, it was concteded that the pooled-individual pro 


superior to the poolrd croup 


procedure. 


more effective to form several p 
pool their results, or to use the 
of individuals who worked alone 


Mertiop 


The raw data from three prev st 
which compared nominal groups an: roupi, 
were reanalyzed in the following w=: he ne 
overlapping ideas of all ame- omii 
groups in a study were pooled and vet 
lapping ideas of the corresponding oup 
were pooled. Thus, Condition A in! 1 
resents the pooling of the nonoverls: -g ide! 
of three four-man real groups and s 
man nominal groups. The total num) of id 
generated in cach condition was 48 
respectively. The pooling of the nom: 


nonoverlapping ideas is equivalent to pooling 
nonoverlapping ideas of 12 individuals and ¢ 
responds to what we have called procedure ( 


The pooling of the real groups’ nonoverlappi 
ideas corresponds to procedure (c). Nine ine 
pendent contrasts were available tha! varied 
terms of group size, experimental conditions, 4 
problems worked on. The data are presented 
Table 1. 

Of the nine contrasts, Contrasts A, B, C, # 
D are from Bouchard (1972, Experiment II) 
Contrasts E and F are taken from an unpuw 
lished replication of part of the above stud) 
and contrasts G, H, and I are taken from Bo 
chard and Hare (1970). In all nine cases, 


two-tailed test). 

The results clearly suggest that for the grou 
sizes and experimental conditions sampled, it 355 
more effective in terms of total number of ide 
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EXPECTANCY AS A MODERATOR OF THE RELATIONSH 
BETWEEN JOB ATTITUDES AND TURNOVER * 
FRED DANSEREAU, Jus 
State University of New York at Buffalo 
JAMES CASHMAN asp GEORGE GRAEN 

University of Illinois, Urbana 
The hypothesis that the strength of the predictive relationship of job attitude 
on turnover is moderated by the expectancy of finding a comparable job was 
tested separately with a sample of office workers (22?) and a sample of 


managers (354). Results, while supporting this hypothesis, showed litti 
enhancement of the typical relationship. 


In a recent text, Industrial and Organizational 
Psychology, Korman (1971) pointed out the 
lack of published research on the moderating 
effect of the availability of comparably alterna- 
tive jobs on the empirical correlation between 
job attitudes and terminations, Although few in- 
vestigators working in the area of attitude and 
behavior doubt the validity of this moderating 
effect, documentation has not been supplied, 

Perhaps the main reason for the lack of docu- 
mentation of this moderating effect is that most 
models make the same prediction, namely, that 
a participant tends to leave a given organiza- 
tional field when the utility (or the like) of 
Staying in that field becomes sufficiently lower 
than some alternative organizational field, Even 
those researchers who only report the simple 
correlation between job attitudes and job termi- 
nation implicitly assume that the attainability of 
comparable jobs is essentially a constant or at 
least a severely restricted distribution for the 
sample under investigation. The point on which 
researchers disagree is whether the subjective 
probability of attaining a comparable job serves 
to moderate this relationship. The purpose of 
this analysis is to test for this moderating effect 
using a measure of subjective probability within 
samples of office workers and managers within a 
single organization. 


The authors would like to thank the president 
and vice president of the Participating organizations 
for their willing cooperation and to thank the sub- 
jects in this study for their contributions. The au- 
thors would like in addition to acknowledge the 
research assistance of Ken Alvares, Chip Johnson, 
Jeanne Herman, Takao Minami, Dean Orris and 
Tom Johnson and the critical reading of an earlier 
version by Bill Haga, 

? Requests for reprints should be sent to Fred 
Dansereau, Jr., Department of Organization and 
Human Resources, School of Management, State 
University of New York, Buffalo, New York 14214, 
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METHOD 
The samples for this analysis we 2 unig 
ized office workers and 354 manage: aA 


international corporation in one of 
dustries. These samples were asses 0 
a larger study (Graen, Danse rea Minam 
1972). Thirty months after the o | as 


ment, the employment record of ea rtici 
was obtained from organizational rds 
the last date of employment was! Du 
this 30-month period, 23% of the oi work 


sample and 17.2% of the manager 
left the organization. 

For purposes of this analysis, wo 
were indexed by the Role Oriente: 
(Graen et al., 1972), and the ex 
attaining a comparable job was asses 
ing each participant the following que 
you really looked for another job, wh 
think are your chances of finding one as good g 
better than your present job in the near future’ 
The response alternatives were: (a) “No chana 
at all,” (b) “Probably would not find one,” (e 
“Do not know,” (d) “Probably would find one, 
and (e) “Certain to find one.” This expectane 
variable was trichotomized by dividing it ; 
low (c and below), medium (“Probably woul 
find one”), and high (“Certain to find one”) e 
pectancy, These cutting points used on each sam 
ple resulted in the group frequencies shown 1 
Table 1. 

One analysis appropriate to test for 
moderating effect of perceived expectancy 
the correlation between job attitudes and te 
nations is the correlation between the predict 
(attitude) and the criterion (stay or leave) wi n 
in each of the three expectancy groups and © ie 
all three expectancy groups. Since the nonline: 
relationship between expectancy and predictor 
was close to zero for both samples, little trun 
tion of attitude scores was evident within 
pectancy groups. 
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TABLE I 
CORRELATIONS BETWEEN JOB ATTITUDES AND TERMINATION WITHIN Exrecrancy Geours 


— 


Expectancy of finding a comparable job 


— or pe Y a a a TRA Total sample 
Sample a tor Low Medium | High 
7 D * at i -F 
eo r | n r | ” | r | n r | * 
Office | 
Wo | —.04 76 -21 76 =—277% | 70 -18° 222 
Turn f | 18.4% 21.0% 2.9% | } 23.0% 
Mana | i 
Sup nīR | — 04 129 —.14 127 —,21° | 98 —. u 3s 
Tur ate 16.3% 15.7% i 20.3% 17.2% 
3 X coefficients of determination (eta) of expectancy on predictors were Ut with work itself and 01 with supervision 
2 2 onal rele 
255 
LESULTS AND DISCUSSION (1971). Even though these results are as pre- 
The s of this analysis showed that one dicted by our various models (theoretical sig- 
of the i- Role Orientation Index scales reliably nificance), the strength of these relationships. 
predict -minations for each sample. As shown lack the magnitude to be of much utility to the 
in Ta! work (intrinsic value) predicted practitioner (practical validity). A more prom- 
termin for the office workers and super- ising approach that might be considered is 
vision (i erpersonal relations) predicted termi- described and illustrated by Johnson and Graen 
nations managers. The correlations for the (1973). 
total snes are typical of those reported in REFERENCES 4 
previous investigations (Hulin, 1966; Libo, 1953; Graes, G., Daxsextau, F In., K Masi, T. Dys- 3 
Sagi, C Stead, & Altelsek, 1955; Waters & functional leadership styles. Organizational Be 
Roach, i971; Webb & Hollander, 1956; Weitz & havior and Human Perjormance, 1972, 7, 216-236. 
female clerical 


Nuckols, 1953) approximating —.20. 

As shown in Table 1, the perceived expec- 
tancy of finding a comparable job moderated the 
correlation between attitude and termination in 
both samples.“ In both samples, the pattern of 
observed correlations was as expected based on 
the constraint of an absence of a comparable 
alternative. Moreover, the pattern of turnover 
rates of the expectancy groups in both samples 
is compatible with the notion that people tend 
not to leave their present jobs if they doubt 
the attainability of comparable alternatives. 

Although a number of alternative hypotheses 
are compatible with the moderating effect of 
perceived expectancy on the relationship of atti- 
tudes to terminations, the empirical results sup- 
ply some of the documentation required to 
temedy the oversight mentioned by Korman 

In order to adjust for different turnover rates 
within the mobility groups biserial correlations can 


be used. The biserial correlations for low, medium, 
and high expectancy groups were —.06, —31, and 
—.30 for the office workers and .06, —.21, and —.30 


for the managers, Adjusting the correlations for the 


lack of a 80-30 split within expectancy groups had 
little impact on the moderator relationships. 
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JOB EXPECTATIONS AND VALENCES OF JOB REWARDS FO 
CULTURALLY DISADVANTAGED AND ADVANTAGED 
CLERICAL EMPLOYEES 


RICHARD D. ARVEY?* 


University of Tennessee 


This study examined the job expectancies of a culturally advantaged and 
disadvantaged employee group (female civil service clerical workers) con- 
cerning whether effective job performance would lead to certain job rewards 
and the relative importance or valences of these rewards compared to an 
advantaged group. Data revealed that the disadvantaged group saw for 
themselves, as opposed to the advantaged group, less likelihood for effectiveness 
i in their jobs to lead to advancement or to the use of one's own judgment 
> Abo, the disadvantaged group indicated that a high salary was more important 
$ to them than to the advantaged group. 


It is sometimes claimed that culturally disad- 
vantaged employees are just not motivated to 
perform effectively in their jobs. One explana- 
~ tion behind the behavioral manifestations of 
these disadvantaged individuals is that they per- 
_ ceive that working effectively in their jobs has 
little effect on attaining valued work outcomes or 
rewards, That is, culturally disadvantaged indi- 
viduals may believe that doing well in their jobs 
does not lead to such rewards as advancement, 
recognition, and so forth, regardless of whether 
these perceptions are true or not. This explana- 
tion stems primarily from the instrumentality- 
_ valence theory formulated by Vroom (1964). An 
_ expectancy is defined as a momentary belief con- 
= cerning the likelihood that a particular act is 
followed by a particular outcome, while valence 
essentially refers to the desirability or preference 
an individual has for a particular outcome. 

The purpose of this research was to examine 
the expectancies of a culturally disadvantaged 
employee group concerning whether doing well 

in their job would lead to certain reward out- 
comes compared to a relatively advantaged 
group. It is hypothesized that culturally disad- 

_ vantaged individuals differ, in these job expecta- 

tions, from a more advantaged group. 

a We also have reason to suspect that culturally 

_ disadvantaged individuals value different reward 
outcomes than a more advantaged group. That 


1 Appreciation is extended to H. Dudley Dew- 
hirst and Michael E. Gordon for their comments 
on an earlier version of this article. 

Also at Applied Psychological Systems, Inc., Knox- 
ville, Tennessee, 

Requests for reprints should be sent to Richard 
D. Arvey, Department of Industrial Management, 
University of Tennessee, Knoxville, Tennessee 37916, 
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is, the same reward outcomes have en 
values or valences for the two groups. Slocum 
and Strawsen (1971) reported a study indicating 


differed from white CPAs with regard 
they felt was important in their jobs. Bla 


tunity to help people, (6) omiy for inde- 
pendent thought and action, and (c) feeling of 
self-fulfillment. Thus, another objective of this 
research was to examine differences between a 
culturally disadvantaged and an advantaged 
group with respect to their perceived valences 
what they felt were important aspects in a 


METHOD 
Sample 


The population consisted of 266 female cleri 
cal workers employed for the civil service in 4 
large midwestern city. All but three of thes 
individuals had completed high school, For 
population a culturally disadvantaged and 
turally advantaged group were identified on 
basis of two kinds of information: (a) whethe 
or not their fathers had completed high schoo 
and (b) their scores on a measure of cultura 


(Mathis, 1967). Age was partialled out of thi 
latter variable. 

The Environmental Participation Index ( 
is a 68-item check list where the first 20 ite 
reflect household possessions (TV, typewri 
etc.) and are meant to indicate the richness 
the material home environment if such object 
were in their household prior to their sixt 
birthday. The last 48 items can be thought of 
activities (e.g., visited a museum, rode in a tax 


Snorer 


re asked to check “yes” if they 


etc.) Sul 
dad part in such activities by the time 
they were s old. 

Indivic re designated as culturally dis- 
advantas heir fathers had not 
high sch they were below the median on 
the EP! core. The number of individuals 
included advantaged subgroup was 76— 
65 of w? | complete data—while the num- 
ber inc! n the disadvantaged sample was 
74—64 « xn had complete data; both sub- 
groups v -awn from the total sample of 266. 

A fa alysis revealed that these two 
variable: in actuality, measures of cultural 
advantay ey & Mussio, 1973), Similarly, 
accordir che norms presented in the EPI 
manual 1, 1967), the raw scores obtained 
by the ‘antaged sample correspond to the 
scores eted to signify a disadvantaged 
popula! 

Measur 

Dur time when these employees were 
being for a test validation project, they 
were a o complete a questionnaire contain- 
ing val ind expectancy measures. 

In orco- to obtain information concerning the 
desirat or valence of certain reward out- 
comes hei! by these individuals, 10 job outcomes 
were presented with instructions to distribute 
100 points among the outcomes according to 


their perceived importance. The 10 reward out- 
comes were: making use of abilities, accomplish- 
ment, advancement, supervise others, fair com- 
pany policies and practices, high salary, getting 
along with co-workers, praise, use Own j ent, 
and steady and secure employment. 

Subjects’ perceptions of their expectancies for 
the outcomes were determined by asking them to 
indicate the probable results of efective per- 
formance according to whether they were ei 
quite certain the job reward would not occur; 
probably would not occur; was not sure one way 
or another; probably would occur; or was quite 
certain the job reward would occur. Values of 1 
to 5 were assigned to these response alternatives. 
This expectancy was determined for each of the 
10 reward outcomes, resulting then in 10 sepa- 
rate expectancies for each individual. 

The disadvantaged and advantaged groups were 
compared on the separate valence and expec- 
tancy measures by obtaining means and com- 
puting £ tests. 


RESULTS 


Significant differences beyond the os level 
occurred between the two groups with regard to 
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with co-workers. In contrast, 
indicated that accomplishment 


iji 
i 

F 

z 


abilities, accomplishment, 
em 


ployment. 
Statistically significant differences (p < 05) 
variables: we of 


The results of this study have indicated that 
a culturally disadvantaged group differs from a 
relatively more advantaged group with regard to 
their valences for certain job outcomes and their 
expectancies concerning whether effective job 
formance leads to certain job rewards. P 
The finding that the disadvantaged group had 
greater valence for a high salary and secure em- 
ployment than had the more advantaged group 
was not surprising. To some extent, these data 
replicate the findings of Slocum and Strawsen ` 
(1971) that blacks assign more importance to 
lower order needs, That the culturally disad- 
vantaged also feel that getting along with their 
co-workers was the most important feature in 
their job was somewhat surprising. Apparently 
social needs were particularly acute for this 
sample. However, one of the items in the Slocum — 
and Strawsen (1971) study that black certified 
public accountants indicated was relatively more 
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VELOPMENT OF A SCALE TO ASSESS STUDENT NURSES’ 
VIEWS OF THE PROFESSIONAL NURSING ROLE * 


M. CROCKER * R J. BRODIE 
niversity oj Florida University of Virgis 


h of 112 behaviors common to practicing sures. A fsal 
xd consisting of (0 items on which students shifted toward 
ass rank Reliability computed for + cro validation poop wet 
39. Significant gains in student scores occurred as chase raak 
adicating that students adopt faculty professional views in dimet 
ngth of training. 

Ama | of educators in the health pro- uing the same trait list, 
fession cndow students with realistic views increase in the correspondence 
of thei: “Sonal roles. Failure to develop a and faculty professional 
realisti image is believed to be a primary vanced in class rank, Sharp 
cane ademic dropout, poor clinical per- found increasing correspondence 
forma: d desertion from the field after and 2 
gradua r example, Katzell (1968) reported Adjective 
that v wal after the first year of nurses’ class rank. ‘Thus far, 
trainin: inversely related to fulfillment of standardized instrument has been 
studen cctations and Fox, Diamond, and measuring changes 
Walsh ) and Kibrick (1958) observed that ing role, 
nursing gdents who were realistic in their ap: Wik the growing 
praisal. | the field were more satisfied and more in health care education 
likely | persist in the profession, The process faculty influence student 
by wh.) students acquire appropriate percep- need for instruments which 
tions he professional role has been called the process of professional 
“profess.onal socialization.” primary — kes eri ; 

amount i a 
Paramount in professional socialization theory — — 


is the auum that faculty strongly influence stu- ange 
dent perceptions of their professions. Several = 
researchers have attempted to measure this influ- role. 
ence on medical and nursing students, with vary- 

and Michiel 


ing degrees of success. Instrument 
(1966) reported that medical students’ views of of 11 
the doctor image and doct relationship pool , 


or- patient An initial 

became more like those of their faculty as stu- and entitled the nurses 
dents progressed in their training. Unf scale. tem Ben eo 
the statistical significance of these student re- displayed by nones g 1 
sponse shifts were not Similarly, Davis spondents were req 
and Oleson (1964) and Oleson and Davis (1966) on its importance fos 
found that student nurses’ value rating of traits ure wing a N 
were inſſuenced by the emphasis placed on these tremely important 
traits by their instructors; Dut Siegel (1968), ple. items 


‘This study was supported in part by a grant them is not clear, 
(DID NU 00232) from the Division of Nursing, of a patient with no 
U.S. Public Health Service. i 


encouraged them to undertake this project. ani facal bers from the 
Requests å Linda M. sample of 94 nursing faculty mem 
Crocker, Ne eat 2 Instit three participating universities. The proportion 

i of faculty that endorsed a particular response 


Gainesville, Florida 32601- 


was rounded to the nearest 10% and this 
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was assigned to that response, Thus, if a student 
selected a response endorsed by 90% of the fac- 
ulty judges, she received 9 points for that item; 
if she selected a response endorsed by 20% of 
the faculty judges, she received only 2 points 
for that item, etc. Consequently, a student could 
achieve a high score only by rating the traits in 
the same way that a high proportion of faculty 
members had rated those traits. 


Sample 

The 488 student nurses who responded to the 
nurses professional orientation scale were all stu- 
dents who could be contacted in four-year un- 
3 dergraduate, baccalaureate nursing programs at 
Michigan State University, Wayne State Uni- 
versity, and the University of Illinois. 


Design 
An item analysis, cross-validation design was 
employed. The sample of 488 student nurses 
was randomly divided into two groups, stratified 
by class rank, The responses of the first sub- 
sample of 244 students were submitted to an 
item analysis. Selection of items was based on 
the assumption that as students progressed 
through the program, their views of the nursing 
profession become more like that of profession- 
als. The correlation between class rank and the 
student’s item score was tested for each item. 
If a significant, positive correlation was found 

(a <.05) that item was retained for the final 

scale. Sixty items from the original pool of 112 

were retained after the item analysis. On each of 
_ these items there was a significant shift toward 
~ agreement with faculty perception of the trait’s 
importance as the students rose in class rank. 
For example, the item: “Questions instructions 
when the reason for them is not clear” was 
originally rated as extremely important by 80% 
of the faculty judges. When the students rated 
this item, that response was chosen by 28.6% 
of the freshmen, 50.0% of the sophomores, 83% 
of the juniors, and 81.8% of the seniors in the 
item analysis group. 

The responses of the remaining 244 students 
in the cross-validation group were then-scored 
for the reduced set of items selected through 
item analysis. The data from this independent 
sample were used to test the reliability and 
validity of this scale. 


RESULTS 


To test the scale’s validity, the scores of the 
244 students in the cross-validation group were 
then computed, based on the final set of 60 


Snort Nores 


items. Means and standard devi or the 
nurses’ professional orientation sc res of 
freshmen, sophomores, juniors, a ors in 
the cross-validation group are prese Table 
1. Each class mean was progressive! er than 
that of the class below, ranging fr mean 
score of 132.9 for freshmen to a m ore of 
193.5 for seniors. An analysis of y e indi- 
cated that the observed differences en the 
class mean scores were highly signi t (R= 
34.7, df = 3/240; a<.01). Pairwi ympari- 
sons between mean scores of fr n and 
sophomores, between sophomores juniors, 
and between juniors and seniors, etc made, 
using Scheffe’s test (Hays, 1963). T results 
show that the mean score of each on the 
nurses’ professional orientation sca signifi- 
cantly greater (a S. 05) than the of the 
class below, A mean increase of 17 its was 
observed between the junior and s- lasses; 
an increase of 21.8 points, between omores 
and juniors; and an increase of 21.4 its, be- 
tween freshmen and sophomores. 7 reatest 
difference (60.6 points) was found | en the 
senior and freshmen classes, 

These results indicated that the pro- 
fessional orientation scale measures 3 in the 
student’s view of the profession tha s posi- 
tively related to the length of the ng ex- 
perience. The congruence between st cnt and 
faculty ratings of items of the nurse profes- 
sional orientation scale increased si. ‘icantly 
with each year of training experience. 

The reliability of this 60-item scale wos com- 
puted using the responses of the cross dation 
group. The internal consistency c ient of 
the nurses’ professional orientation scale, com- 


puted with the generalized coefficient alpha 
formula, was r = 80, indicated that the attitude 
measured by the nurses’ professional orientation 
scale was a fairly unitary trait among student 
nurses. 


Discussion 


Generally, the results of this study support 
the contentions of Davis and Oleson (1964) and 
Sharp and Anderson (1972) that there is a defi- 
nite trend for the student to endorse the faculty 
view of the profession as graduation nears. This 
may indicate that the student’s views are molded 
by actual clinical experiences, or that the faculty 
who teach students basic skills and procedures 
also indoctrinate them with their own profes- 
sional views. 

One limitation inherent in this study should be 
noted. All students in this study were enrolled 
in baccalaureate university programs. Although 


‘re quite similar to many others 
country, the professional values 
ents in other settings may differ. 
‘se findings might not be appli- 
in junior college or hospital 


y and validity data for the nurses’ 
entation scale are very encourag- 
this instrument merits further 
cudy, An interesting area for in- 
the relationship between clinical 
and scores on the nurses’ pro- 
mation scale, It would also be 
un the relationship between the 
professional views and persis- 
profession, Additional studies such 
needed before this scale could be 
vidual counseling or evaluation. 
must be stressed that the score on 
professional orientation scale mea- 
de degree of the student's conform- 
culty's professional standards, The 
ww score could not be inferred from 
ae. A student could earn a low score 
cynical about her profession, too 
her profession, or too idealistic. Each 
er of this scale needs to examine the 


TABLE 1 
“DATION SAMPLE MEANS AND STANDARD 
tons ON NURSES PROFESSIONAL ORIEN- 
ON Scate Scores FOR Four CLASSES 


educational setting. 


(Received December 11, 1972) 
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STRUCTURE AND MEASUREMENT OF MARINE 
RECRUIT ATTITUDES * 


RICHARD F. BOOTH? anp ANNE HOIBERG 


Navy Medical Neuropsychiatric Research Unit, San Diego, California 


Five orthogonal factors labeled as attitudes toward the toughness of Marines 
the spirit among Marines, affiliation with the Marine Corps, and authority 
and consideration in the Marine Corps were identified in the responses oi 
recruits to items in a Marine Corps Opinion Questionnaire. Scales developed 
to provide scores on these factors should be useful in future studies of Marine 
recruit attitudes and the relationship of these attitudes to posttraining effec- 


tiveness criteria. 


Marine recruits come from a wide variety of 
backgrounds into the highly structured environ- 
ment of basic training where, during the period 
of a few weeks, they are exposed to an intensive 
military indoctrination. An implicit objective of 
this indoctrination is to modify recruit attitudes, 
presumably for the purpose of improving subse- 
quent performance and conduct in the Corps. 
Evidence, however, as to the nature of attitude 
changes that actually occur during this period 
and particularly the relationship of these atti- 
tudes and attitude changes to criteria of effective 
service, has generally been lacking. One reason 
for the lack of empirical evidence pertaining to 
these relationships has been the absence of a 
reliable measure of relevant recruit attitudes 
(Wilkins, 1969). 

The purpose of this study was to develop a 
measure of attitudes toward the Marine Corps 
that could be used in a larger research project 
designed to explore Marine recruit attitudes, atti- 
tude changes, and the validity of these attitudes 
for predicting posttraining effectiveness. The 
primary objective of this larger research program 
was to evaluate these attitudes as a possible ad- 
diticn to the composite of variables currently 
employed for the prediction of effective service 
in the Marine Corps (Berry & Nelson, 1967; 
Plag, Goffman, & Phelan, 1971), Furthermore, 
it was expected that the identification of factors 
related to recruit attitudes and attitude changes 
eventually leads to improved decisions regarding 


This study was supported by the Bureau of 
Medicine and Surgery, Department of the Navy, 
under Research Task MF12.524.001-9005. The 
opinions expressed are those of the authors and are 
not to be construed as necessarily reflecting the 
official policy of the naval service. 

2 Requests for reprints should be sent to Richard 
F. Booth, Navy Medical Neuropsychiatric Research 
Unit, Department of the Navy, San Diego, Cali- 
fornia 92152. 
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programs designed to facilitate y 
ment to military life. The initial 
overall investigation, therefore, invo! 
struction of an instrument that coul 
assess Marine recruit attitudes on 
sions during basic training. 

One useful method for identify: 
tudes measured by this questionn and the 
items that constitute stable indi: F these 
attitudes is factor analysis. The pt > of this 
study, then, was (a) to develop vr scales 
that provide stable indices of attit- assessed 
consistently across administrations i (b) to 
evaluate the independence and re ability of 
scores computed on these factor sch 


the atti- 


METHOD 
Subjects 


The sample was composed of 481 Marine re- 
cruits, from 8 platoons, who entered ‘raining at 
the Marine Corps Recruit Depot (1 D), San 
Diego, during October 1969 and who remained 


with the same platoon throughout training 


Attitude Measure 


A Marine Corps Opinion Questionnaire 
(MCOQ) was created to assess attitudes toward 
several aspects of Marine Corps life: (a) leader- 
ship, (b) discipline, (c) affiliation, (d) toughness, 
(e) group living, and (f) general living and 
working conditions. These areas were chosen, on 
the basis of interviews conducted and literature 
reviewed, because they were considered relevant 
to all Marines, After a sizable item pool had 
been assembled, the attitude items were evalu- 
ated for readability and meaningfulness by 2 
number of Marine recruits and noncommissjonè 
officers. Finally, 96 items were selected to be 
included in the experimental questionnaire. 
six-category, Likert-style response format Was 
employed with item alternatives ranging on 4 
continuum from 1 (agree completely) to 
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(disagree ce ely). To reduce the possible 


efiects of r: se bias, half of the items se- 
lected were ed positively with respect to the 
Marine Cor; ereas the remaining items were 


worded neg 


Procedure 

i The ques! aire was administered to the sub- 
jects four during training; these adminis- 
trations coi snded to the beginning of train- 
ing, the en the first and second phases, and 
the conclus f training. 

Administ n of these questionnaires was 
conducted senior noncommissioned officer in 
a classroo: the Marine Corps Recruit Depot. 
It was en ized in the instructions that an- 
swers won de treated as confidential and that 
no Marin „uld see the responses of an indi- 
vidual rev 

RESULTS 

The pal-axis factor model (Harman, 
1967) wa. used to determine the factor pattern 
of item ‘correlations for each administra- 
tion of the Marine Corps Opinion Questionnaire. 
The extraction of factors was stopped at 10 
because, on the basis of limitations imposed on 
item content during creation of the question- 


not expected that additional factors 


naire, it 
eful for the purpose of this study. 


would be 


Further, he percentages 
of variance explained by Factors 6 through 10 
were very small, only the first five factors from 


each analysis were rotated to a normalized vari- 
max criterion of simple structure (Kaiser, 1958). 
Coefficients of congruence computed to determine 
the degree of similarity between rotated factors 
from each analysis (Harman, 1967) suggested 
that the same five factors were represented in 
the attitudinal responses made by recruits during 
each administration. 

The five factors extracted from the Marine 
Corps Opinion Questionnaire responses of re- 
cruits may be labeled as attitudes toward tough- 
ness, spirit, affiliation, authority, and considera- 
tion. Items correlated with Factor 1 (9%), 
toughness, pertain to the predisposition of a Ma- 
rine toward hostilities as well as the invincibility 
of a Marine once in combat. Factor 2 (11%); 
spirit, was loaded with items which suggested a 
cynical view of Marines as people. When the 
wording of these items was reversed from nega- 
tive to positive, however, they seemed to suggest 
a spirit that may exist among Marines, that is, 


3 Percentage of variance is explained by this fac- 
tor on fourth administration of the Marine Corps 


Opinion Questionnaire. 


* 


that Marines are honest, friendly, loyal to their i 
buddies, hardworking, ambitious, and brave, 
Items with high loadings on Factor 3 (9%), = 
affiliation, refer to serving in, and identifying 
with, the Marine Corps. The items loading on 
Factor 4 (9%), authority, pertain to tradi- i 
tional views of leadership and discipline in the 
Marine Corps. Finally, the content of items that 
were correlated with Factor 5 (8%), considera- 
tion, reflect a concern on the part of leaders ſor 
the needs and problems of subordinate Marines, 
In order to develop scales that could be used 
to measure these factors, items were selected 
from the Marine Corps Opinion Questionnaire 
which had consistently high loadings on one of 
the factors and low loadings on the others. Some 
compromises were made, however, when an item- 
loading was low on a factor on a single admin- 
istration or when an item had a high loading on 
more than one factor, as occurred in several in- 
stances with Factors 4 and 5 on the fourth 
administration. In this manner, 35 items were 
chosen as indexes for the five factors; representa- 
tive items from each scale with corresponding 
fourth administration factor loadings are pre- 


sented below. 5 


Representative Items and Factor Loadings 
For Each of the Five Attitude Scales 


Factor 1 (7 items): Toughness 
2. Combat is not hazardous for a well- 
trained Marine. (.53) 
Most Marines would eagerly take on 
several civilians at a time in a bar fight. 
(.62) 
Factor 2 (10 items): Spirit 
13. In the Marine Corps, nobody seems to 
“give a damn” about anything. (.57) 
gs. A smart Marine does only what is re- 
quired for him to “get by.” (.64) 
Factor 3 (5 items): Affiliation 
58. If I had my choice now I would prefer 
to be a civilian. (.88) 
73. Despite everything, I feel more at home 
in civilian life. (.72) 
Factor 4 (10 items): Authority 
20. The most important thing to teach Ma- 
rines is absolute obedience to their su- 
periors. (82) 
Any good Marine Corps officer or NCO 
should be strict with the men under him 
in order to gain their respect. (.64) 


Factor 5 (3 items): Consideration 

91. In general, Marine Corps officers and 
NCOs keep their men informed about 
what is going on. (.65) 
Marine Corps officers and NCOs are 
usually understanding of the needs and 
problems of their men. (.68) 


45. 


68. 


95. 
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The final objective of this study was to de- 
termine the intercorrelations and reliabilities of 
scores computed as indices of these factors. Fac- 
tor scores were computed for each administra- 
tion by reflecting (subtracting from 7) responses 
to items worded negatively with respect to the 
Marine Corps and then summing responses across 
all items chosen to represent a factor. Inter- 
correlations among these scores were computed 
and reliability coefficients were derived for each 
scale. These figures indicated that the intercorre- 
lations among scores were low to moderate in 
magnitude (average r= 34) and that the esti- 
mates of test-retest reliability computed for each 
scale were quite high (average reliability co- 
efficient = .82). 


Discussion 


The consideration and authority factors identi- 
fied in the results of this study were strikingly 
similar to the “consideration” and ‘initiating 
structure” dimensions of leader behavior which 
emerged from the Ohio State Leadership Studies 
(Stogdill & Coons, 1957). The “consideration” 
factor pertains to the amount of friendship, 
mutual trust, respect, and warmth generated be- 
tween a leader and his group; the “initiating 
structure” factor pertains to the degree of defi- 
nition by a leader of group organization, group 
activities, and of his relationship to the group. 
Although the Ohio State factors have been 

identifed in a wide variety of leadership situ- 
ations (Fleishman, 1973; Stogdill & Coons, 
1957), the present study suggests that the 
generality of these two constructs may extend 
to a highly structured, authority-oriented en- 
vironment such as Marine Corps basic training. 
Of more importance, however, is the finding in 
several studies that the amount of “considera- 
tion” displayed by leaders is inversely related 
to such behavioral measures as grievances, ab- 
senteeism, and turnover (see e.g., Fleishman & 
Harris, 1962; Fleishman, Harris, & Burtt, 1955; 
Vroom, 1964). These relationships suggest that 
investigating the potential uses of consideration 
scores for evaluating and predicting leader ef- 
fectiveness in the Marine Corps may be a prom- 
ising area for further research. 

Scale reliabilities were high, despite the rela- 
tively small number of items representing each 
factor. Nevertheless, additional items should be 
developed to supplement each scale. Further- 


SHort Notes 


more, a better balance is needed ch scale 

between positively and negatively 5 d items, 
Overall, the findings of this stud ve dem- 

onstrated that the Marine Corps m Ques- 
tionnaire can indeed be a useful sure of 
attitudes toward the Marine Cor he atti- 
tudes assessed by this questionnai pear to 
be relevant with respect to the C and the 
scales developed to measure these a ies were 
found to be reliable. The Marine Opinion 

Questionnaire, then, should be partic ly useful 

for the study of Marine recruit s. Fur- 

thermore, scores derived from measure 
should be evaluated as potential tors of 
effectiveness criteria. 
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. DISGUISED MEASURE OF CIVIL RIGHTS ATTITUDES 


ABRAHAM K. KORMAN ' 
Baruch College, City University of New York 


he purpose of the research was to construct and to validate two knowledge 
sts at different occupational levels as disguised measures of civil rights 
ttitudes. Results indicated that there is a significant relationship between 
nowledge of civil rights events, attitudes toward civil rights, and membership 
n groups concerned with civil rights affairs. These findings seem sufficiently 


promising to justify further research. 


The d pment of a measure of civil rights 
attitudes is free of “social-desirability” 
seems t f value for two reasons. First, it 
should le some guidance toward resolving 
the pers problem of how psychologists can 
effective easure individual differences along 
signific: notional and attitudinal dimensions. 
Second hould be a step toward helping ad- 
ministr identify for appropriate action those 
individ vho might be antagonistic toward 
the ful aration of our nation’s organizations. 

The ach that was decided on to develop 
such a ure was based on the logic used by 
Greene 40) in his development of the Michi- 
gan Vor-hulary Profile Test. This test is based 
on the logic that we become more knowledge- 
able in those fields where we have positive in- 
terest, evidence has been presented which 


supports this assumption (Greene, 1940). More 
recently, Ewen (1969) has used a similar logic 
in explaining his obtained positive correlation 
between scores on the Graduate Record Exami- 
nations (GRE) Advanced Tests: Psychology 
and performance in graduate school in psychol- 
ogy, after verbal intelligence had been partialled 
out. 

These studies served as a background for de- 
velopment of a series of test items that measured 
knowledge of civil rights events and the ability 
to think logically with such materials. The 
hypothesis tested was that scores on such mea- 
Sures are related to independent measures of 
civil rights attitudes and behaviors. 


METHOD 
Test measures 
After pretesting, two separate tests were de- 
veloped, a long difficult 69-item form and a 
short, easier 29-item form. Both versions were 
designed to reflect the subject’s knowledge of 


1 Requests for reprints should be sent to Abraham 
K. Korman, Department of Psychology, Baruch 
College, City University of New York, 17 Lexington 
Avenue, New York, New York 10010. 


civil rights events and his logical thinking ade- 
quacy when presented with civil rights materials. 


Samples and Administration 


The tests were administered to various groups. 
For student groups, administration took place in 
class sessions and as part of research require- 
ments for introductory psychology. For indus- 
trial groups, volunteers were obtained with com- 
pany cooperation with responses by mail. All 
tests were administered anonymously. 

While in no way could the groups be consid- 
ered random samples from particular popula- 
tions, the following fairly diverse groups of re- 
spondents were obtained: (a) 87 financial-dis- 
trict managers, all of whom were enrolled in the 
evening master’s of business administration pro- 
gram of a local university; (b) 24 managers in the 
insurance industry; (c) 52 commerce students 
at a suburban university under religious con- 
trol; (d) 156 liberal arts (mostly psychology 
majors) students at two urban universities; this 
group included a subsample of 73 within which 
a sufficient number of individuals belonged to 
civil rights groups to enable the use of this 
variable as a second criterion measure. In addi- 
tion, Wonderlic scores were available to be used 
as control measures in another group of 44 from 
this overall sample of 156; (e) 25 lower level in- 
surance company employees; (f) 28 employees 
of diverse occupational levels from an office 
supplies corporation; and (g) 140 evening stu- 
dents at a local community college, almost all of 
whom were employed in subprofessional lower 
level positions. 

Samples a, b, c, and d responded to the long 
form while e, f, and g responded to the short 


form. 


Civil Rights Indexes 


At the close of each testing session or at the 
end of the test, each respondent was asked to 
indicate his answer to two questions: (a) Did he 
belong to any group that could be considered a 
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TABLE 1 


RELATIONSHIPS BETWEEN Civit Ricuts KNOWLEDGE 
J GROUPS 


] 
| r with 

Sample | civil 
n ve SD | rights 

atti- 
| tudes 

Long test 

Managers 87 | 33.4 | 10.8 | .27** 

Insurance managers 24 | 36.7 8.4 | .40* 
Suburban university 52 31.8 818 42 
Urban university 156 | 31.6 Tp) Be had 
Urban university 73 | 35.0 6.927 


Short test 
Insurance company 
employees 25 
Office equipment 


company employees, 28 22.7 3.7 .33 
Mixed employee 
sample 140 17.2 4.1 | .46** 


a r biserial with "membership in civil rights groups.” 
*p 2.05. 
* <01. 


= pro-civil rights group? (b) How would he rate 
~ himself on a 5-point scale as to his attitude 
toward civil rights (1 = anti, 5 =pro, 3 = neu- 
tral, etc.)? This latter question, with the one 
exception previously mentioned, was the major 
dependent variable in this research, 


RESULTS 


The results of the research are outlined in 
Table 1. In general, these findings justify the 
conclusion that there is a significant relationship 
between knowledge of civil tights events, atti- 
tudes towards civil rights and membership in 

' groups concerned with civil rights affairs. This 
conclusion did not appear limited by the non- 
_ linearity of regressions, since scatter plot analyses 
indicated that in no case were the regressions 
nonlinear; nor did it appear to be limited by 
the contaminating influence of intelligence as the 
major contributor to the correlation between the 
knowledge test and the attitude measurement, 
Thus, for a subsample of 44 within the urban 
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university sample of 156, the corr 1 between 
the knowledge test and the intelli: test was 
not significant (r= .29, p > .05 he same 
group, the correlation between Wonderlic 
and the attitude measurement was „ and the 
partial correlation coefficient, een the 
knowledge test and attitudes, with V onderlic 
scores partialled out, was +.33 ( 5). 
Discussion 

It would seem that the im tions of 
Greene’s (1940) and Ewen’s (19 work for 
developing measures of attitudes ved sup- 
port in this study. However, fu: research 
does seem warranted as to why the tionships, 
while consistent, are sometimes t in mag- 
nitude. One explanation might bi possible 
unreliability of our criteria (i.e., 1 | attitude 
measurement and statements of bership). 


Perhaps stronger criteria may led to 
stronger relationships. 


Even with stronger criteria, h ex, it is 
likely that the practical value of strument 
of this nature is limited to samp! relatively 
homogeneous in education. The res for this 
limited use is the relatively weli tantiated 
finding of a general relationship i! popula- 
tion between educational level, l racial 
attitudes and general knowledge Allport, 
1958, pp. 405-407). Yet, this find does not 


obviate the fact that, holding educ 
constant, there may be other varia 
positive interest, that may affect th: 
between knowledge and attitudes g i 
viduals within each educational level. Such posi- 
tive interest could lead to the types relation- 
ships which we have reported here, relationships 
which, while encouraging, still need to be further 
validated with behavioral criteria. 
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IR RELATIONSHIP TO WOMEN’S ROLE CONFLICTS AND COPING * 


DOUGLAS T. HALL 


Michigan State University 


feminine woman to role conflicts, 
sample of 229 college-educated 
be related. The woman’s perception 
the best predictor of various types 
style of coping with conflict was a 
also associated with satisfaction 


lefinitions of women's roles. 


of a feminine woman or her understanding of 
the male image of a feminine woman. Addition- 
the relationships of these three image varia- 
bles with satisfaction and happiness were inves- 


1 
a 
p 

i 
i 


METHOD 
Instrument * 
A questionnaire was developed for use in this 
study. The first section asked for demographic 
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SELF-IMAGE AND STEREOTYPES OF FEMININITY: 
\NCINE E. GORDON ê 
Stanford University 
(he relationships of self-image, the female image of a feminine woman, and the 
‘emale perception of the man’s image of a 
oping behavior, satisfaction, and happiness in a 
women were explored and were found to 
yf the male’s stereotype of femininity was 
of conflicts experienced. The woman's 
function of her self-image. Self-image was 
and happiness. The impact of male stereotypes on women’s conflicts was 
discussed in terms of current social di 
: The c ousness-raising activities of women’s 
liberati have resulted in a renewed interest 
in wom roles, the conflicts inherent in those ally, 
roles, a> die behavior which copes with such 
conflic an earlier study of conflicts in tigated. 
women of the present authors classified 
women's conflicts by sources: self, home roles, 
and non me roles (Hall, 1970). Conflict may 
arise f pressures between different home 
roles, between different nonhome roles, between 


home 
of the 
nonhor 


nonhome roles, between different needs 
or between the self and home or 
roles, 

Hall & Lawler (1971) also classified methods 
of coping with role conflicts as follows: Type 1, 
structural role redefinition (redefinition of expec- 
tations held by others); Type 2, personal role 
redefinition (modification of one’s own expecta- 
tions of one’s own behavior); Type 3, responsive 
role performance (attempting to do everything 
expected without changing others’ expectations). 

The present study explored the relationships 
of types of role conflicts and methods of coping 
behavior to (a) the woman’s self-image, (b) her 
image of a feminine woman, and (c) her per- 
ception of the male image of a feminine woman“. 
These two variables, conflict type and coping 
method, were also related to the discrepancy 


1 This research was supported in part by a Ford 
Foundation grant to the Department of Adminis” 
trative Sciences, Vale University. The cooperation of 
the Alumni office of the University of Connecticut 
is gratefully acknowledged. 

2 Requests for reprints should be sent to: 
E. Gordon, Graduate School of Business; 
University, Stanford, California, 94305. ; 

3 Hawley (1971) has shown that role behavior 
of women is strongly affected by their perception 
of the male’s stereotype of women, regardless of the 
actual stereotypes men have. 


Francine 
Stanford 


information to divide the sample according to 
marital status. The second part of the question- 
naire assessed stereotypes and self-image using 
three sets of semantic differentials. The first se- 
mantic differential asked the woman to rate her- 
self on 28 bipolar adjective scales. Each scale 
had seven positions (1 to 7 points). The second 
semantic differential asked the subject to rate 
her image of a feminine woman on 27 adjective 
scales (excluding the pair “feminine-unfemi- 
nine” because the woman was rating a “femi- 
nine” woman). Finally, each subject was asked 
to rate what she perceived to be the average 
man’s image of a feminine woman, using the 
same 27 pairs of adjectives which she had used to 
rate her own image of a feminine woman. 

Each semantic differential was factor ana- 
lyzed separately. The same four factors emerged 
in each case (Gordon, 197 la). The factors were 
as follows: (a) potency: emotional and physical 
strength; (b) supportiveness : nurturant, agree- 
able behavior in interpersonal settings; (c) 
spontaneity: willingness to take immediate ac- 
tion; and (d) emotionality: level of emotion. 

The third section consisted of the following 
questions: (a) What conflicts or strains, if any, 
have you experienced between your various roles 
in life? (b) How do you deal with these con- 


Details about the instrument, scoring or analysis 
will be made available upon request. 
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from the corresponding factor score on the rating 
of woman's perception of the male image of a 
feminine woman. 
Responses to the question on conflict (section 
3, question 1) were coded using the Hall & Law- 
ler (1971) classification system described earlier. 
If the subject indicated that she did not experi- 
_ ence conflict, she was given a score of 1 for the 
category “no conflict”; if she reported conflict, 
4 she was given a score of 0. Each conflict was 
classified in terms of the competing pressures 
which gave rise to it. For example, a conflict 
between the roles of wife and mother should be 
put in the home versus home category because 
it involved pressures from two roles within the 
home. The remaining five categories were home 
versus nonhome, home versus self, nonhome 
versus nonhome, self versus nonhome, and self 
versus self (intrapsychic). Conflicts were scored 
dichotomously with the subject given 1 if she 
experienced a particular type and 0 if she did not. 
The individual pressures leading to conflict 
were also coded for each respondent, indicating 


Peer yet >e 


e 


ay. 


2242 Snort Nores 
between her self-image and either her own image whether or not she had any conil olving 
flicts? (c) Overall, how satisfied do you feel home pressure, nonhome pressures r pres 
with the way you deal with your roles in life? sure from the self. 
(d) In general, how happy would you say you Methods of coping were classific three 
are? types according to the Hall (197( ne de- 
scribed earlier. The coding score each 
Semple woman were the number of each ty oping 
Ninety female graduates of the liberal arts that she employed, Sixteen strategic oping 
college at the University of Connecticut were with role conflicts were identified. O les of 
selected randomly from each of five graduating 15 and 30 questionnaires, two cc agreed 
_ classes. Analyses showed no significant differ- 61% and 749%, respectively, on res; Next, 
ences between classes, and data presented in this the 16 coping strategies were allo: to the 
dc tudy include members of all classes. three types of coping referred to e so that 
; Of the 450 forms mailed out, 261 (58%) were each woman received one score for number 
$ completed and returned, Of these, 56 were from of Type 1 coping strategies, one si r Type 
~ the class of 1948, 49 from 1953, 55 from 1958, 2 strategies, and one score for Type ıtegies. 
84 from 1963, and 47 from the class of 1968. The question on satisfaction was i using 
No demographic information was available from a scale from 1 (dissatisfied) to emely 
_ the university to determine if the sample was satisfied). Happiness was also scored (very 
representative. Among the 261 returns, 229 had unhappy) to 5 (very happy). The u eneral 
been completed by women who were married. single item questions to measure thes global 
These women constituted the present sample. constructs is based on the work o dburn 
d S, 5 (1969) and Gurin, Veroff, and Field Yi 
Scoring and Analysis 
Four scale scores corresponding to the four RESULTs * 
factors were computed for each person on each Self-Image and Stereotypes Versus C ts 
semantic differential. Each scale consisted of ad- h E: 
_ jective pairs that had their highest loading on Almost one fourth of the sample ; a 
the corresponding factor. Then, the score for Ported no conflicts. The most comm ou 
ech factor scale on the self-image rating was was that between a home role and a home 
subtracted from the corresponding scale score role (31%). The mosh. Coming eee 
on the rating of the woman's own image of a ure was the woman’s home roles (64 
feminine woman. In a similar fashion, each scale None of the self-image or stereotyp- scores 
score on the self-image rating was subtracted correlated significantly with the existence or 


nonexistence of conflict per se. The woman's 
image of the male’s image of a feminine woman 
was the one most closely associated wilh con- 
flicts experienced by the woman. All four factor 
scales of the perceived male stereotype corre- 
lated significantly with home versus nonhome 
conflicts (—.20, p<.01; —.16, p<.05; —.15, 
5 S. 05; 18, p<.01) and with nonhome only 
pressures (—.19, p<.01; —.15, p< .05; —.16, 
b<.05; .18, p < .01). Supportiveness also was 
related to self versus home conflict (r =.18, 
p< 01) and self-only pressures (r= 20, p < 
.01). Thus, the less potent and more emotional 
a woman believed men felt women were, the 
more conflicts she experienced in the home versus 
nonhome category. 

Correlations between the discrepancy scores 
between self-image and perceived male image of 
a feminine woman and conflicts were similar in 
pattern to, but lower than, those involving the 
perceived male image by itself. 


Complete tables will be made available upon 
request. 
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The dis between the woman's self- 
image and ‘age of a feminine woman was 
more stron ited to conflicts than was either 
self-image image of a feminine woman, 
Supportive ems to be an important dimen- 
sion in th crepancies. The less supportive 
a woman Í ative to her stereotype of femi- 
ninity (gr he discrepancy), the less likely 
she was t ience nonhome pressures (—.19, 
$< 01), e more likely she was to experi- 
ence home us home conflict (.16, p <.05). 
Self-Imag Stereotypes Versus Coping 

The be -dictor of coping behavior in this 
study wa woman’s image of herself and in 
particular perceived supportiveness. There 
was a ten v for a woman who perceived her- 
self as m upportive to cope with conflict by 
structura” :edefining her roles (Type 1, r= 
16, p< and not attempting to meet all 
demands on her (Type 3, 7=—-20, P< 
O1). 

Self-Ima> ond Stereotypes Versus Outcome 

The b- predictor of both satisfaction and 
happiness was the woman’s image of 
(which > +< a poor predictor of conflicts). The 
more potent, supportive, and unemotional the 
woman icit, the more satisfied ( 32, 30. 
—.24, respectively, p <.01), and the happier she 
was (r= 235, .20, —.25, respectively, p<.01). 


The woman’s perceived male image of a femi- 
nine woman and her own image of a feminine 
woman were not correlated with her satisfaction 
and happiness, 


Discussion 


Of all the various measures and discrepancy 
scores used in the present study, the woman's 
perception of the male stereotype was overall the 
most closely associated with conflicts. In par- 
ticular, the number of conflicts involving non- 
home roles tended to decrease the more potent, 
supportive and unemotional the woman per- 
ceived the male’s stereotype of femininity to be. 
This finding suggests that in the more male- 
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dominated environment outside the home, 
man’s standards of femininity may create d 
culties for women, It is interesting that wo 
who work (ic, have more nonhome invo 
ment) perceive a greater difference between 
feminine woman and that of 


ceived male stereotype of femininity had rela · 
tively the strongest influence of the variables 
measured. This finding lends support to the ar- 
guments of women activists that men coni 
women not only through discriminating behavio 
but also through their impact on women's 
tudes and inner conflicts. 
Whatever male influence exists appears to have 
its limits, Ultimately, it seems to be the won 
an's own self-image that determines her overall 
satisfaction and happiness. 
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EFFECTS OF EXPECTATION, SOCIAL CLASS, AND EXPERIEN‘ 
ON IN-SERVICE TEACHER BEHAVIOR IN SMALL GROUPS 


LEN SPERRY? 
Marquette University 


Eigħty-one teachers were randomly assigned to planning groups of three and 
then randomly assigned to one of four treatments to test the hypotheses that 
expectation and social class affected small group behavior. Two levels of 
expectation and two levels of school social class were simulated. Results indi- 
cated that the instructional material produced by groups exposed to the 
high-expectation treatment rated significantly higher (p<.01) than that of 
groups exposed to the low level; and that groups exposed to the high- 


expectation, upper-social-class treatment produced instructional materials 

3 that were more intellectual than vocational in orientation. 
l A considerable amount of research has been More than 100 educational studies been 
__ feported concerning variables that affect the in- based on Getzels’s social systems mi (Get- 
j structional planning done by in-service teachers, zels, Lipham, & Campbell, 1968; Wahn 969). 
Olten, such research is both atheoretical and The present investigation was like wis. d on 
4 honcomprehensive, since it has focused on only Getzel’s model and studied three sets o bles 
one set of variables: role, cultural, or personal that might influence teacher in-service ior, 
_ Variables, Subsequently, such studies offer only The effects of the following variables on rvice 
partial explanations and a limited basis for gen- teacher behavior were examined: (a) tural 

eralizations and subsequent research. Thus, there variable—school social class level, ( role 
8 ample indication that a theoretical and a variable—administrative expectation, and c) a 
_ comprehensive framework is required for a personal variable—years of experience in -acher 
better understanding of in-service teacher be- planning. 
s havior. OREA 
© Getzels (1963, 1968, 1969) has offered a gen. Desi 
S N f 2 esign 
_ tral yet comprehensive sociopsychological model 
lor understanding social behavior in institutional To determine whether administrative expec- 


settings, Basically, Getzels’s model assumes that 
human behavior is determined by both norma- 
tive. or sociological, and personal, or psycho- 
logical, factors and that these factors arise out 
of a cultural, or anthropological, context. Within 
an institution, such as a school, there are expec- 
tations shared for various roles, such as teacher, 
administrator, and student, and for personal need 
dispositions. The framework, based on the socio- 
logical theory of Parsons, suggests that both the 
institution and the individual are embedded in 
culture or subcultures, each with its own values 
and ethos. These factors interact in the deter- 
mination of social behavior. 


* This article was based on a doctoral disserta- 
tion submitted to the Graduate School, North- 
Western University, June 1970, An earlier version 
of this article was Presented at the American Edu- 
cational Research Association meeting, February 
1971. 
__* Requests for reprints should be sent to Len 
3 Department of Educational Psychology, 
Marquette University, 502 North 15th Street, Mil- 

„Wisconsin 83233. 


tations could have a significant effect on an 
assigned instructional planning task, the effects 
of two treatments of administrative expectation 
on the outputs of a group task were observed. 
The two treatments were designated as high- 
and low-administrative expectation. The first 
major hypothesis suggested that high expectation 
elicits higher output than low expectation., The 
actual writing of an instructional unit by each 
group constituted the group instructional task, 
which was rated in terms of a quantified index, 
the Rating Scale for Curriculum Evaluation 
(Talmage, 1970) and yielded a total output, the 
group instructional task score, 

A secondary consideration of this study re- 
lated to the social class level of the school! and 


schools produce higher group instructional task 
scores than teachers in lower-class schools, Upper 
was designated to mean upper-middle and upper 
class, while lower was designated to mean lower 
and lower-middle class, An upper- and a lower- 
class school were simulated to ascertain whether 
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social clas significantly affect group in- 


structional ores under the two treatments 
of admini expectation. Furthermore, a 
corollary sis suggested that teachers in 
upper-clas s produce an instructional unit 
that is pr intellectually oriented, whereas 
teachers i class schools produce an in- 
structiona! that is primarily vocationally 


oriented. lection of variables yielded a 


2X2 cova: ign, with two levels of ad- 
ministratiy pectation and two levels of social 
class, year xperience being the covariate. 
Simulatio» erials 

One se mulation materials was produced 
for each four treatments. These materials 
were simi format to those included in the 
In-Basket devised by Hemphill, Griffiths, 
and Fred ı (1962). The simulated materials 
included rips, newspaper clippings, minutes 
from faci cetings, and pertinent demographic 
data. In ion, audiotaped instructions served 
to standa: © the treatment conditions in terms 
of timing and emphasis. These directions, 
along with e other simulation materials, con- 
veyed the vet impression that in-service instruc- 
tional ple ig was of primary concern, or of 
little concen to the administrator of the particu- 
lar school, in terms of the upper- or lower-class 
setting of the administrator’s school. 


Source oj the Data 


Data for this study were obtained from two 
sources: («) from each participant’s Personal 
Inventory Form which indicated the following 
eight variables: age, sex, education level, teach- 
ing experience, curriculum experience, subject 
teaching experience, grade level experience, ad- 
ministrative experience; and (b) from solutions 
to the group instructional task and quantified by 
the Rating Scale for Curriculum Evaluation. 

Reliability for the Rating scale, determined by 
Hoyt’s estimate of test reliability based on the 
analysis of variance (Holt & Stunkard, 1952), 
was 94. Rater reliability among the three raters 
who evaluated the data was 88 (p <.01). 


Sample 


Eighty-one volunteer and nonvolunteer teach- 
ers were randomly assigned to groups of 3. Nine- 
teen of the participants were members of minor- 
ity groups, of which 16 were blacks. Chi-square 
tests to determine equalization of distribution of 
the eight characteristics on the Participant In- 
ventory Form indicated no significant difference 
at the Os level of significance. An F test at 0¹ 
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level showed no significant differences bet 1 
scores obtained by volunteer and nonvolunte 
These 27 groups were then randomly a 
to the four treatments. In a one and one 
hour session, each group was exposed to 
simulation and completed an assigned task that 
consisted of an upper grades, general social stud 
ies unit. 
Other Preliminary Analysis of Date 


The two assumptions underlying the use of the 
analysis of variance are normality of distribu 
tion and homogeneity of variance, Results from 
a probit analysis justified the assumption th 
data of this study represented an unbiased s: 
ple from a normal population, Results fron 
Hartley's Fmax justified the assumption of homo- 
geneity of cell variance. 


ResuLTS 


The main variables of this study were lev 
of expectation and level of social class, wh 
the suggested covariate was number of years of 
experience in instructional planning for each 
individual. In order to justify the use of a co- 
variance design it had to be established that a 
sizable correlation existed between the covariate 
and the dependent variable. Since this correla- 
tion was quite low, the covariance design i 
replaced by 2X 2 analysis of variance design 
that used the least squares method for unequal 
cell frequencies (Winer, 1962). Post hoc com- 
parisons of cell means were performed. 

First, it was found that groups exposed to the 
high level of administrative expectation rated 
significantly higher in terms of group instruc= 7 
tional task scores than groups exposed to the 
low level of administrative expectation (F= 
37.04, df =1, 77, p<.01). Therefore, the first 
null hypothesis was rejected, 

Second, it was found that groups exposed to 
the upper level of the school social class treat- 
ment did not rate significantly higher than groups 
exposed to the lower level of school social class 
(F=1.00, df=1, 77, p<.01). Therefore, the 
second null hypothesis was not rejected. 

Third, it was found that groups exposed to 
the high-expectation, upper-social-class treat- 
ments significantly favored an instructional unit 
that was intellectually oriented over one that 
was vocationally oriented (F = 9.70, df = 1, 77 
p<.01). Therefore, the third null hypothesis 
was rejected. : 


a 


DISCUSSION 


The theoretical significance of this study was 
its validation of a systems model that accounted 


application of the noncom- 
technique developed for the 
Suggested that such techniques 
possibility not only of increasing the 
ope of theory but also of modifying in-service, 
as well as preservice, education by the use of 
such techniques for training and evaluation, 
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I r OF TERMINAL RANKING ERROR ON MEANINGFUL 
OCCUPATIONAL CHOICE 


EDWIN E. WAGNER! ayp THOMAS O. HOOVER 


University of Akron 


existence of terminal error (the tendency to overevaluate the first and 
last positions in ranking a set of entities) was investigated in the ranking 
occupational preferences for trustworthiness. It was found that terminal 
or occurred, but that it disappeared as 
upations to be ranked became more pronounced. 


It has hown that there is a tendency to 
overevalu e first and last positions in rank- 
ing a se ve entities (Wagner & Hoover, 
1974). H z, the initial series of experiments 
illustratir phenomenon dealt with words 
which we ked according to “imagery value” 
and it sec questionable whether these results 
could be polated to more meaningful situa- 
tions suc foreman ranking his workers on 
productivi or a potential voter ranking a group 
of candic The present study explored the 
biasing eñe t of “terminal error” by asking sub- 
jects to d occupations along a dimension of 

trust“ — ‘ask which presumably evokes com- 
mitment emotional involvement and might 
therefore more closely resemble real life situa- 


tions where choices have practical consequences. 


METHOD 


In a study by Rotter and Stein (1971) sub- 
jects rated 20 occupations on the degree to 
which they felt that typical members of each 
occupational group were trustworthy and then 
rank-ordered the occupations from 1 through 20 
on the degree of trust attributed to them. In the 
present experiment, the Rotter and Stein list of 
20 occupations was subdivided into three sepa- 
rate lists of 5 occupations each and presented 
to three groups of 120 undergraduates from the 
University of Akron. Each list of 5 occupations 
was presented in completely counterbalanced 
order (in every possible combination, 5!=120 
arrangements) with instructions to rate the oc- 
cupations on trust. The three groups of subjects 
were instructed to give the highest trust rating 
the rank of 5; the occupation given the next 
highest trust rating was ranked 4, etc.; the rank 
of 1 was assigned to the occupation which was 
trusted the least. List 1 consisted of the first 5 
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1 Requests for reprints should be sent 
E. Wagner, Department of Psychology, 
of Akron, Akron, Ohio 44304. 


intrinsic differences among the 


occupations included in the original 20, that is; 
those ranked 20, 19, 18, 17, and 16 along the 
trust continuum: physicians, clergymen, dentists, 
judges, and psychologists, List 2 included those 
occupations rated 20, 17, 14, 11, and 8: physi- 
cians, judges, psychiatrists, law enforcement of- 
ficials, and executives of large corporations. List 
3 was comprised of those occupations rated 20, 
16, 12, 8, and 4: physicians, psychologists, law- 
yers, executives of large corporations, and auto 
repairmen. By spacing the rankings in this man- 
ner, it was possible to study the influence of the 
degree of known differences among occupations 
on subsequent ranking error. 


RESULTS AND DISCUSSION 


Friedman two-way analyses of variance were 
computed to test the null hypothesis that rank- 
ing of the occupations would be unaffected by 
serial position. Table 1 reports the results along 
with the median value for each ranking position. 
The null hypothesis was rejected in List 1 be- 
yond the 001 level but was accepted for Lists 
2 and 3. Inspection of the medians in List 1 
confirms previous research, inasmuch as the 
terminal points (ranks 1 and 5) were overranked | 
in relation to the center positions. As might be 
anticipated, the greater the intrinsic differences 
among data, the less susceptibility to ranking 
error; Lists 2 and 3, which were more hetero- 
geneous in original selection value than List 1, 
were unaffected by terminal error. However, 
much of the ranking tasks which are encountered 3 
in actual industrial and social situations (e. g, 
workers ranking supervisors or enlisted men 
ranking officers) probably involve fairly close 
discriminations Where terminal error can be ex- 
pected to occur. Since correction procedures for 
ranking error that have been presented in the 
literature (Rosander, 1936; Symonds, 1936; 
Thurstone, 1931) do not deal with terminal 
error, it seems that more attention should be 
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Seve earchers concerned with the assess- 


ment erformance (Campbell, Dunnette, 
Arvey, í Jervik, 1973; Fogli, Hulin, & Blood, 
1971; 7 k & Baker, 1972) have used behav- 
iorally ved rating scales that are developed 
throug! rocedure reported by Smith and 
Kenda $3). The procedure is an iterative 
techniq: results in scaled expectations of 
indepen performance behaviors. 

An of the behavioral expectation scale 
(BES) edure which has been somewhat ig- 
nored is ‘hat of interrater agreement for ob- 
tained e scores. Though developers of be- 
haviors! expectation scales have argued that 
high-scaic reliabilities are evidence of the value 
of behavioral expectation scales, Fogli et al. 
(1971) and Zedeck and Baker (1972) have indi- 
cated that the reliability estimation procedure is 
inappropriate and artifactual. A more appropriate 
reliability index is interrater agreement; that is, 


the agreement between two or more evaluators 
for a distribution of scale scores should be as- 
Sessed when the scales have been applied in an 
evaluation situation. Two studies that have ex- 
amined interrater agreement of specific behav- 
loral expectation scales (Campbell et al, 1973; 
Zedeck & Baker, 1972) found low to moderate 
Correlations (rs ranged from .24 to .55). 
_Zedeck and Baker (1972) also assessed the 
discriminant validity of behavioral expectation 
Scales when used for evaluations; that is, the 
empirical independence of the scales was exam- 
ined after the development procedure yielded 
Conceptually independent scales. In general, re- 
sults indicated poor discriminant validity. 
Zedeck and Baker presented several alterna- 
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“LOPMENT OF BEHAVIORALLY ANCHORED RATING SCALES 
AS A FUNCTION OF ORGANIZATIONAL LEVEL 


MOSHE KRAUSZ asv TIMOTHY OLENO 
University of California, Berkeley 


‘wo levels in an organization (supervisory level and subordinate level) inde- 

ndently participated in the development of behaviorally anchored rating 
cales. Results indicated that though the two organizational levels identified 
imilar behavioral dimensions, the behavioral incidents defining the dimensions 
vere valued differently. Alternative explanations for the obtained differences 
ire offered and a discussion of behavioral expectation scale methodology is 


NICHOLAS IMPARATO 
University of San Francisco 


tive explanations for their obtained results, First, 
and most importantly, was the fact that the two — 
supervisory evaluator groups, head nurses and 
supervisors, were at different organizational lev- ` 
els in the organizational hierarchy, each with 
different functions, responsibilities, and perspec- ` 
tives. It is possible that different superviso 
groups differ in their opportunity to observe and 
evaluate subordinates. a 
A second possibility is that the different super- — 
visory groups valued or perceived diferent be- 
haviors as important in terms of meeting job 
requirements. If hierarchical position has some 
effect on the kinds of behaviors expected, 
furthermore, one could anticipate differences in 
behavioral expectations between not only two — 
supervisory groups representing different organi- 
zational levels but also between a supervisory — 
group and a subordinate group. 
A third alternative is that two different super- | 
visory groups can yield correlated ratings only 
when the behaviors used as anchors are valued ~ 
similarly. That is, the two supervisory groups in 
each study may have valued the same behaviors ~ 
differently. There is no evidence that the value 
placed on some behavior by one group of em 
ployees is the same as the value placed on the 
same behavior by a different group. 
In short, there is reason to believe that a 
subordinate group may differ from a supervisory 
group regarding the behavior believed to be 
necessary for adequate performance. The possi 
bility exists that there are value differences in 
expected and specific job behaviors, as indicated 
in the development of behavioral expectation 
scales, between two levels in an organization 
(supervisory level and a subordinate level). Al- 
though it was possible that different behavioral 
dimensions or qualities would be identified by the 
two groups, the focus in the present study was 
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on the expectation that similar specific behav- 
iors would be valued differently by the two 
levels. More exactly, it was anticipated that the 
Subordinate group would rate a given set of 
behavioral examples as indicative of better per- 
formance than would a supervisory group. 


METHOD 
Subjects 


The research site was a large, private univer- 
sity medical center in Northern California. The 


participated throughout the study, In the last 
stage of the procedure, however, only six of the 
nurses in each of the groups participated. 


A modified version of the procedure reported 
by Smith and Kendall (1963) was used. The 
first stage required the determination of general 
_ behavioral dimensions on which nurses are to be 
“evaluated. Several conferences were held inde- 
pendently with the head nurses (ns ranged from 
6 to 8) and the RNs (ns ranged from 8 to 9) 
to achieve the objective. Once general dimen- 
‘sions were agreed upon, specific examples of 
behavior in each dimension were provided and 
edited into the form of expectations of specific 
behavior. Each nurse submitted three examples 
for each dimension; one example reflected good 
performance; one reflected satisfactory perform- 
ance; and one reflected poor performance. 
The second stage required that members of 
: both groups indicate, independently, the dimen- 
_ sion that was illustrated by each example ob- 
_ tained in the first stage. Examples that were not 
reallocated to the dimensions for which they 
were provided were eliminated. The criterion for 
retention of an example was 75% agreement. 
A That is, 8 of the 12 nurses, with a minimum of 
4 from each group, had to indicate that the 
example illustrated the same behavioral dimen- 
Jon. If less than 8 placed a specific behavior in 
the same dimension, that behavioral example was 
eliminated. 

The third stage required that all remaining 
examples be assigned numerical values which re- 
flected the desirability of the behavior illustrated, 
That is, each example was assessed on a scale 
ranging from 1 (poor) to 7 (good) by each of 
the nurses (RNs—n = 6; head nurses—n = 6). 
Means and standard deviations were computed 
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for each example. Examples with rd 
ations greater than 1.50 were elim 


Resutts AND Discuss: 


The first stage, the identification ha 
dimensions, was conducted independ a in g 
two groups. Several conferences w të h 
nurses resulted in the identification « “di 
sions; the RNs also identified a set dim 
sions. There were 12 overlapping or ilar di- 
mensions. Five dimensions were idi d ont 
by the head nurses and 4 only by t vs. 
question arose as to whether the ed d 
ferences were due to either subst di 
ences in job perspective or due to differ- 
ences precipitated by inattention « ersigh 
Additional conferences were held to re 
the latter possibility did not accou: for 
obtained differences. The role of the tigi 
who participated in both conferenc ith 
head nurses and the RNs was mer: forma- 
tive and, thus, they suggested addi quali 
ties to each group. The purpose of | confer- 
ences was to elicit a comprehensive ription 
of the operating room and the reco. room 
nursing position and not to impose the \lues 0 
assessment system of the investigator on 
participants. These additional meeting: res 
in common agreement that 22 dimensio > o 
to be considered in evaluating nursing sonnel 
The dimensions were as follows: adz; ‘ability 


clinical judgment; communications—patient di- 
rected; communications—co-worker ‘ir ed; 
cooperation and attitude; dependability., initia- 
tive; knowledge—aseptic technique; leade 

ability; mechanical ability; organizational ab 
ity; operating room and recovery room orienta- 
tion; perceptiveness; professional conduct; p 0- 
fessional growth; quality; reaction to and func 
tion under stress; scrubbing ability; and teaching 
ability.? 


three examples for each dimension, the sult 
was 420 diferent examples of behavior. In 
second stage, each nurse reallocated each of ie 
420 examples. Thus, each nurse evaluated exam- 
ples submitted by her peers and by members ot 
the other organizational levels. Elimination oF 


21 dimensions. The quality dimension was e 


Definitions of each dimension may be obtained 
from the first author. 
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iples provided for it were 
dimensions, 


mated since 
reallocated | 


The sign se Siegel, 1956, pp. 68-75) 
was used to c the data obtained in the 
third stage the assignment of numerical 
values to ca e 249 examples. The purpose 
of the sign to analyze the extent of the 
Gfference i valuation of the behavioral 
examples b he head nurses and RNs. Of 
the 249 ex valuated, 177 examples were 
assigned hiz in values by the RNs (719%), 
the subordi roup, whereas 72 examples were 
assigned | equal values by the head 
nurses, the ivisory group. Note that this 
latter situs 72 “less than” or equal“ 
evaluation light departure from the usual 
Sign test . Ties or equals are usually 
omitted fr: e analysis yielding a reduction in 
the total A his study, ties were counted as 
“minuses' vorked against the hypothesized 
results; thu conservative sign test was even 
more consi ve employed in this fashion, 
Neverthele frequency of higher valued 
examples b RNs was significantly different 
($ < .001). 

An analy: mean values within each of the 
21 dimensi indicated that the overall 71% 
(177/249) er mean value assignment by the 
RNs was a ed for all but three dimensions: 
appearance echanical ability, and perceptive- 
hess. For quality of the appearance dimen- 
sion, five + mples were assigned higher mean 
values by te RNs, whereas seven examples were 
assigned higher mean values by the head nurses. 


For the nanical ability dimension, seven 
examples were evaluated. higher by the RNs, 
whereas eight examples were evaluated higher 
by the head nurses. For the perceptiveness di- 
mension, three examples were evaluated higher 
by the RNs and three examples were evaluated 
higher by the head nurses. Examination of the 
individual examples in these three dimensions 
Provided no reasonable hypotheses for the de- 
Parture from the overwhelming result of greater 
mean value assignment by the RNs, the subordi- 
nate group. For the 18 dimensions within which 
the differences between the RN and the head 
nurse in mean values support the hypothesis, the 
Percentages of higher value assignment by the 
RNs ranged from 64% to 92%. 


CONCLUSIONS 


The main aspect of performance evaluation in- 
vestigated in this study was whether there are 
differences in the values assigned for a given 

havior by two levels in an organization. Re- 
Sults for this study indicate that there are. Par- 


subordinate level, to 
RNs themselves. This 
prising if we accept the notion that 


of what the subordinates actually do. The 3 
is surprising if we examine the nature of be 
position, nursing care in operating and 
rooms, The impressions conveyed to the 
gators by the head nurses and the RNs was t 
only those nurses who exemplify high-standard — 
nursing proficiency are employed in the medical 


roperly, however, the present research method- 


p 5 
ology should be extended up and down the hier- 


assigned values, conferences should be held for 
the purposes of discussing and resolving the 
differences. S 
One methodological problem in this study was 
that very few examples depicting satisfactory or 
average performance on a poor to good contin- — 
uum were identified. This problem was recog- 
nized by other researchers using the behavioral - 
expectations scale (Harari & Zedeck, 1972; 
Landy & Guion, 1970). Landy and Guion (1970) 
provided examples with values of poor and good 
performance for each dimension and then re- 
quested that participants assign values to 
remaining examples. The anticipated result 
more intermediate performance examples would 
be retained was not achieved. ‘ 
This study specifically requested that par- 
ticipants indicate average or satisfactory pe 
formance. Nonetheless, few examples of average 
value were retained. A strategy for future inves- 


tigations might be to provide specific anchors 
with values for poor performance, good per- 
formance, and average performance prior to as- 
signment of values to the remaining examples, 
That is, specific behavioral examples which par- 
ticipants agree represent poor, good, and average 
performance should be identified in conferences, 
Then, with these as frames of references, values 
Would be assigned to the remaining examples. It 
is anticipated that more intermediate examples 
would be obtained. If this did not occur, at least 
7 the one anchor of average value, developed in 
the conference, would be usable, 
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ERRATUM 


: In the article by Joseph Weitz and Seymour Adler, “The Optimal Use of S 

tion,” which appeared in the October 1973 issue (volume 58), a sentence oÑ 
223 was incorrectly set. It should read: If, on the other hand, the very first . 
the real task is of great consequence (such as in emergency ditching procedures), 
then simulation training may be worthwhile. 
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RACIAI FERENCES IN THE IMPACT OF MATHEMATICS 
TRAIN AT A MANPOWER DEVELOPMENT PROGRAM ' 


MICHAEL E. GORDON # axo RICHARD D. ARVEY 
University of Tennessee 
WILLIAM C. DAFFRON, Jt, Ax DENNIS L. UMBERGER 
Tennessee Eastman Company, Kingsport, Tennesiee 
pact of mathematics instruction on the measured numerical compo- 
„% class of 80 black and 83 white trainees was investigated at a 


{ul manpower development project. The California Achievement Test. — 
high school level—was administered at successive six-week intervals 


ie as than did the blacks. These findings imply that racial differences 
assured mathematical competence are not reduced by training but may 
t be increased. 

The pr of equal educational oppor- tion in basic skills designed to upgrade 
tunities a cial training programs for trainee’s ability in English and mathematics. 
disadvan caments of the population is With regard to racial differences, most 
frequently ed as a means of reducing the evidence on the effects of special educa 
discrepancies: in measured cognitive compe- tion on measured differences in performance: 
tence and doing have an impact on the on cognitive tests is cross-sectional research — 
limitations vocational achievement that and, unfortunately, does not provide a direct 
these disc -ncies impose. The large number test of the impact of training on the relative 
and variet of publicly and privately spon- numerical and verbal skills of black students 
sored pro. ams testify to the enthusiasm and white students (for a recent review of the | 


with which medial or compensatory instruc- impact of special educational programs, see 
tion has been embraced as a means of reduc- St. John, 1970). The few longitudinal valida- 
ing these disparities. Special educational pro- tion attempts have been directed at programs 
grams exist for children of all ages, from the for very young children (kindergarten th 
widely publicized preschool Head Start pro- second grade) and, typically, have indicated 
gram, through elementary school programs that initial racial differences in cognitive 
such as Project Concem (Mahan, 1968), to competence are reduced because of the spe- 
high school and college level programs such cial educational opportunities provided (e£, 
as Project ABC (Wessman, 1969). Industrial Katzenmeyer, 1963; Stallings, 1959). 
training for the hard-core unemployed and Precious little longitudinal research has 
the disadvantaged also has stressed instruc- been performed to determine the effect of 


1 Support for this stud: was secured by a grant % 
from the U.S. Daa of Labor and the Atomic ences in test ‘performance between mature 


Energy Commission under interagency agreement blacks F 
(DOL 82-45-69-09, AEC 40-57-68). ing literature revealed only one relevant study 


b This report is based in part on the theses prepared (Miller, 1971) that attempted to monitor be 
y the third and fourth authors, under the direction effects of a basic education program on adult 


of P f 

a Tas a 115 „ s a test performance by eight Caucasians and 

industrial management at the University of Tennes- eight Chippewa Indians. 

Ea There are several important reasons which 
Requests for reprints should be and Michael ocessitate the study of the impact of indus- 


E. Gordon, Department of Industrial and Personnel ne. “ae ; ; 
agement, eiS of Tennessee, Knoxville, trial training on the intellectual competence 


Tennessee 37916. of blacks vis-à-vis whites. First, it is impor- 
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tant to examine whether either group (blacks 
or whites) profits from a given learning ex- 
perience in order to ensure that the training 
program is having a desired effect. Second, if 
differential training effects are found between 
blacks and whites, quite possibly different 
training techniques and procedures might be 
implemented. Third, it is useful to examine 
any differences that may exist between racial 
groups rather than to ignore or attempt to 
mask them. To ignore differences that may 
actually exist could result in sponsorship of 
training which exacerbates the problem of 
racial differences. 

The present research was designed to com- 
pare differences in test performance between 
blacks and whites before, during, and after 
exposure to a federally sponsored manpower 
development project for the unemployed and 
underemployed. Specifically, the trainability 
of black students and white students in the 
area of mathematics was investigated in a 
longitudinal study involving the use of stan- 
dardized tests at six-week intervals over a 
period of six months. 


METHOD 
Research Site 


The study was performed at a manpower develop- 
ment project located in eastern Tennessee that pro- 
vided instruction in six industrial skills: machining, 
welding, mechanical operations, physical testing, 
electronics, and drafting. In addition to the skill 
training, related instruction was provided in com- 
munications and mathematics. The primary objec- 
tive of the mathematics instruction was to provide 
the trainee with a ninth-grade high school level of 
competence. To this end trainees attended classes 
three times a week on the average to receive in- 
struction in computations involving fractions and 
decimals, recognition of formulas and symbols, 
algebra, and trigonometry. The related instruction 
was based on the mathematics requirements of the 
particular industrial skill that each trainee was 
learning. Private tutorial sessions were also provided 
to help those trainees who were falling behind in 
the classroom sessions. Although no formal records 
were immediately available to substantiate what 
occurred in these private sessions, the supervisor of 
related instruction indicated that blacks were more 
frequently involved than were whites in these spe- 
cial tutorials. 

The effectiveness of this particular training pro- 
gram has been well documented in terms of both 
economic criteria, for example, cost-benefit analysis 
(Kirby & Castagna, 1969), and behavioral criteria, 
for example, the effects on the personal lives of the 
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trainees (Gordon & Scott, 1972). T? ect 
be adjudged successful by compari ith 
manpower development programs ir s of 
high percentage of trainees (79% for tire d 


of trainees involved in this study comp 
the training and are graduated 


Subjects i 
The subjects comprised the class vegan its 
training in April 1970, The trainee recruited 
from a four-state Appalachian regior variety 
of agencies such as the Chattanoog cntrated 
Employment Program. A total of 18 cts 
resenting all six training areas were on 
entry into the program. Of this nuw. 80 we 
black and 83 were white. The black | and 
white trainees were quite similar oi avel 
with respect to age, education, an loyn 
status on entry, but differed with : t to 
percentage of trainees that completed progi 


(84% of the white trainces versus 75‘ he black 
trainees completed the program). 
Four American Indians and three Sy 


trainees were also tested, but their dat gnore: 
in subsequent analyses. Ten additi subj 
whose race was not determined were 3 card 
Criterion 

The two arithmetic subtests which rise tl 
junior high school level of the Califo Achie 


ment Test (CAT; California Test Bu: , 196 
were used as measures of competence in 
The Arithmetic Reasoning subtest consi 
questions and was divided into thre 
meanings, worded problems, and symbols, rules, am 
equations. The Arithmetic Fundamentals Test © 
sisted of 80 items and was subdivided into 
sections: addition, subtraction, multiplic ion, 
division. The criterion employed was the ictal si 
based on both subtests. a 
The four alternate forms of the CAT (W, X. 
and Z) were used in this research. Uniortunatel 
the published norms, reliabilities, and validities ap 
plied only to Form W.s Although they contal 
parallel content, the equivalency of the altermal 
forms has not been verified empirically. 


Procedure 


Premeasures. In order to test equivalency, ead 
subject took two different forms of the CA’ 
part of the orientation program for new trali 
The 153 subjects who were present for the fis 
day of orientation were randomly subdivided so tà 
approximately equal numbers took one of the si 
possible pairs of the four forms. Equivalency W 
checked by comparing the means and variances 4 
the four forms and by comparing the magnitude 0 
the six intercorrelations among all forms of th 
Separate normative data on blacks and on 


3 All subtests of Form W have reported relial 
ties about .80. 


were not d due to the limited sample size 
of each gro 

Bartlett’ was used to determine whether the 
variances homogeneous. The observed chi- 
square of (di=3) was not significant (p> 
10) which i>dicated that it was tenable to accept 
the null h besis of equivalent variances among 
the alternate forms. 


A one-way «nalysis of variance was performed to 
test the icance of the difference among the 
Means of the alternate forms. It appears as though 
the means oi the forms were equivalent except for 
random variation, since the obtained F ratio of 1.59 


(df= 3/149) was not significant (> > 10). 
Finally, the homogeneity of the six intercorrela- 
tions among the four forms was determined using 
the chi-square procedure originated by Fisher (see 
Edwards, 1960). The intercorrelations were in the 
range between .93 and .96 except for the correlation 
between Forms X and Z which was .86. The ob- 
served chi-square of 5.45 (df=5) was not signifi- 
cant (p>.10), a fact which suggested that the 
four forms could be considered equivalent since 
their intercorrelations were all estimates of the same 
Population value. 
Postmeasures. 
CAT forms was demonstrated, 
administrations were scheduled at six-week intervals 
over the training program in order to measure 
changes in arithmetic competence. The tests were 
_ administered in regularly scheduled math classes by 
the related instruction teacher. The subjects were 
given each of the four forms of the test at least 
once, although the order of administration was 
randomized for each trainee. Owing to limited re- 
sources, no effort was made to follow up on a 


“Tables containing means, variances, and inter- 
correlations for each of the four California Achieve- 
1555 Tests (CAT) forms are available on request 
tom the first author. 
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cure 1. Mean California Achievement Test (CAT) scores for blacks and 
for whites for each administration in Groups 1 and 2. 
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trainee who was absent on the day a test was 7 
administered. Only those subjects who had com- 
plete longitudinal data covering a minimum of the 
first 12 weeks of training, that is, all four CAT 
administrations, were used in the study. Using these — 
guidelines for inclusion, data were available for 38 
subjects on all five 
administrations of the (Group 1). Another 
group of 13 black subjects and 17 white subjects 
had complete data on the first four tests (Group 
2) Overall, 60% of the black trainee population 
and 73% of the white trainee population were the 
basis for subsequent analyses. 


RESULTS 


The data from Groups 1 and 2 were evalu- 
ated separately, although the analytical pro- 
cedures employed were identical in both 
cases. Figure 1 contains plots of the means 
on each of the administrations for each ra- 
cial group. Note that Administration 1 in 
both graphs represents the average of the 
two tests given on successive days during 
orientation as premeasures. Examination of 
the standard deviation within racial groups 
for each trial did not reveal any important 
differences or trends either between or within 
groups with respect to variability of the cri- 
terion distributions. 

The data summarized in Figure 1 were 
subjected to a two-factor analysis of variance 
with one repeated measure (Winer, 1962). 
The assumption of homogeneity of variance- 


5 This group was composed primarily of welding 
trainees who received just three months of math 
instruction. 


TABLE 1 
Summary or ANALYSIS OF VARIANCE FROM Groups 1 AND 2 
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e n 


1 


Source of variation 


Race (A) 

Subjects within A 
Administrations (B) 
AXB 
B X Subjects within A 


* Significant at thea = .025 level. 


P MS 
5.115 10195 | 1 
1,125.73 28 
44.48"* 839.08 | 2 
3.19° 248.29 | 2 
26.84 56 


. Significant at thea = .10 level (df = 1/77) based on the conservative test suggested by Greenhouse anc 


** Significant at the a = .01 level (df = 1/77) 


covariance matrices on which this analysis 
of variance procedure is based was tested, and 
the observed chi-squares were significant in 
both groups (5 N .01). Therefore, the more 
conservative test suggested by Greenhouse 
and Geisser (1959; see also Winer, 1962, p. 
305) was employed to determine the signifi- 
cance of the F ratios obtained in the analysis 
of variance procedure involving the repeated 
measure (the administrations factor). These 
results are reported in Table 1. 

Examination of the results revealed that 
the administrations and the Race X Admin- 
istrations effects were statistically significant 
in both samples. The significant interaction 
terms prompted tests of simple main effects 


TABLE 2 


INTERCORRELATIONS AMONG ADMINISTRATIONS IN BOTH 
Groups ror Each RACE 


Group 2 


Group 1 
Administration 
2 | 3 | 4 2 3 

Blacks 
1 92 80 -80 97 04 
2 92 76 -93 
3 80 

Whites 
1 22 8 || eke 9) 
2 94 91 97 
3 93 


2901 All of the reported correlations are significant at 
* = 01. 


based on the conservative test suggested by Greenhouse anc 
*** Significant at the a = .01 level (df = 1/28) based on the conservative test suggested by Greenhouse and 


of race for the first and last administration: 
both groups. In Group 1 the differences be 
tween the means of the whites and the b 
were not significant (p> .10) on the 
measure (F = 1.66, df=1/77) but th 
were significant (p < 01) on the fourth ad 
ministration (F = 7.54, df = 1/77). In Grow 
2 the simple main effects were not significan 
(p> .10) for either the premeasure (F= 
36, df = 1/28) or the third administratit 
(F = .36, df = 1/28). 

Finally, the stability of the trainees’ rel 
tive competence in math was examined | 
computing the intercorrelations among t 
results on each administration for each ra 
separately. These results are reported 
Table 2. All the observed correlations we 
significant (p< .01), and these res 
strongly suggest that the trainees maintain 
their relative standing within their rae 
groups throughout the training period. 


DISCUSSION 


The results from Groups 1 and 2 ind 
a differential training effect for blacks 4 
for whites. The significant main effects 
the administrations suggested that both rac 
profited from the training. However, t 
whites clearly derived greater benefit from ti 
training experience than did the bla 
fact revealed by the significant Race x 
ministrations interaction terms observed 
both Groups 1 and 2. This differential ti 
ing effect was especially prominent for 
1 in which the instruction produced 
cant racial differences in terms of mei 


mathematics 
subjects tha 
sure. 


erence between groups of 
ed alike on the premea- 


Before t erential training effects ob- 
served are ited to different racial levels 
of ability 1, a number of potentially 
confoundi: bles must be examined. 
Three suc bles that suggest viable al- 
temative ations for the results are 
discuss ed 

First, e of socioeconomic status as 
a determi { learning ability has received 
a great ıttention in educational litera- 
ture (Je 1968). It has been noted 
frequent! low-socioeconomic-status chil- 
dren per more poorly on cognitive tests 
than do cioeconomic-status children and 
that the: significant correlation between 
educatio: hievement and socioeconomic 
status. I ier to rule out the possibility 
that diff in socioeconomic status might 
have be. lated to the differential training 
effect ob ed, additional background data 
on the sv cts were scrutinized. Examination 
of the lit d socioeconomic status informa- 
tion ava‘: e on the subjects in both Groups 
Land 2 : caled that 94% of the blacks and 
61% of tse whites were classified as “disad- 
vantage. Learning curves were prepared 


for Groups 1 and 2 on the basis of this mea- 
sure of socioeconomic status. 

The 4? disadvantaged subjects in Group 1 
were composed of 20 whites and 22 blacks, 
while the 17 advantaged subjects were com- 
posed of 15 whites and 2 blacks, These 59 
subjects for whom socioeconomic status data 
were available represented approximately 
75% of the original 79 subjects in Group E 
The same 2 X 4 analysis of variance pro- 
cedure used previously was applied to Group 
1 data subdivided on the basis of socioeco- 
nomic status. The results indicated that the 
main effect for socioeconomic status was not 
significant (F = 1.85, df = 1/51, a= lO): 
Consequently, socioeconomic status probably 
©The term “disadvantaged,” according to the U.S. 
Department of Labor guidelines, is defined as fol- 
lows: “A disadvantaged individual, for manpower 
Purposes, is a poor person who does not have 
suitable employment and who is either (1) 3 school 
dropout; (2) a member of a minority; (3) under 
22 years of age; (4) 45 years of age or overs or 
(5) handicapped.” 


rr „ 
, A T a 


„ DIFFERENCES IN TRAINING aT A MANPoweER PROGRAM 


in this research.’ 
Second, a closer look at the nature of the 
learning environment also is necessary in 
order to afford a clearer interpretation of the 
results previously presented. Obviously, if the 
black trainees did not receive treatment and 
instruction of the same caliber as that re- 
ceived by the white trainees, the differential 
effects could be explained easily in terms of 
environmental factors in the training situa- 
tion. However, it is very difficult to collect 
and present objective data on the parameters — 
of a learning situation that might reveal the 
existence of either persistent or intermittent 
racial discrimination or favoritism. Conse- 
quently, the only evidence regarding the ra- 
cial climate that can be offered is anecdotal. 
The first author acted as a consultant on 
personnel problems and research for two years 
at the project described in the present study. 
During this time, he had ample opportunity — 
to observe the staff of trainers which ap- 
both dedicated and egalitarian, Al- 
though there were occasional reports of racial 4 
problems at the project. close scrutiny of the 
data available on training personnel and 
trainees failed to substantiate any of these 
isolated reports. Certainly there was no sys- 
tematic or pervasive discrimination against 
blacks that might be offered as a viable ex- 


ing effects. Furthermore, the high stability of 
measured competence within the black sample 
over the duration of the training militates k 
against an explanation of the lower CAT , 
performance on the basis of frequent inci- 
dents of discrimination against randomly 
selected trainees (the median coefficient of 
stability for the six-month time interval was 
80 for the blacks in Group 1). 4 

A third relevant variable which deserves 
attention is the motivation to perform well 
on tests and/or the will to achieve in he 
training program. It is possible that he 
blacks perceived little reason for exerting 
themselves on the tests or in the classroom 
because they saw few, if any, rewards con- 
tingent on high performance. Certainly, the 

7A similar analysis on Group 2 was not attempted 
given the limited number of advantaged subjects 
identified. 


, 
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historical record of discrimination in hiring 
practices in the United States tends to de- 
velop and support lower expectation among 
the black trainees. Unfortunately, data on 
expectancies and perceived rewards were not 
available on these subjects. 

Although these results suggest that indus- 
trial training programs may be actually 
increasing the disparity in measured compe- 
tence between black trainees and white 
trainees, one must not overlook the benefits 
which accrued to members of both racial 
groups. A ninth-grade level of competence in 

math was attained by approximately 90% 
of the white trainees and approximately 
80% of the black trainees. Furthermore, 
_ there was nothing in these results which sug- 
= gested that the mathematical learning in 
black subjects had reached a final asymp- 
~~ tote. More concentrated or protracted train- 
ing of blacks relative to whites might have 
caused the former trainees to reach the same 
level of competence as the latter. However, it 
is of substantial importance that trainers rec- 
ognize the possibility that the instruction 
_ they provide may have to be modified in 
order to avoid the differential training effects 
noted in the present study. 

These considerations suggest the need for 
further research to replicate and extend these 
findings. A rigid interpretation of the results 

in terms of learning ability certainly is not 
warranted at this time. Motivational ele- 
ments which might account for these data 

Should be explored as should the relationships 

of these test differences to future job per- 


| Finally, a related methodological problem 
has been raised by these results, namely, the 
= possibility of the CAT unfairly discriminating 
against black applicants. Although the guide- 
lines for considering a test unfair to blacks 
are not entirely clear, one standard index 
typically examined in the determination of 
the fairness of a test is the mean difference 
between racial groups (Bartlett & O'Leary, 
1969). It is interesting to note that in terms 
of the results for Group 1 the fairness of the 
CAT would not be questioned for the un- 
trained subjects but would create serious mis- 
givings at the conclusion of training, Clearly, 
the absence of a job performance criterion in 


_ formance criteria. 


eS lee ee Ur 


GORDON, Arvey, Darrron, AND UMBERGER 


the present study prohibits : 

the validity of the CAT for 

groups, However, given a sel: 

less than .75, any company u 

to select graduates of the pr 

entry level employment wo: 

substantially more whites tha 
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social problem in inner-city 
s the relationship between po- 
lents. It has been considered 
police officers should be re- 
sponsive mmunity needs (Clark, 1970). 
Howeve e are differences of 

about | ie definition of a good, respon- 
sive polic an, and the best method of te- 
cruiting training him to maximize re- 
Sponsiver, Police officials tend to believe 
that a go | policeman is a good policeman, 
regardles: of race, color, creed, or cultural 
backgron (Bouma, 1969; Radelet, 1966; 
Wilson, 1°48). Conversely, community lead- 
ers tend argue that responsiveness is best 
found among policemen who are similar to 
the community in terms of race, place of 
residence, and cultural upbringing (Edwards, 
1968: Mast, 1970). 

The recruitment of minority group mem- 
bers as policemen and the effect of that re- 
cruitment on police forces and communities 
tequired exploration. Earlier studies (Alex, 
1969; Hunt & Cohen, 1971) reached differing 
conclusions about whether minority group 
policemen are as service oriented as white 
policemen. Kelly and West ( 1972), compar- 
ing 193 black and 316 white Washington, 
D.C., policemen, found that a higher propor- 


‘The data used in this article was collected and 
analyzed under contract BOO-5133, between the 
Office of Economic Opportunity and the American 
Institutes for Research. The opinions expressed herein 
are those of the authors and should not be construed 
as representing the opinions or policy of any agency 
of the United States government or of the American 
Institutes for Research. Gratitude is expressed to 
both institutions for permission to use the data. 

2 Requests for reprints should be sent to R. Kelly, 
10 Wilson Terrace, Elizabeth, New Jersey 07208. 
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responsiveness was assessed via a technique identifying positive and 
‘ive attitudes of policemen towards inner-city residents, Analysis of some 
graphic characteristics of the most positive (responsive) and most Dega- 
(nonresponsive) groups revealed that race was the most important dis- 
ishing variable between them. Evaluations of the behavior of tl 
their immediate superiors supported the assumption 
itive group are rated higher than those in the negative group. 


f 
f 


polce in the 


tion of blacks than whites were service oris 
ented. They also found, contrary to popular 
assumptions, that the transition of the Wash- 
force from one with 


sought to develop an operational definition of 
“responsiveness.” It addressed the policy 


recruited, 
blacks as compared to whites? (b) If blacks 


r 
initial results of analyses of the connection 
between demographic variables and the most 
and least responsive police groups are pre- 
sented. 


METHOD 
Subjects 


rate by extent of contact was also small, ranging 
from 84% to 909%, with those having least direct 
street contact with 
response rate. Of the 509 
black, the exact percentage of blacks on the police 
force at that time. 
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Analysis of four Guttman scales (see Design) 
revealed that those policemen who scored negatively 
on the Fear of Citizens Scale (the Police Defensive 
Posture Scale) also tended to score negatively on 
the other three Guttman scales. Therefore, a card 
sort was done on the 509 subjects to determine how 
many scored positively on all scales and vice versa. 
This study is based on the sample of the 30 police- 
men who scored negatively and the 32 who scored 
positively on all four scales. These 12% of the 
original 509 subjects could be classified as “ex- 
tremes.” It was felt that the scales might have 
tapped a basic attitudinal orientation toward re- 
sponsiveness or nonresponsiveness. Therefore, the 32 
most positive group was designated “responsive,” 
and the 30 most negative group was called the 
“nonresponsive” group. 


Design 


A portion of the questionnaire contained attitude 
scales designed to elicit responses regarding police 
attitudes toward and relationship with residents of 
their work areas (Kelly, Dockett, Farber, West, & 
Gray, 1972). Five of Guttman’s criteria of uni- 
dimensionality were applied (Stouffer, et al., 1950): 
(a) total sample and individual item reproducibility 
near .90, (b) an adequate range of marginal fre- 
quencies, (c) lack of substantial nonscale items, (d) 
at least four items per scale, and (e) sufficiently 
numerous and varied response categories. Because of 
time and expense, the last criterion was not strictly 
applied. The original categories (strongly agree, 
agree, neutral, disagree, strongly disagree) were col- 
lapsed into positive/negative dichotomies with neu- 
trals being considered positive. Using these criteria, 
four scales were identified. 

The first scale, Policemen’s View of Citizens’ Sup- 
port, asked how often residents (a) take a hand 
when juveniles and children are causing trouble or 
are on the verge of breaking the law; (b) identify 
themselves when necessary and appear in court 
when requested; (c) help police identify criminals; 
and (d) report crimes they observe. 

The second scale, General Faith in People, in- 
quired about the extent of agreement with the state- 
ments: (a) No one is going to care much what 
happens to you, when you get right down to it. (b) 
Human nature is fundamentally cooperative. (c) If 
you don’t watch yourself, people will take advantage 
of you. (d) Most people can be trusted. 

The third scale, Police Fear of Citizens or Police 
Defensive Posture, was measured by intensity of 
agreement/disagreement on five items: (a) In this 
city, physical combat skills and an aggressive bear- 
ing are more useful to a patrolman on the beat 
than book learning and a courteous manner. (b) 
Most people in my district do not respect police- 
men. (c) Residents in this city view the police as an 
alien-occupying army. (d) A policeman must be on 
guard or residents will take advantage of him. (e) 
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The likelihood of a policeman beir i by dti- 
zens in this city is very high. 

The fourth scale, Police Stereoty; Poor Peo- 
ple, was composed of these items the truth 
were known about poor people, it they are 
lazy and don’t really want to work ople live 
in poverty areas only because the unwilling 
to help themselves. (c) All people i rty areas 
want is a handout without working (d) One 
of the main causes of poverty of moral 
Strength and willpower. 

RESULTS 
Responsiveness and Supervisor ings 

The two extreme groups we: vared to 
determine how the operational ition of 
responsiveness might be related police- 
men’s actual behavior. Comp: is were 
based on ratings of policemen heir su- 
periors in the areas of “bearing a: *havior” 
(the extent to which the ratee mé andards 
of bearing, dress, courtesy, avo motional 
control, and enhances the image e police 
department) and “human rel: (the 
degree to which the ratee promo armoni- 
ous relationships with other men of; the 
police department and with civil associ- 
ates). As expected (see Table 1), ‘ficers in 
the positive group rated well above their 
negative counterparts on both ensions. 
Compared to only 30% of the ogatives, 
69% of the positives were rated above aver- 


age in bearing and behavior. (Note that 40% 
of the negatives were under 25 years of age 
and were too new on the force to have had 
ratings placed in their files.) On the human 
relations dimension, 72% of the positives 
were rated above average, compared to only 
33% of the negatives. 

An overall rating for each policeman con- 
sidered both these dimensions and such things 
as learning ability, knowledge and skill, and 
acceptance of responsibility. The two groups 
showed no statistically significant difference 
in this study, but more of the positives than 
the negatives were rated above average- 

This correlation between ratings and te- 
sponsiveness group indicates that the Guttman 
scales are assessing both a basic attitudina 
orientation and a strong behavioral predisP 
sition, 


ind Demographic 


ial questions are: Will better 
men be more responsive in 
s? Will black policemen be 
Although the particular data 
is study do not answer these 
do permit an assessment of 
What proportion of the police- 
ch of the extreme groups have 
nographic characteristics? So 
can judge what potential con- 
efined categories of responsive 
ight have to these direct policy 
me brief analyses are discussed 
se Table 2.) 
». All 30 policemen in the most 
coup are white (two had Spanish 
_ Of the 32 most positive respond- 
ts, 56% are white. Race thus seems quite 
" Significant in determining the extent to which 


TABLE 1 
RESPONSIVENESS AND SUPERVISORS’ 
RATINGS 
Least re- Most re- 
Supervisors rating sponsive sponsive 
(n = 30) (n = 32) 
— 
J Bearing and behavior* 
(%) 


_ Average or below 30 22 
Above average 30 69 
NA 40 9 


Human relations” 


(%) 
————— — —— —e— 
__ Average or below 27 19 
Above average 33 72 
NA 40 9 
Overall evaluation® 
i - (%) 
— xx S 
5 Average or below 27 25 
Above average 33 65 
NA 40 5 


Vote. NA = not appli 

NA = plicable. 

en for Fisher's Exact one-tailed test = .067. 
4 for Fisher's Exact one-tailed test = 081. 
' for Fisher 's Exact one-tailed test = -20. 
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IDENTIFYING RESPONSIVE Inner-Crry POLICEMEN 


TABLE 2 


RESPONSIVENESS AND SELECTED Deuocrarme 
CHARACTERISTICS 


: 


sponsive 


Note. NA = no answer. 
$p <.01 for x. 


policemen perceive the inner- city residen 
negatively, but does not seem as strongl; 
associated with the composition of the posi 
tive group. 

2. Education. Tt is often asserted that indi 
viduals with advanced education beyond hig 
school level make better policemen than thos 
with lower educational attainment. An analy 
sis of level of education, however, indicati 
that the groups are almost identical in tł 
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"distribution of the respondents within the 

various educational levels. 

3. Rank, No sharp differences exist in terms 
of the rank of the policemen either, although 
the proportion of respondents with higher 
ranks in the most positive group was 10 per- 

_ centage points greater than in the most nega- 

tive group. 

4. Age. Although the difference in age be- 
tween the two groups is not statistically sig- 
nificant, the largest percentage of the most 
negative are 24 or younger, while the largest 
proportion of the most positive sample is 35 
ot older. New recruits and younger police- 
men might be more negative toward residents 
because (4) they tend to be more idealistic 
in their expectations; (6) they are embarking 


ic scrutiny; or (c) recruitment criteria might 
be lower now than previously. Whether po- 
į licemen’s attitudes become more positive with 
time or those with the more negative attitudes 
drop out cannot be determined with these 
data. 


5. Place of residence. A sharp difference 
exists with regard to where the policemen in 
the two groups reside. This significant dif- 
_ ference is expected due to the direct relation- 
ship between race and place of residence 
within the Washington, D.C. police force (see 
Kelly & West, 1972). Only 17% of the most 

negative officers live in the city, whereas 53% 
of the most positive ones reside in the city 
Which they serve. The effect of race on place 

of residence can be seen by the fact that of 
the 17 most positive respondents living in 


CONCLUSION 


The data indicate that an empirical ap- 
proach to identifying responsive/nonrespon- 
_ sive policemen is feasible. The fact that 12% 
of the original 509 policemen surveyed fell 
into one of the extreme groups indicates that 
the dimensions assessed by the four Guttman 
Scales are real. Further, the results from 
demographic analyses reveal that the identifi- 
cation of the most positive/negative police- 


Washington, D.C., 13 (77%) are black. Con- 


men by the Guttman scaling te 
with popular conceptions of wh 
measure of this dimension wou! 

Further research is needed to 
precise policy implications of t 
ings for police selection, The 
study provide only a crude e 
answer to the policy questi 
phrased: If responsive police: 
cruited, what are the chance 
blacks as compared to whites? ( 
responsive policemen, 44% wer 
District of Columbia Metropolit 
partment is already 38% black 
D. C., is a city of 72% black. I. 
if the District of Columbia Me 
lice Department were to choose 
use these scales only as the rec 
tion criteria, the supposed in 
sponsive policemen would not 
alter the black-white ratio of th: 
ously, such a decision could ne 
in any case, since the more stan 
for selection are needed as well 
gested in this study. 
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EQUAL UNPLEASANTNESS MATCHING OF MALODO! 8 
SUBSTANCES IN THE COMMUNITY * 
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Five different odorants, produced through combustion procedures of anim 


manure, were matched with hydrogen sulfide with regard to unpleasantness 


30 subjects. Power functions described the unpleasantness matchings for ñy 
combustion toilets tested. The exponents of the equal sensation functior 
indicated a relative invariance of the various combustion procedures on t 
exponent. Different combustion procedures resulted in different levels of od 
unpleasantness, but these differences were small from a practical point of view 
The experiment demonstrated the practical applicability of psychophysic:! 
methods in environmental health research. 


Pollution of the atmosphere is a major 
concern of all nations. In particular, the gen- 
eral environmental problem of unpleasant 
odors is of increasing concern. It appears 
that information regarding the perceived in- 
tensity and unpleasantness of odor-producing 
stimuli may assist environmental planners 
in their efforts to reduce offensive pollutants. 

Certain psychophysical methods should 
have application to this problem. Thus, cross- 
modality matching of the subjective intensity 
of stimuli has been carried out for several 
pairs of sense modalities. For example, force 
of handgrip has been used as a matching 
continuum for equating the perceived inten- 
sity of different sensory continua, such as 
‘white noise, white light, warmth, and cold 
(Stevens, Mack, & Stevens, 1960; Stevens & 
Stevens, 1960). In connection with olfaction 
studies, finger span has been used as the 
matching continuum by Berglund, Berglund, 
and Lindvall (1971), and Ekman, Berglund, 
Berglund, and Lindvall (1967). Thus, the 
usefulness of cross-modality matching in de- 
veloping “objective” yardsticks of subjective 
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sensory impressions has been d: rated. 

A similar method used is th intra- 
modality matching” which is a ate in 
developing matching scales wit! sense 
modality. In this technique subject 
matches the subjective intens some 
other attribute of, for example, ifferent 
auditory frequencies or two differe: orants. 
That is, in cross-modality ma ig the 
matching is performed betwee: ntinua, 
while in intramodality matching the .atching 
is between two qualities within one -odality. 


While intramodality intensity mat: ing has 
been shown to be a reliable and valid method 
for vision and audition, because o° ‘echnical 
difficulties, it has been applied oniy occasion- 
ally to the development of scales oi odor in- 
tensity (Beck, Kruger, & Calabreisi, 1954; 


Cain, 1969; Kruger, Feldzamen, & Miles, 


1955). However, suitable olfactometers are 
now available for this purpose (e.g, Lind- 
vall, 1970). 

The present study is concerned with the 
evaluation of unpleasantness of malodorous 
effluents similar to those emitted from com- 
bustion toilets in recreation areas. These 
effluents are of growing importance as en- 
vironmental nuisances. However, existing 
United States’ guidelines for maximum pêr- 
missible odor discharge from gas-fired incin- 
erators (USA Standard, 1966) are considered 
unreliable and not applicable to the problem. 

Specifically, this study (a) develops and 
evaluates a new measurement technique for 
using psychophysical methods and (2) 1- 
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vestigates a functional relationship 
between pl | continua found in cross- 
modality 4 ramodality matching of per- 
ceived inte can be generalized to intra- 
modality ing of olfactory unpleasant- 
ness. 
METHOD 
Derivati he Functional Relationship 
Assumir the power function, Y u , 
adequate! s the relation between perceptual 
(¥) and ($) continua for the perceptual 
attribut: nsity and unpleasantness, we may 
write thc ing for two different continua, ne- 
glecting 87 
vi = oi” 
y: = or. 
In c ality as well as in intramodality 
matchir ubjects are instructed to equate ** 


ya rent stimulation levels. That is, the 


equal s function will have the form: 

oi" . uJ 

Solvin „and o in Equation 1, we obtain: 
= ps, where bis = n/m (2) 

= gst, where bu = m/n B] 

Equatic. 2 is the predicted function that has 
been exo :imentally obtained by cross-modality 
matching of the subjective intensity of stimuli on 
two different continua for several pairs of modali- 
ties (e.g., Stevens, 1960). This equation has also 


been verified experimentally for auditory signals 
(Stevens, 1966), for heterochromatic matching of 
saturation to saturation (Indow & Stevens, 1966); 
for vibrotactile stimuli (Stevens, 1968), and for 
matching of linear aliphatic alcohols (Cain, 1969). 
It is predicted that this equation will ize to 
intramodality matching of olfactory unpleasantness. 


Experimental Equipment 


The laboratory used in the present study was @ 
mobile unit employed in the sensory evaluation of 
odorous air pollutants. The olfactometer was based 
on the use of a well-defined dynamic flow of gas 
that permitted stable exposure concentrations, rapid 

anges in concentration, and fairly natural respira- 
tory conditions (Lindvall, 1970). The equipment 
consisted essentially of an air-conditioned test 
chamber in which two teflon-coated exposure h 
had been installed. The odorant gas was mixed with 
Purified air in fixed dilutions and added to the flow 
of air through the hoods by means of electromag- 
hetically operated steel and teflon cannulae of vary- 
ing bore. The unit had a special waiting room for 
the subjects with ventilation comparable to the 
actual test procedure in order to provide optimal 
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adaptation to the experimental situation. By wig 
suitable filter equipment (active charcoal Silters 
particle filters) it was posible to carry out evalua 
tions with minimum contamination of the labora: 
tory air. i 
Each odor hood received a continuous flow 
100 liters per minute of purified air co 
to a rate of approximately I meters per second, 
signal lamp indicated the beginning and the end 
a presentation. The responie panel consisted of an 
adjuster that could be rotated in 13 discrete steps, 


Stimuli 


Five types of odorous gases were 
through combustion of animal manure. The di 
odorants were obtained from three electrical 
bustion units (A, D, and E), one gas-fed 
(B), and one oil-fed combustion unit (C). 

The manure used was first and 
split into samples of about 35 liter, All samples 
immediately deep frozen and kept at % 
24 hours before we in the combustion toilets. 
dry weight of the manure was 11.9%; the 
of total sulphur was . 79% of the dry weight. 2 


the manure was burnt in 


ried out for all the odorous gases produced (for a 
description of the methods, see Lindvall, 1970). The 
absolute thresholds obtained were approximately 
the same for all combustion processes investigated 
(Table 1). 

Each of the five combustion odors was diluted 
with purified and conditioned air into four steps of 
physical intensity in a geometrical series. Concen= 
tration valucs are as arithmetic means of 
dilution factors in log parts per million (volume 
per volume). Stimuli are expressed as the differences: 
between stimulus concentration values and threshold 
concentration values. Table 1 presents these stimulus 
values as well as the absolute threshold values 
obtained for each combustion unit. It should be 
noted that the stimulus values are oi lower magni- 
tude than the threshold values simply because the 
former are calculated as a difference between 
the stimulus and threshold concentration values. 


Standard Odorant ] 


be used as a 
the acceptability of the complex malodorous cfu- 
ents. To obtain the 
different air pollutants, the standard odorant has t 
be scaled; for example, by magnitude estimation 
Scale 
order to find the psychological scales for differen 
air pollutants. 

Hydrogen sulfide was chosen as the matching ga 
because of its practical applicability as a possib] 
index substance in field studies as well as its ur 
pleasant smell. The hydrogen sulfide (Matheso! 
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TABLE 1 


ABSOLUTE THRESHOLD CONCENTRATIONS AND RELA- 
TIVE STIMULUS CONCENTRATIONS FOR THE Five 


ODORANTS 
Threshold 
Combustion procedure | concen- Stimulus 
tration | 
Electrical combustion 
unit (A) 1.90 19 6S | 1.14) 1.09 
Garfed furnace (B) 2.29 aa 721.24 182 
Oiled combustion 
unit (C) 2.02 18 71 | 18) 173 
Electrical combustion 
unit (D) 2.43 AD | 6S | 1.21 | 1.76 
Electrical combustion 
ünit (E) 2.51 19 | 59 | 1.18 | 1.73 


technical quality) was diluted into 13 steps of 
physical intensity in a geometric progression. 


Chemical Analysis 
Analysis of the concentration of hydrogen sulfide 


exposure concentrations in the hood itself could 
d not be analyzed due to the relative intensitivity of 
chemical methods available. Samples were therefore 
taken via a teflon tube immediately before the 
capillaries delivered the appropriate dosage. The 
3 concentration of hydrogen sulfide was estimated by 
direct measurement of the black colored part of the 
gel tube. Its length was compared with a standard 
curve based on the desorption of the compound, as 
obtained by a wet chemical analysis (Jacobs, 
Braverman, & Hochheiser, 1957). 
. The composition and concentration of the gas is 
unknown and cannot be easily chemically analyzed. 
‘Therefore the combustion odors were simply con- 
trolled by the dilution system of the olfactometer, 
chat is, by known flows of odorous gases into the 
main air flow. 


Subjects 


Thirty female subjects, aged 20-47, participated in 
. the experiment. They were housewives and students 
without any previous experience in olfaction experi- 
ments. 


TADDA eee 


Procedure 


The method of intramodality magnitude matching 
was used. In the left exposure hood the subject 
was exposed to fixed concentrations of the com- 
bustion odors, one at a time, and in the right hood 
to various concentrations of hydrogen sulfide ad- 

justable in discrete steps by the subject. By rotating 

the adjuster, different calibrated steel and teflon 
cannulae were operated in or out of the main air 
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stream to the exposure hood. Tt was asked 
to operate the adjuster until sh < different 
odors in the two hoods sub; qual with 
regard to perceived unpleasantr 

In each session the subject n hydrogen 
sulfide to the four different conce s of the 
combustion gases. Consequently, of these 
four concentrations could be v different 
ways. These possible permuta rotated 
randomly over the sessions of th nt. The 
experiment lasted for nine day subjects 
participating in 20-minute sessions irregular 
order. All odors were judged onc ry each 
subject. 

In order to control adaptation ar temporal 
effects, as well as position effect matchings 
were performed in the following w The sub- 
jects had to wait for at least | in the 
air-conditioned waiting room s king or 
eating. (5) Each subject was ca» it tructed 
and trained in her specific task in the hamber. 
(c) The subject had to take just in each 
hood: first in the hood where the n gases 
were presented, and then, immedis exhal- 
ing, one sniff in the matching hood did not 
perceive the odors equally unpleas had to 
adjust the hydrogen sulfide concent: and wait 
for 15 seconds until a light signal i ted stable 
concentrations in the hoods. Then shc i to make 
another comparison, smelling in the is in the 
same order as above. This cycle was ted until 


she found the specific concentration the com- 
bustion gas and the adjusted hyc sulfide 
concentration to be subjectively equai h respect 
to unpleasantness. Her decision was to the 
experimenter through a signal system, and the 
required concentration of hydrogen sulfide was 


registered. 


RESULTS AND Discussion 


The results from the 30 subjects were 
averaged (arithmetic means) into group data. 
The five equal sensation functions of un- 
pleasantness are plotted in Figure 1. Concen- 
trations of the matching gas (hydrogen sul- 
fide) are read on the ordinate and relative 
concentrations of the combustion gases on the 
abscissa. Absolute combustion gas concentra- 
tions can be calculated from Table 1. The 
hydrogen sulfide concentrations are expressed 
as log parts per billion and the combustion 
gas concentrations in relative log units (dilu- 
tion factors). From Figure 1, it is possible to 
determine the physical value of the intensity 
of hydrogen sulfide which has the same 
degree of unpleasantness as a given physical 
intensity of a certain combustion gas. A line 
is drawn, parallel to the ordinate, from the 
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issa representing the com- 
ction until it intersects the 
equal senss inction of the specific com- 
bustion ga en, another line is drawn, 
parallel to scissa, from the intersection 
point to t! nate, The ordinate intersec- 
tion valuc concentration of hydrogen 
sulfide w} considered as unpleasant as 
the conc n of the combustion gas. 
| Every p the diagram represents the 
average í ‘pservations except for com- 


point on il 
bastion con 


bustion ; In this case, each point is 
based on observations, owing to tech- 
nical pr 

From e 1, it is evident that the trend 
of the ensation plots can be repre- 
sented a imately by straight lines in a 
log-log | lowever, the standard deviation 
of each joint is considerable. The ap- 
parent | iy in log-log plots indicates an 
equal sc n function of the basic form 
presented » Equation 2. 

In th ent context the following form 
of the sensation function was judged 
most de ive. 

Aye Seve 
Se La: =), 05 


where S, is hydrogen sulfide concentration 
in parts per billion, and Spm the co i 

absolute threshold in the same unit.“ S. is 
combustion gas concentration in parts per 
million, and S, is the corresponding threshold 
in the same unit, With the chosen form of the 
equal sensation function the multiplicative 
constant @ is independent of unit of measure- 
ment for the reference and matching stimuli. 

In logarithmic form Equation 3 reads: 


log Sm — log Som 
= log a + b (log S. — log S.). [5] 


The reference stimuli used in the experiment, 
log S, — log S,,, can be found in Table 1 and 
the exponent b and the multiplicative con- 

stant a in Table 2. The obtained exponents 

are of the same order of magnitude for all 

— — 

The absolute threshold for hydrogen sulfide, i i 
log parts per billion, was estimated in an earlier 
experiment using the same equipment (Lindvall, 
unpublished material). 


the combustion gases studied, that is, 
proximately .5. 
The results showed that to obtain a ch 


in perceived unpleasantness of the same c 
change in log physical concentration It 
The form of the equal sensation “a 
studies both within and between modalities 


der of magnitude for both hydrogen d 
and any one of the combustion gases, . 
drogen sulfide had to be 6 1. 
change of any combustion gas (0 ~ 
a 1 function tor 
the attribute of unpleasantness within ollac. 
tion is in accordance with earlier matching 
(Stevens, 1960; Stevens, 1966; Stevens, 1968 
Indow & Stevens, 1966). 


TABLE 2 


EXPONENTS AND MULTIPLICATIVE CONSTANTS 
or Pow] ] ß FUNCTIONS 


Electrical combustion 
unit (A) 

Gas-fed furnace (B) 

Oil-fed combustion 


unit (C) 
Electrical combustion 
unit (D) 
Electrical combustion 
unit (E) 


In addition, the results clearly showed that 
the effect of the combustion units differ in 
_ that the same unpleasantness was caused by 
different concentrations for the different com- 
_ bustion units. For example, an unpleasantness 
_ percept corresponding to 1.49 log parts per 
_ billion hydrogen sulfide was caused by the 
© concentrations: 2.13, 3.29, 2.93, 3.34, 4.61, 
measured in log parts per million, from the 
. combustion gases A, B, C, D, and E, respec- 
tively. Although the investigation showed 
that combustion unit E was slightly superior 
to the other combustion units, the differences 
were so small that they could be ignored for 
practical purposes. It may therefore be stated 
that all investigated units were perceived 
equally unpleasant on liminal as well as 
supraliminal levels. 

In the present context it is important to 
consider the subjective range of the attribute 
used. as illustrated by Engen and McBurney 
(4963). They showed that the subjective 
_ tange for pleasantness of odors obtained with 
_ the method of magnitude estimation far ex- 

ceeded the range of subjective intensity for 

different odorant concentrations estimated by 

the same method. In this study, the attribute 

unpleasantness has been used as a dimension 
on a quantitative rather than a qualitative 
continuum. Naturally, a possible standard 
odorant for general use in environmental 
_health studies has to be scaled for different 
common attributes, for example, intensity and 
‘unpleasantness. Due consideration must be 
given to the choice of attribute in scaling 
experiments. This was convincingly shown in 
a study on aircraft noise where subjects 
were found to be able to differentiate and 
quantify various psychological aspects of 
noise, such as loudness, noisiness and annoy- 
_ ance (Berglund, Berglund, & Lindvall, 1974), 
Once these dimensional problems are solved, 
" matchings of malodorous pollutants to a 
Standard odor will give the psychological 
Scales for the pollutants simply by scale 
_ transformations, 


Need jor a Standard Odor 
E. From the point of view of basic research, 
it is desirable to make the relative values 


obtained in olfactory research comparable 
from one laboratory to another by including 
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a standard odorant in all scaling 
A similar view was expressed by 
who also advocated validation oí 
in the equal sensation function | 
the same experimental setting v he same 
standard odor and with other s being 
scaled individually and matched urs. 


eriments 
n (1971) 
exponent 
e use of 


Undeniably, there are a lot of | psycho- 
logical problems to be solved be me has 
achieved a valid and reliable rela veasure 
of unpleasantness against a stand Jorant. 
However, the present experiment ws the 
applicability of the method of er +) sensa- 
tion matching in environmental he studies 
of malodors. In spite of the co xity of 
the malodorous gases and the sim; hysical 
measures used for both the standa as and 
the combustion gases, the function: elation- 
ships obtained are simple and con t with 
earlier findings in the area. l 

To be of general value the psy hysical 
method presented in this study ha: e vali- 
dated in a large-scale experiment several 
odors, each being used as the fixed. | varia- 
ble gas. For use in field studies, fi ample, 
when comparing different malodorou rission 
sources, the standard odor used for itching 
must be highly reproducible technic: y and 
must have a wide psychological ge. In 


addition, it would be desirable if this standard 
yielded a simple relation between attributes 
such as unpleasantness and intensity. Hydro- 
gen sulfide can be suggested as such a stan- 
dard odor. 


Practical Implications 


The results of the present investigation 
demonstrated the practical applicability of 
unpleasantness matching in environmental 
psychology. It was thus shown that unpleas- 
antness evoked by malodorous combustion 
gases of animal manure was independent of 
the combustion procedure tested. From a per- 
ceptual point of view the combustion toilets 
were judged equally efficient. The unpleasant- 
ness levels for undiluted combustion gases 
corresponded to approximately 1 part per 
million hydrogen sulfide which was perceived 
as rather unpleasant and comparable to the 
smell of rotten eggs. (The threshold for 
hydrogen sulfide was approximately 1/1,000 
parts per million.) 


} L UNPLEASANTNESS MATCHING OF MALODOROUS SUBSTANCES 


In additio ith the knowledge of the 
parameters o psychophysical function in 
connection \ atmospheric dispersion cal- 
culations, it assessed that houses with 
such toilets creation areas must be at 
least 200 m part in order to reduce the 
detectabilit; the odor to an acceptable 
degree. 
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EVALUATION OF A DECELERATION WARNING LIGI 


r 


FOR REDUCING REAR-END AUTOMOBILE COLLISIO 


JOHN VOEVODSKY 
Portola Valley, California 
An amber light, center mounted on the rear of several hundred San Francisco 


taxicabs, was designed to communicate information about the taxi’s decelera- 
tion to following drivers. The light, activated by the use of the brake pedal, 


; was pulsed in a controlled fashion at a rate, duty cycle, and intensity that 
varied exponentially with a component of deceleration. A comparison of an 
i experimental group (with the light for 12.3 million miles) with a control group 
(without the light for 7.2 million miles) revealed the light prevented 5.4 
collisions, 1.02 cab driver injuries, and $643 of taxicab damage per million miles. 


_ The possibility of designing automotive 
equipment to prevent automobile collisions 
has been greatly overshadowed by attempts 
to design automotive equipment to protect 

Occupants during a crash (Bowen, 1972). 
Prevention of a significant number of rear- 
end collisions may be possible by the use of 
a light system warning other drivers that the 
car ahead is decelerating. The theoretical basis 
for the design of such a deceleration warning 
licht system was published earlier (Voevod- 
sky, 1967). The present study is an evalua- 
tion of the collision prevention effectiveness 
of a deceleration warning light under normal 


driving conditions. 
METHOD 


"The deceleration warning light system designed for 
this study was an amber warning light, center 
mounted on the rear of a vehicle at the same height 
existing stop lights, which communicated a com- 
i Ponent of deceleration initiated by the driver of 
that vehicle to the driver of a following vehicle. 
A device for Measuring deceleration, essentially an 
"inertial pendulum, hence independent of any mechan- 
_ ical adjustments of the car such as brake pedal posi- 
tion, was rigidly attached to the vehicle. The warn- 
ins light, activated by use of the brake pedal, was 
in a controlled fashion at a rate that varied 
exponentially with the component of deceleration. A 
50% increase in frequency of the light pulses from 
10 pulse per second to 7.6 pulses per second for 
Te -1 gravity force increase in deceleration from 
to .5 gravity provides the nential tionship, 

O i y pi expo: relationship 


N 


Request for reprints should be sent to John 
Voevodsky, 177 Goya Road, Portola Valley, Cali- 
fornia 94025. 

* This report is not concerned with the engineering 


Problems encountered while developing the equip- 
ment. 


q 


Subjects 

A commercial taxicab company, to b ed to 
as the Cab Company, participated in xperi- 
ment at no cost to itself other than : out of 
service for equipping the cabs and ope: them 
so equipped.“ The Cab Company was ask: make 
their San Francisco fleet of 503 cabs av. -ble for 
two reasons: (a) The San Francisco {i travels 
almost 2 million miles per month, thereb) ling a 
testing of the equipment over significant ge in 
a relatively short time period. (b) The Com- 
pany’s complete records of cab accidents avail- 
able for the five years prior to 1972, thus iding 
a statistical base from which to evaluate tne test 
data. 
Procedure 

On March 1, 1972, 343 taxis began operation with 


deceleration warning lights. The remaining 160 taxis 
were to be traded in on new vehicles and therefore 
were not equipped, but served as a concurrent 
control group. The new vehicles were equipped 
when delivered to the Cab Company. Later, between 
June 1 and August 28, 1972, the deceleration warn- 
ing lights were removed from previously equipped 
cabs in order to equalize the mileage driven by the 
equipped and nonequipped cabs. This approximately 
equal monthly mileage was continued through the 
end of the experiment, February 1, 1973. The changes 
in the proportions of cabs equipped or unequipped 
Were corrected by expressing all collision rates as 
collisions per million miles driven. ` 
The dispatcher assigned taxi drivers to equipped 
or unequipped taxis at his discretion, disregarding 
any preferences expressed by the drivers. j 
No massive education program on the operation 
and function of the deceleration warning light 
system was undertaken prior to the experiment or 


* Acknowledgment is made of the cooperation of 
the Yellow Cab Company of San Francisco for 
their participation and for making their accident rec- 
ords available. 
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1ge@ collision rate was 8.91 
‘sions per million miles based 


2 million miles, 
Jan 1973 


Total rear-end collisions from Mar 1, 1972 to Feb. 1, 1973 


o 1 > ˙— 
Total mileage driv 


ATION WARNING Liehr For Repucino ReEAR-Enp CoLLISIONS 


Feb.) 
out deceleration warning lights, qe 


7 Average collision rote is 7.9 

collisions per million miles 
based on 125 million miles 
from Jon. 1, 1967 to Jon1,1972 


With deceleration warning lights, 
average collision rate wos 3.5! collisions 
per million miles based on 12.3 million miles. 


ti 8 9 


en in millions from Mar. 1, 1972 to Feb. 1, 1973 


Ficure 1. Rear-end collisions of cabs equipped and not equipped with deceleration warning light. 


“ae the experiment. Limited news coverage at 
start of the test provided the only source of 
Public information. 


RESULTS 


Aue the end of the reporting period, January 
31, 1973, the vehicles equipped with the 
on warning lights had traveled a 
et 12.3 million miles with a rear-end 

Mision rate of 3.51 collisions per million 
miles (Figure 1).4 This is a 60.6% reduction 
e a 


$ Ass months appear at different points above the 
in th 8 for equipped and nonequipped cabs because 
ea earlier months the total mileage driven by 
Mees cabs was greater than that of the 
‘quipped cabs. 


in the rear-end collision rate from the con- 
current control group which had 8.91 colli- 
sions per million miles over a total of 7.2 
million miles. The rate for the concurrent con- 
trol group is almost identical to the 1971 rate 
of 8.89 over 21.5 million miles, and is near 


the mean rate of 7.9 for the five years prior 


to January 1, 1972, computed over 125 mil- 
lion miles from the Cab Company records. 
The 10 months for which comparative data 
are available can be taken as 10 independent 
experiments. The month of April 1972 was 
excluded from the analysis as there were no 
control data for this month. When the rear- 
end collisions are treated as collisions per 
million miles within each month, a paired ¢ 
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TABLE 1 
REAR END COLLISIONS, Drivers INJURED, AND CAB DAMAGE As A FUNCTION OF MILI R Cans 
EQUIPPED AND NOT EQUIPPED WITH DECELERATION WARNING LICHTS 
5 Rear-end . umage 
Mileage zollisions Drivers injured alt) 
Month/year f 2 
With Without With | Without With | Without Without 
March 1972 1,515,793 197,903 2 3 0 1 1.275 
April 1972 1,661,455 0 8 0 2 0 0 
May 1972 1,439,438 245,316 8 1 1 |, 0 0 
June 1972 987,941 711,234 9 6 1 | 1 227 
July 1972 959,977 757,510 2 8 0 0 1,408 
August 1972 1,085,868 826,081 2 7 0 0 425 
September 1972 921,800 855,596 4 7 0 1 765 
October 1972 1,038,207 1,002,130 0 9 0 2 2,124 
November 1972 876,549 844,301 4 7 3 1 189 
December 1972 875,895 835,591 4 6 0 4 | 853 
January 1973 899,105 908,877 3 10 1 2 | 211 
Totals 12,262,028 | 7,184,539 | 43 64 8 122 7.477 
Per Million | 
Miles 3.51 8.91 65 1.67 1,041 
test of the differences for equipped and non- rangements and behave different! cause of 
equipped cabs is significant ( = 4.06, df= the so-called Hawthorne effec: . Roethlis- 
8, p= .006).° berger & Dickson, 1939). This ci“ ‘sm does 
The rate of personal injury to taxicab not apply because the collisions due to 
drivers struck in the rear by other vehicles the drivers of the cars that folk ae taxis 
for the equipped vehicles was .65 per million and not to the taxi drivers. T! he taxi 
miles, or a 61.1% reduction from the 1.67 driver has not become more cautic = because 


rate of the concurrent control group (see 
Table 1). The costs of repairs to the cab 
vehicles struck in the rear by other vehicles 
declined from $1,041 per million miles for 
the concurrent control group to $398 per 
million miles for the deceleration warning 
light vehicles. This is a reduction of 61.8%. 
Damage figures for the vehicles that collided 
with taxis are not known. Because the Cab 
Company is not at fault these amounts are 
not included in their records. 


Discussion 


A common criticism of social experiments 
that involve innovative arrangements is that 
the operators are sensitive to the new ar- 


5The division of accident rates into units of 
months is somewhat arbitrary, but to produce a 
statistical test at all it was necessary to use the data 
available from the Cab Company. Mileage data are 
reported monthly, rather than by miles to accident, 
which would permit the use of Poisson statistics. 


of driving a taxi equipped with fety de- 
vice is attested by the lack of « in the 
collisions in which the taxicab drive ran into 
the rear-end of a car in front of him. The 
drivers of equipped taxis produced 7.1 such 
collisions per million miles, and those not 
equipped produced 8.1 such collisions per 
million miles. When this difference is tested 
statistically by the same procedure previously 
described, the difference is not significant 
(t= .65, df = 8, p=.52). 
The results of this 11-month, 19.5-million 
mile test of a deceleration warning light sys 
tem in natural driving conditions indicate 
that a significant reduction in rear-end col- 
lisions and injury can be achieved. During 
the experiment, the deceleration warning light 
prevented rear-end collisions to equippe 
vehicles by 5.4 collisions per million miles, 
reduced the number of cab drivers injure 
from such collisions by 1.02 injured Pel 
million miles, and reduced the cost of repairs 
to cab vehicles struck in the rear by $643 


A D: 


per million m 
taxi passeng- 
juries and pr 
vehicles whi 
With the U 
proximately 
(National Sa! 
reduction in 
damage fro: 
society is e: 
the Nation 
rear-end co 
per year. I 
savings of 
human ben: 
Hence, 
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‘These figures do not include 
juries and the personal in- 
vty damage prevented for the 
ın into the rear of the cabs. 
l States public driving ap- 
‘5,000 million miles each year 
v Council, 1971) the potential 
ersonal injury and property 
ir-end collisions for American 
nous. The same report from 
‘afety Council indicates that 


‘ons account for 2,000 deaths 
Hof these are avoidable, the 


10 lives per year would be a 


quantitative experiment also 
kind of “on the street” field 
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evaluation possible with the cooperation of 
public and private agencies.“ 


©The California Highway Patrol granted the 
permit to place the deceleration warning lights on the 
cabs for the experiment. This investigation could 
not have been possible without their approval. 
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EFFECTS OF DIFFERENT LEVELS OF INTEGRA] 
ON ADVERTISING PREFERENCE AND 
INTENTION TO PURCHASE 


GEORGE J. SZYBILLO! axo JACOB JACOBY 
Purdue University 


Ninety blacks and 90 whites evaluated six proposed advertisements in t 
of attractiveness and likelihood of purchase of the product advertised. 7 
advertisements differed in the degree of racial integration depicted, which w 
operationally defined by a S-point continuum ranging from 0 whites 
4 blacks (OW-4B) to 4 whites and 0 blacks (4W-0B). The hypothesis t 


young blacks react negatively to integrated ads depicting tokenism 
three whites to one black) and positively to ads depicting equality 


two whites to two blacks) was supported with cach of the dependent measu 


i 
4 Bauer and Cunningham (1970) have noted 


that “the ways in which blacks buy and use 

goods and services are simultaneously a prac- 

tical business concern and a social issue, with 
_ the two interacting with each other [p. 3].” 
The manufacturer wishing to effectively seg- 
ment this market and simultaneously maintain 
his white market may face a number of social 
problems and dilemmas related to such things 
as product design, brand perceptions, labeling, 
merchandising, and especially advertising. 
Wich respect to advertising, it might be 
asked whether black consumers and white 
consumers have different preferences for all 
black, all white, or integrated advertising? Is 
there a possibility of backlash from whites or, 
perhaps more interesting, from blacks view- 
ing integrated advertising? Does integrated 
advertising have any differential effect on the 
purchase intentions of blacks and whites? 
Moreover, what constitutes an “integrated” 
advertisement? Is it the appearance of only 
I black in a group of 30? 
Buy and large, evidence adduced thus far 
= (Barban, 1969; Barban & Cundiff, 1964; 
_ Barban & Grunbaum, 1965; Guest, 1970; 
Stafford, Birdwell, & Van Tassel, 1970; Tol- 
ley & Goett, 1971) suggests that white sub- 
_ jects display about the same degree of posi- 
K 


750 


The authors wish to thank Barry Barnes and 
Diane Saint for their assistance in collecting the 
A data for this experiment. 

= Requests for reprints should be sent to George J. 
 Szybillo, Graduate School of Business Administra- 
tion, New York University, 100 Trinity Place, New 
Vork, New York 10006. 


tive affect to integrated adv: as they 
do to all white ads (i.e, ad ning all 
white characters), whereas bla cts tend 
to prefer integrated or all bl; ertising 
over all white ads. Notable ex: ; to this 
generalization are Cagley ai ardozo’s 
(1970) finding that highly pre d white 
subjects reacted negatively t tegrated 
advertisements, and Gould, 8 nd, and 
Zoerner's (1970) finding that er (ie. 
14-29 years of age) blacks we egatively 
predisposed to the concept tegrated 
advertising. i 
While some controversy exis; regarding 
the use of black models in advertisements 
(Kassarjian, 1969: Wheatley, 1°71), Kas- 
sarjian (1971) contended that the black 


model is still only a token black integrated 
ads typically feature a single model smoking, 
drinking, modeling clothes, etc. together with 
a number of white models. Wall (1970) sug- 
gested that although the intent of the adver- 
tiser is to establish empathy, blacks respond 
primarily with a sense of loneliness and ali- 
enation and become uncomfortable. 

This raises the entire issue of just what 
constitutes an integrated advertisement- 
Boyenton (1965) defined integrated adver- 
tising as the inclusion of blacks in gene 
consumer advertising and indicated that the 
same aspect which characterizes blacks among 
the general population characterizes blacks in 
an advertisement, namely, “the dark face. 
However, such a definition does not reflect the 
fact that there can be varying degrees of inte- 
gration depicted in an ad. Conceivably, le. 
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ned advertisement will be 
ivertisement contains one 
ive white models versus 
ne white model and five 
| few studies which have ex- 
d th ı of blacks and whites to 
cl of integration have col- 
into a single level called 
out providing data indicat- 
lifferences across the differ- 
tegration (Barban, 1969; 
1970). Thus, none of the 
cor so far has explicitly incor- 
n n a single level of “integrated 
its experimental design. 
he present experiment was 
nine the reactions of young 
s to proposed advertisements 
1 erent ratios of black principal 
ers white principal characters. 
findings, it was hypothesized 
te respondents react equiva- 
terms of affect and purchase 
all levels of all white and 
ertising. Also, based on the 
e, particularly the findings of 
1970) and the writing of Wall 
hypothesized that young black 
cact negatively to an advertise- 
‘ racial composition suggests token- 
and favorably to advertisements 
composition depicts black-white equal- 


Since Gould et al. (1970) found negative 
fect directed toward the concept of “inte- 
ited advertising” only with blacks below 
® 29, this last hypothesis is most properly 
Sted using young blacks. Other reasons for 
sing college-age black subjects and white 
ibjects included the following: (a) Since 
tban’s early studies (Barban & Cundiff, 
64; Barban & Grunbaum, 1965) no re- 
orts have been published that compare 
black student and white student reactions to 

ated advertising. As social conditions 
ge, so too should there be a measurable 
t on people's perception of those 


& Scammon, 1973) 
Wthy of study in its own 


| ton Lever, Ap PTT TN axo Porcatass Iberroerions 


ness firms have become more concerned with 
how they can attract minority 
Integrated advertising does more thas 
sell the product; it can also give minority 
consumers impressions about the nature of 
manufacturer's hiring policies (Szybillo, 
coby, & Busato, 1975). 
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j received 
Leth of information regarding race of the models 
(which was omitted). 
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= Dependent Variables 

Two types of reactions were considered: (a) 
preference (the dependent variable most often used 
in earlier studies) and (b) purchase intentions. 
Preference was measured using three different 7- 
point bipolar scales. These scales were attractiveness 
of the advertisement (having endpoints labeled 
_ “unattractive” and attractive“), the personal ap- 
“sate the advertisement (having endpoints labeled 
. not appeal to me” and “does appeal to me”), 
the overall appeal of the advertisement (having 
ints labeled “bad” and “good”). Likelihood of 
purchase was measured using a 7-point bipolar scale 
(having endpoints labeled “very unlikely to pur- 
thase” and “very likely to purchase”). The positive 
ends of these scales were alternated to reduce 
response bias. In this report, high numbers repre- 
sent the positive end of the scales. 
Each subject completed each of the dependent 
measures for proposed ads for each of the afore- 
mentioned products. The order of the advertisements 
was randomized in each subject’s questionnaire book- 


The white subjects and black subjects received 
questionnaires in separate rooms. A white proctor 
_ administered the questionnaire to white respondents, 
and a black proctor administered the questionnaire 
to black respondents. The respective proctor re- 
mained with the subjects at all times to ensure that 
subjects understood directions and did not com- 
unicate with each other. 
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Racial Ratio of Advertisement 


Ficure 1, Race of Respondent X Racial Ratio of Advertisement Interac: 
for Analysis of Advertisement Attractiveness. 


Subjects were instructed to ans 
in their questionnaires. On com 
received a postexperimental quest 
their perceptions of the intent c 
manipulation check for the racial rat 
also included. Before the subjects t 
session they were debriefed as to the ven 
of the experiment. 


RESULTS 


The subjects’ responses to the manip 
check item on the postexperimental que 


TABLE 1 


Source af 
MS F 2 
Between subjects 
Race of respondent 1248 -asir 
Racial ratio of EEA, 
advertisement (B) 4 | 143.90 49.72 
x 4| 86.07 | 29.74 
Fi se ES 140| 2.89 
ithin subjects 
Product (©) 5| 23.89 14.89% 
AXC 5| 9.47 | 610 
AXC 20| 2.59 1.67* 
AXBXC 20| 2.17 1.40 
Error (within) 700 1.55 


for Analysis of 


naire in: d that all subjects were aware 
of the ı ratio which appeared for each 
of their p: -luct advertisements. Responses to 


the item ing for the presence of demand 


character s indicated that no subject 
guessed thc purpose of the study was to de- 
termine the effect of different racial ratios on 


their responses. It appears that providing 
tach subject with only one racial ratio across 
the six product advertisements was effective 
in reducing demand characteristics. 

Table 1 presents the F ratios and associated 
probability values obtained from analyses of 
variance for the dependent measures. Race of 
respondent had a significant effect for each 
analysis. In addition, the main effects for 
racial ratio of advertisement and products 
were also significant for both analyses. How- 
ever these main effects should be considered 
aie respect to the significant interactions. 

mportantly, in terms of the purpose of this 
experiment, the interaction between the race 
a respondent factor and the racial ratio of 
F vertisement factor was significant for both 
n measures. The significant Race of 
a X Racial Ratio of Advertisement 
a eractions for attractiveness and likelihood 

purchase are depicted in Figures 1 and 2, 

respectively, 


nox LeveL, Ap Prerexence Aano PurcHasn Ixrexnoxs 


cure 2. Race of Respondent X Racial Ratio of Advertisment Interaction 
Purchase. 


of 


Consideration of the control condition data 
(cf., Figures 1 and 2) for the black subjects 
and white subjects provides interesting per- 
spective. Across both dependent measures, 
there was virtually no difference between the 
ratings assigned by the whites to the all white 
and control ads, but there were considerable 
differences for the black subjects, who rated 
the all white ads substantially lower. This 
probably occurred because, in the absence 
of any information regarding racial composi- 
tion of the ad models, most people (blacks 
as well as whites) would assume that the 
characters were all white. However, making 
this fact (zero blacks to four whites) salient 
for blacks served to lower attractiveness and 
intention to purchase ratings. 

A Newman-Keuls analysis of the Race o 
Respondent X Racial Ratio of Advertisement 
interaction for the attractiveness of advertise- 
ment measure indicated that: (a) blacks did 
not rate the conditions of four blacks to zero 
whites or three blacks to one white as being 
more attractive than did whites; (b) blacks” 
rated the condition of two blacks to ; 
whites as more attractive than did whites 
(p< .01); and (c) blacks rated the condi- 
tions of one black to three whites and zero 


- blacks to four whites as less attractive than 
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did whites (respectively, Pp < .05, p < .01). 

Analysis of the black subjects’ mean rat- 
ings of the levels of the racial ratio of adver- 
tisement factor indicated that: (a) black sub- 
jects found the condition of two blacks to two 
whites to be more attractive than any other 
racial ratio (p < .01); (b) blacks found the 
condition of three blacks to one white or the 
all black model condition to be more attrac- 
tive than either the conditions of one black to 
three whites or zero blacks to four whites (? 
< .01); and (c) blacks found the condition 
of one black to three whites more attractive 
than the condition of zero blacks to four 
white models (p < .01). Analysis of the white 
subjects’ mean ratings of the levels of the 
racial ratio of advertisement factor revealed 
no significant differences. 

A Newman-Keuls analysis of the Race of 
Respondent X Racial Ratio of Advertisement 
interaction for the measure of likelihood of 
purchase indicated that: (2) blacks, in com- 
parison to whites, did not assign greater Like- 
lihood of Purchase ratings to the condition of 

four blacks to zero whites or the condition of 

three blacks to one white; (b) black subjects 

did assign greater likelihood of purchase rat- 

ings to the condition of two blacks to two 
whites than did white subjects (p < .05); 
and (c) blacks had lower likelihood of pur- 
chase ratings than did whites when viewing 
either the conditions of one black to three 
whites or zero blacks to four whites (respec- 

tively, p < .05, p < 01). 

Analysis of the black subjects’ mean rat- 
ings of the levels of the racial ratio of ad- 
vertisement factor indicated that the results 
obtained for likelihood of purchase were 

identical to those obtained from the analysis 
of attractiveness of advertisement ratings. 
Analysis of the white subjects’ mean ratings 
of the levels of the racial ratio of advertise- 
ment factor revealed no significant differ- 
ences. 

A Newman-Keuls analysis of the Race of 
Respondent X Products interaction for the 
attractiveness measure indicated that: (a) 
whites assigned greater attractiveness ratings 
than did the blacks to Irish whisky, dress 
shirt, and attaché case (respectively, p < .01, 
p< .01, and 5 .05) and (b) white sub- 
jects’ ratings of attractiveness did not differ 
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from those of black subj umbrella, 
deodorant, and underwear sis of the 
black subjects’ mean rating ducts indi- 
cated that the dress shirt ement was 
assigned greater attractive: ings than 
were the Irish whisky and ear adver- 
tisements (p< .01), Th subjects 
assigned greater attractiven ings to the 
dress shirt advertisement ti the Trish 
whisky, underwear, umbrell: deodorant 
advertisements (p < .01). 

An analysis of the Race o vondents X 
Products interaction for lihood of 
purchase measure indicated vhites were 
more likely to purchase iisky than 
were blacks (p < .05). Tix k subjects 
indicated greater likelihood irchase for 
the dress shirt advertisem« in for the 
Trish whisky advertisement 01), and 
white subjects indicated gre: kelihood of 
purhcase for the dress shi ertisement 
than for the umbrella adve zent or the 
deodorant advertisement ( ye 

Analysis of the Racial Rae Advertise- 
ment X Products interaction the attrac- 
tiveness measure indicated regardless 
of product advertisement, th tio of two 
blacks to two whites received sher attrac- 
tiveness ratings than did eithe: ‘he condition 
of three whites to one black ov ‘he condition 
of zero blacks to four whites (# 05). The 
ratio of two blacks to two \ also Te 


ceived higher attractiveness 
the condition of four blacks 
for the dress shirt, umbrella, de 
underwear advertisements (% < .05). For two 
of the racial ratios (i.e, two blacks to two 
whites, and three blacks to one white) the 
dress shirt advertisement was rated as mor 
attractive than any other product advertise- 
ment (p < .01). A similar product trend wes 
observed in the other racial ratio conditions. 


Discussion 


As predicted, young blacks rated an al 
tisement suggesting equality (two blacks i 
two whites) more favorably than they did a 
advertisement implying tokenism, and 1 
even found the former advertisement me 
attractive than did white subjects. Whites, i x 
the other hand, rated the advertisement E 
dicative of black tokenism (one black to '! 
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ttractive than did blacks. 
the reverse of that found 
indiff (1964) who used a 


whites) as m 
This last findi: 
by Barban ar 


comparably ated advertisement (i.e., 
one black to whites). 

Both black ects and white subjects of 
this study s the closest agreement in 
assigning at iveness ratings to the all 
black model ition, less agreement on the 
integrated ons, and the least agree- 
ment on the yhite condition. In contrast, 
Barban (194 und that black subject and 
white subjec. \2reement was greatest for all 
white ads, lc for integrated ads, and least 
80 for all b ads. The results obtained in 
this study be a function of (a) cultural 
changes ove: ‘he years, (b) the change in 
characterist: of the subject populations of 
the two stu (Barban’s subjects were not 
college sti s, but were middle-income 
blacks and es.), or (c) the different type 
of advertis stimuli used in the studies. 


Similar to er findings of Barban (1969), 
blacks in t+- present study found integrated 
or all black advertisements to be more at- 
tractive than all white advertisements. The 


results are in accord with the previous find- 
ings of Stafford et al. (1970), Guest (1970), 
and Tolley and Goett (1971); no white back- 
lash, as sured by attractiveness, was 


found for any of the racial ratios depicting 
Integration. 
_ The results of the analysis of variance on 
likelihood of purchase are interesting. Black 
subjects showed greater likelihood of pur- 
chase than did whites for the ads of three 
blacks to one white and two blacks to two 
whites. However, they showed less likelihood 
of purchase than did whites when viewing 
the advertisements of one black to three 
whites or the all white model advertisements. 
Blacks showed greater likelihood of purchase 
to all black model ads or any form of inte- 
grated ad in comparison to the all white model 
a Whites, surprisingly, showed greater 
kelihood of purchase when the advertise- 
ment was either all black, all white, or equally 
integrated than when there were three white 
models and one black model in the ad. This 
may indicate that college age whites also 


react negatively to tokenism in advertise- 
Ments. 
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The results presented in this study are 
based on descriptions of “proposed” adver- 
tisements. In order to enhance the internal 
validity (Campbell & Stanley, 1963) of the 
study, confounding factors such as position 
of the models, attractiveness of the models, 
and roles of the models were not included in 
the ad descriptions. These factors may effect 
the impact of racial ratio when judging the 
external validity of these results; however, 
this remains to be empirically verified. Addi- 
tionally further research is necessary with 
different black subject and white subject 
populations. A 

Probably the most significant finding of 
this investigation is that different levels of 
integration are capable of generating different 
responses from members of the same racial 
group. Since integration does vary along a 
continuum, subsequent research in the area 
should either employ different levels of racial 
composition of an ad as a factor in the ex- 
perimental design or, at very least, take cog- 
nizance of this fact. 
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en telephone surveys, made at two-month interval to a national prob 
sample ol 1,200 houschokis who were wiers of instaat color, Wed 
zate measures of brand awareness, altitudes towards brands, aad adver: 
exposure. These measures and an aggregate purchase behavior e 
used to estimate three multiple mgemion equations that formed a 
imic recursive model of the relationships between the monennen. Adver- 
influenced both awareness and attitude, while awamons ssd ttt 
wonced behavior. The influence of advertising went directly from swamsns 
havior, not through attitude, Thus, the bherarchy model of communis. 
effects was found to be only partially correct. Attitude change preorded 
or change, although a les significant reverse effect aho emerged 
One most durable and widely sive model displayed below presents a model 
adopted : of the communication process that is comprised of a system of three inter- 
is the m n hierarchy of effects that related equations. The dependent variables 
postulate isal flow from cognitive change are the three hierarchy variables of interest: 
fo attitude ange to behavioral change. In the brand awareness (cognition), brand attitude 
marketin; ‘ext this model hypothesizes (attitude), and brand inerket 3 
that adve: n has its most direct and dis- ior). The independent variables in each equa- 
cernable on cognitions, a lesser effect on tion are possible causal factors. One sich 
Attitudes, influences behavior primarily causal factor is contemporancous advertising 
through ii- pact on cognitions and attitudes. exposure. The expectation is that advertising 
(ridge. Steiner, 1961; McGuire, 1968) will be significant in the awareness equation 
This form: lation has been sharply attacked but not so in the market share equation. This 
on two grounds: First, it is argued that all expectation is not included as a prespecified 
individuals o not necessarily proceed through constraint, but rather exists as a hypothesis 
tach stage. Second, the direction of the flow to be empirically supported or questioned. 
has been challenged—in particular, there may The remaining causal factors in each equation 
be situations in which attitude change follows are the three hierarchy variables lagged one 
behavioral change (Kiesler, Collins, & Miller, time period. Because the system e 
1970; Krugman, 1965). tions does not involve a nonlagged dependent 
The purpose of this article is to examine variable among the independent variables the 
the communication process, using a large model is termed recursive. 
scale field test, with emphasis on the applica- An index of prices was available for three of 
bility of the hierarchy model and the influ- the brands. It was treated as a caual abt 75 
ence of advertising on the direction and rate * 4 variable is not 
of Movement through the different stages Of ported in this study- 3 
the hierarchy. A distinctive feature of this 3 Technically, a recursive model of system is one 
test is the use of a model, involving time lags, in which the coefficient matrix of the nonlagged 


Which overcomes many of the causal inference 

3 that plague field studies. 

r the model. The dynamic recut- 

a, Requests for reprints should be sent to David 
Aaker, Associate Professor, School of Business 

Cah tttion, 350 Barrows Hall, University of 
fornia, Berkeley, California 94720. 


\ NAMIC MODEL OF RELATIONSHIPS AMONG 
\DVERTISING, CONSUMER AWARENESS, 
ATTITUDES, AND BEHAVIOR 


ll lee eh 


A 
F 
5 
8 
75 


applying ordinary least squares 
the resulting parameter estimates are unbiased. 
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A Dynamic Recursive Model 
AW()) = do + GAW(t—1) 
+ a,ATT(t—1) + asMS(t—1, $) 
+ aADV(t) + tall) 
ATT = bo + AW -I) 
+ bATT(t—1) + bsMS(¢—1, 4) 
+ ADV + w(t) 
MS(4, i) = @ WO) 
+ cATT(t) + csMS(t—1, 0 
+ ADV (i) + ueli) 
where: 
. ADV(!) = Advertising exposure at time f 
AW() = Brand awareness at time t 


A ATT) = Attitude toward brand at time ¢ 
MSC 4-1) = Market share of brand during 
L the time period 1 to 1 
u(t) = Error or residual term 

3 a, b, and c terms = Regression co- 
4 efficients. 


Wold has persuasively argued (Strotz & 
Wold, 1971; Wold, 1954; Wold & Jureen, 
1953) that the causal interpretation of a 
model without such time lags can be ambigu- 
ous. The introduction of time lags, on the 
other hand, makes the model dynamic and 
~ tends to expose the direction of causal influ- 
3 ence. In this context, if attitude change 


causes behavioral change, then the dominant 
causal link is from attitude to behavior, and 
the coefficient of lagged attitude in the 
market share equation should be significantly 
more positive than the coefficient of lagged 
market share in the attitude equation. If the 
dime lags were not introduced, a causal link 
micht be inferred, but the direction of causa- 
tion would be in doubt. It should be noted 
chat path analysis (Werts & Linn, 1970; 
Wright, 1954), another approach to the study 
of causal networks, usually excludes time 
~ delays. 
_ The concept of causal links operating with 
a a delay has been used by Campbell in devel- 
oping the cross-lag correlation technique 
(Campbell & Stanley, 1963; Rozelle & Camp- 
bell, 1969). The dynamic model used in this 
study can be viewed as an extension of cross- 
lag correlation in two respects. First, it ex- 
pands the causal network from two to three 
variables. Second, it allows the introduction 
of control variables (the advertising variable 
in this case). 
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The model is also i: ributed-lag 
format, since in each equa dependent 
variable, lagged one period led among 
the independent variable ficient of 
the lagged dependent var es a mer- 
sure of the rate at which luence of 
the independent variab! pated. A 
large coefficient indicates < ay. 

The hypotheses suggest iis artide 
were developed primarily onsidera- 
tion of the behavior of in: Clearly, 
people differ, and the sam not be 
have identically in all situa nce aggre- 
gate data are used in this e are ask 
ing which hypotheses hay h general 
application to emerge in e of the 
aggregation. It should bi sized that 
conflicting hypotheses can be simul- 
taneously operating in the ı 

METHOD 

This study focuses on the cl a brand of 
instant coffee, a low-involveme on process? 
The most attractive feature involvement 
decisions is that many of the ; reasons for 
behavior appearing to be incon: with a prt 
viously stated attitude (Wicker, are mini- 
mized. Specifically, there is not | to be a sif 
nificant influence from (a) person tors, such a 
other attitudes or an inability to m or understand 
the appropriate behavioral respons or (b) situa 
tional factors stemming from diticrences in the 
settings in which attitudes are elicited and choices 
Re as in Figure 1 were estimated from 
data on aggregate measures of brand awareness, A 
tude, and advertising exposure taken from telephone 
surveys spaced at two-month intervals, and 
market share data derived from actual instant coffee 


sales during the period between the surveys. Com- 


pleted survey data for 19 time points, covering 
period February 1963 to May 1967, were av 

Each of the 19 independent cross-section telephone 
surveys was based on a national probability sam 
of 1,200 households who had used instant co! 
within the preceding 60 days. Each survey bit 
timed to coincide with the beginning of the t 1 
month audit period. Before an interview i 
was necessary to establish that the houschold sf 
user of instant coffee and that the responded, ap 
the brand choice decision. Since only 40% 0 

Of course, the level of involvement with na 
coffee varies across people. It would be of in a 
to obtain such a measure on an individual basis 1155 
conduct separate analyses for (relatively) lve t 
involvement groups versus (relatively) Dra l 
ment groups. What is meant is that the 5 ao 
involvement is low relative to issues like J 

f 


i 


to purchases of durables. 


© usage requirement, it was 
3,000 households for each 
reported in this study repre- 
{ $7,000 telephone interviews. 
used in this study were 
19 surveys. 
| of cach brand at time f, 
ortion of households who were 
brand name with the product 
c specific question was; “What 
runde of instant coffee (both 
‘lar type instant) that you know 
-ht around here?" 
ude score for each brand at the 
, Al was based on a weighted aver- 
tion of the sample saying the 
at time ¢ and the proportion 
of several better brands.” * Since 
is based on comparative rather 
ments of the brands, it can be 
market share which describes 
erformance. 
exposure variable at time 4, 
ined from a three-stage interview 
First, each respondent was asked: 
different brands of instant coffee 
raving seen or heard advertised 
tirty days, either on radio or TV; 
ewspapers?” For each brand men- 
dent was then asked: “What can 
the advertising for that particular 
iy what are they saying or showing 
d in their advertising these days?” 
ns were then compared with the 
is being made by the brands at the 
urvey. The comparison was made by 
variable for 


attitud 
absolu 


(The aggregate behavior measure was based on 
brand and product sales data obtained from 
bimonthly audits of a national sample of more than 
2000 retail stores. These audits were conducted by 


13 attitude score Att(t)m, for each Brand m, 
Š based on the following weighted sum of responses: 
rr. = Proportion of total sample 
saying Brand m is the “best” 
in the market /2.00 + 
Proportion of total 
sample saying Brand m is “one 
of several better brands” /3.86- 
me weights are based on the judgments of an inds- 
. n sample of 40 housewives of the degree of 
a rableness or unfavorableness of the item on all 
n scales, using Thurstone’s scaling procedure 
cla 1957). Theoretically this score can range 
Bra, 11 if no users of the product thought the 
F had any merit, to 50.0 if everyone agreed the 
other was the best. Respondents could also choose 
statements, including “this brand is acceptable, 
hot one of the best,” or “unacceptable.” 


FFF 
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: 

4 = 
5 
Hie 


t 
the market varied between 32% 


for each of the five brands 
for all five brands combined, The results are 
shown in Table 1. When the brands were 
aggregated, a dummy variable was inserted as 
the intercept to compensate for the different 
variable levels at which the brands operated. 
An assumption of the model when applied 
to this aggregate datum was that each co. 


The R? values for the individual brands 
ly indicated strong relationships. The 
lowest R? was 30, and in two thirds of the 
equations it was over 50. Further, in over 
half of the equations the lagged dependent 
variable was not significant so the remaining 
variables were explaining most of the vari- 
ance, The R= for the equations with aggre- 
gated data were deceptively high (over 97). 
The ſive brands had relatively small variation 
around different levels reflecting the fact that 
they had very different average market shares. 
Thus, the total variance increased 


©The model was also run with the data divided 
by the appropriate mean for each brand. The 
were not affected significantly. Thus, 
using the original data are reported since their 
interpretation is more meaningful. 
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TABLE í 
Reoression Equations: Esritatine me Dyxasic Recursive M 
E L = — r ———— ———ͤ— == — — 
| Independent variables | 
r | Dependent — t bim | Sammie 
Be Brand) "variable |[—— 5 i 0 R on | a 
J AWG=1) Arra-U | MS@-1.0 | ADVO f 
~T | aw 2 2.85 21.8 a 
; i yo aso | (420) (2.42) (3.65) 80 . 
X 2 AI vi -18 26 11.2 ; 
í | (06) CW (382) | (6 |0 * 
d 3 | aww 30 15 — 4% 232 
(1.06) 6070 (20) (81) | Qan | a2 s| n 
} 
4 | AWM 26 -4.13 25 12 33.61 
(96) (1.90) 87) (2.6) (1.97) | .38 » 10 
s | aww ot —.08 08 42 24.5 
2.800 (1,52) (24) (2.67) (1.5) 57 s 1 
An | Awu) * —.0⁰ 33 46 
(3.60) (86) (71) (3.01) 4 077 0 | w 
1 | ATTO) -01 07 A3 88. | 
(68) (1.84) (57) (92) (1.70) | 30 4 | 8 
2 | ATTU =.10 —.18 68 10 48 | 
(15) (53) (2.07) (68) (32), 30 0 
3 ATT) 11 —.08 09 (.03 —21 | | 
(1.89) (.26) (34) 85) (10 | 55 3 $ 10 
4 | ATT On 105 — 03 Ad 2.51 Í 
(67) (70) (27) (3.37) (2:02) 84 8 |e 
5 | ATTO 2B 02 13 ate =i f 
iA (1,28) (49) (51) (2.3) (1.09). |.51 | 1 io | 
An | ATTO) 01 05 20 10 | | 
g (36) (1,70) (3.00) (3.60) 4 986 | 0 50 0 
1 MS t-+1) -05 70 .03 01 2.45 9 
(1.23) (1.40) (18) (.04) 1.80) | 40 65 1.03" | f 
2 | mser | —.0s 35 «76 =16 1.27. | E 
(1.26) (2.20) (5.20) (.07) (1.86) | .79 20 1.98 
3 | MS(t. c 06, -03 -70 02 -3i | 1 
(1.12) (14) (3.30) (68) (16) 80 | 32 168 | 
4 | MSE i+) 05 a 23 —08 44 | ier 
(1.27) (1.27) (1.05) (80) (1.38) | .34 | 72 36 
5 us, tn 23 34 1 —4.33 | 2 APOE 
1.77 (3.35) (2.07) (1.37) (46) | 78 | 83 2.40 
All | MSG. 1+1) 05 40 51 «06 } f 90 
(6.05) (5.77) (6.92) (1.63) à .998 52 2.34 | 
| st — e 


6 
= brand awareness at time f; ATT H = attitude toward brand av tee 
riod {tot + 1; ADV(@) = advertising exposure at time ¢; AW(t — 1) 2 


Ard — 1) = attitude toward brand at time period : — 1; MS(¢ — 1% = market share 


> R? are unadjusted. 
„ Regression 


mously when the brands were combined. The 
Standard error of regression is the more ap- 
_ propriate indicator of the performance with 
the aggregated set. 

A common problem in time-series studies 
serial correlation which leads to inefficient 
oefficient estimates and understated esti- 
mates of coefficient variances. The Durbin- 
Watson statistics shown in Table 1 indicate 
that this statistical problem is not trouble- 
some in this research. Over half of the values 
are above the upper limit of the Durbin- 


coefficients, 
4 Dummy variables for each brand were inserted in the aggregate equations. 


Watson test, and none are close to the 1 
limit at which the hypothesis of serial ini 
pendence is rejected.“ 


T This test is biased when lagged dependent Mae 
ables are used. This bias, of course, is small 5 d 
the lagged dependent variables are insignificant | 
they often were in this research, especially l ol 
awareness and attitude equations, Further, at 
the equations were run with the dependent var 
omitted. The results suggest that if the bin 
removed from the test, the hypothesis of 1 z 
pendence would still not be rejected at the 0 3 


hypotheses. The influence 
uttitude seems extremely 
ite, the £ value is only 36, 

a positive coefficient sig- 
level, and Brands 1 and 2 
iicients. In contrast, aware- 
much more significant in 
equation, In the aggregate, 
and all the coefficients ex- 

| 2 are positive with f values 
iis result casts some doubt 
concept. However, it is un- 
eflecting the fact that there 
lag between AW(¢ — 1) and 
two-month time period) than 
and MS(t,t + 1). It may be 
me lag between AW(¢ — 1) 
‚uld show a great relationship. 
e minimal impact of lagged 
vttitude and the strong link 
a ess and market share is sur- 


Testing 
awarencs 
fw. In the 
Daly Brand 


| neither the attitude nor the 
variables seemed to influence 
variable to any significant de- 
relevant £ values in the aggregate 
h below 1.0. It should be noted 
> has a relatively high-positive- 


the Brand ? case, especially noting the nega- 
tive awareness coefficient in the market share 
tion, that usage is preceding and perhaps 
causing changes in the awareness level. There 
äre at least two possible explanations for this 
nding. First, impulse buying may be particu- 
arly important for this brand. Second, it 
Might indicate that for this brand the link 
between awareness and market share influ- 
es purchase behavior very quickly. 11 
Awareness and market share were affected in 
‘the same time period, the awareness would 
“Appear to follow the market share change 
Since it was measured at the end of the time 
| Period. Brand 2 differed from the others in 
| that it had a higher emphasis on quality. It 
Was also available as a regular coffee. Fur- 
it was the smallest brand studied and 
grow sli i i yeriods in- 
ed slightly during the time periods 
The attitude and market share terms that 
$ 8. at che heart of the two major competing 
| hypotheses are considered next. Does an atti- 
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tude change precede and cause a behavioral 
change, or does the reverse occur? It seems 
clear that attitude is a significant influence 
on market share in this study. In the aggre- 
gate, the “ value of the attitude coefficient in 
the market share equation was 5.77, and the 
t values for Brands 2 and 5 were over 2.2. 
Only for Brand 3 was the f value low. 

In the attitude equation, the conclusion is 
not so clear. The market share coefficients 
and the / values are lower. Only Brand 2 had 
a significantly positive coefficient. These re- 
sults thus offer some support to the hierarchy- 
of-effects hypothesis. However, there is cér- 
tainly evidence of some influence moving in 
the opposite direction, In the aggregate, the 
market share ¢ value in the attitude equation 
was 3.0, and for Brand 2 it was 2.07. Also, 
the effect of market share on attitude Sem 
higher than that of awareness. However, | 
there is a methodological bias that tends to 

the market share to attitude flow 2 
of influence. It is caused by the fact that a d 
market share and attitude change in the same 4 
time period appear as a market share to atti- 
tude flow of influence. With this in mind, the 
attitude to market share flow of influence 
predicted by the hierarchy model does seem 
to appear more dominant in this study, 

The advertising variable was expected io 
influence awareness, and to a lesser extent 
attitude, but to have little direct impact on 
market share; these h seem to be 
supported. The impact of advertising on 
market share seems small. It is negative, in 
fact, for three of the five brands. In con- 
trast, advertising is a strong influence on 
awareness and attitude. In the awareness 
equation, there is only one case where the F 
advertising variable is not significantly posi- 
tive. In the attitude equation, the results for 
the individual brands are less significant, but 
the overall results are still strong. Advertising 
for Brands 1 and 2 seemed to have more 
impact on awareness than on attitude. It has 
already been noted that Brand 2 was the 
smallest studied. Brand 1 was similar in 
market size to Brand 2, and Brands 4 and 8 
were the largest. In this study, advertising” 
seemed to have an equal impact on aware- 
ness and attitude. Again it should be noted 
that these results could partially be reflecting 
the nature of the time lags actually operating. 


The coefficients of the lagged endogenous 
variables in the recursive equations are also 
of interest. In general, lagged effects were 
more important in the market share equa- 
tions. Thus, the effect of the independent 
variables on awareness and attitude seems to 
dissipate rather rapidly, whereas their im- 
pact on behavior is of a longer-run nature. 
This conclusion, where confirmed, suggests 
that advertising campaigns addressed solely to 
awareness and attitude may not have the 
long-term impact they desire unless the cam- 
paign’s impact is felt fairly soon in market 
behavior. 


Discussion 


The empirical results in this study suggest 
that advertising can directly influence behav- 
jor (market share) by working through 
awareness. There are undoubtedly those to 
whom brand awareness is an important de- 
terminant of purchase for this product class. 
However, advertising can also affect behavior 
through its influence on attitude—probably 
by reaching a different type of buyer. These 
observations held across the five brands used 
in this study. Thus, it appeared that all 


brands had advertising campaigns that af- 


fected both awareness and attitude simul- 
taneously. 
One surprising result was that the influ- 


© ence of advertising seems to go from aware- 
ness directly to behavior and does not operate 


through the attitude construct. In addition, 


a relatively strong support for the hypothesis 


that attitude change precedes behavior change 


was found in this study. Not surprisingly, the 


reverse effect also emerged, although it was 


relatively less significant in the aggregate. 


It should be noted that this study was con- 
ducted under the handicap of significant 


multicollinearity. Thus, the regression coeff- 
cients tend to have higher variances and 


covariances than are desirable. Fortunately, 
the ¢ test is still appropriate, and conclusions 
based on it are still valid. However, one must 
realize that misspecification could distort 
these findings. Additionally, the data involved 


only coffee users. In other contexts, particu- 


larly when nonusers of a product class are 
involved, the results may differ. 


C eS = SS * 


Dav A. AAKER AND GEORGE S. Day 


Two further qualification en the ges- 
erally crude nature of the and adver- 
tising effects variables that mask sub. 
tle changes in the market nature but 
highly competitive charact he instant 
cofiee market. Other meas markets 
may yield different comn processes 
when subjected to this ty time-series 
analysis. Nonetheless, the « icy of the 


effects across five brands enh he validity 


of the conclusions. 


REFERENC! 

CAMPBELL, D. T., & STANLEY xperimental 
and quasi-experimental desi esearch on 
teaching. In N. L. Gage (Ed 00k of re- 
search on teaching. Chicago: | Nally, 1963, 

Epwarps, A, L. Techniques scale com 
struction. New York: Appi ury-Crofts, 
1957. 

Kieser, C. A., Collxs, B. E., er, N. Att 
tude change: A critical an theoretical 


approaches. New York: Wile; 


Krucman, H. E. The impact sion adver- 
tising: Learning without ir ont. Public 
Opinion Quarterly, Fall 1965, 2 356. 

Lavince, R. J., & Steer, G. A del for pre- 
dictive measurements of adve effectiveness: 
Journal of Marketing, 1961, 25 

McGuire, W. Personality and at change: An 
information-processing theory. G. Green- 
wald, T. C. Brock, & T. M rom (Eds.). 


Psychological foundations of atiii: <2. New York: 
Academic Press, 1968. 

ROZELLE, R. M., & Camesett, D. T. More plausible 
rival hypotheses in the cross a panel corre: 
lation technique. Psychological Bulletin, 1969, 7¹ 
74-80. 

Strotz, R. H., & Worp, H. O. A. V 
nonrecursive systems: An attempi at synthesis. In 
H. M. Blalock (Ed.), Causal models in the 30 
sciences. Chicago: Aldine-Atherton, 1971. 

Werts, C. E., & Linn, R. L. Path analysis: Psy 
chological examples. Psychological Bulletin, 1970, 
74, 193-212. a 

Wicker, A. W. An examination of the “Other 1 
bles” explanation of attitude-behavior incon- 
sistency. Journal of Personality and Social Psy- 
chology, 1971, 19, 18-30. 

Worn, H. O. A. Causality and econometrics. E. 
metrica, 1954, 22, 114 ff. í 

Worn, H. O. A., & Jurren, L. Demand analysts 
New York: Wiley, 1953. i F 

WRC, S. The interpretation of multivariate ae 
tems. In O. Kempthorne et al. (Eds.), sa 
and mathematics in biology. Ames: Iowa Stal 
College Press, 1954. 


(Received March 26, 1973) 


Recursive versus 


cond- 


etd 
ivi 


fated 3: 


ics. The present study cross-validated 


coby and Kaplan's (1972) prediction 


Since E. 5 seminal statement (1960), 
many rese. ers have used the construct of 
perceived - < to investigate various aspects 
of consu: behavior. Unfortunately, this 
construct hss not been given a standard defi- 
nition. For ample, in summarizing the early 
work on perceived risk, Cunningham (1967) 
defined this concept in terms of two basic 


Components, uncertainty and consequences, 
with the latter subdivided into two categories, 
Performance and psychosocial consequences. 
Since then, other investigators (e.g., Perry & 
Hamm, 1969; Roselius, 1971) have distin- 
guished psychological from social conse- 
quences and identified yet other types of 
consequences (physical safety and financial). 
AS a result, it has been difficult to compare 
the results across studies of perceived 

A Accordingly, an investigation was Con- 
lucted in 1970 (Jacoby & Kaplan, 1972) to 
determine the interrelationships among the 
ve delineated types of consequences and to 
letermine their individual and collective rela- 
— 


* Requests for reprints should be sent to J. Jacoby, 
S partment of Psychological Sciences, Purdue Uni- 
nity, West Lafayette, Indiana 47907. 
goa operational definitions of the varieties of 
ived risk may be obtained from J. Jacoby. 


ious investigations of perceived risk have 
mate the consequence of overall perceived risk. 
‘porated all these components in the same study 
the components best predicted overall 


r subjects evaluated the physical, psychological, 
c, and overall perceived risk associated with the 


s equations 
in predicted overall perceived risk values for all 
validation reflected very high agreement with the 


COMPONENTS OF PERCEIVED RISK IN 
PRODUCT PURCHASE: 


A CROSS-VALIDATION 


LEON B. KAPLAN 
Advertising Research Department, 
E. J. DuPont, Wilmington, Delaware 
GEORGE J. SZYBILLO axo JACOB JACOBY * 


Depariment of Psychological Sciences, 
Purdue University 


perceived 
the earlier results, One hundred 


tionship to overall perceived risk." In gen- 
eral, the results indicated that the five types 
of consequences an average of 74% 
of the variance in the overall perceived risk 
measures taken across 12 different products. 
Performance risk tended to explain more vari- 
ance than did any other type of consequence, 
As has often been noted and recently 
restated by Heeler and Ray (1972), such 
findings, though interesting, are unsupported 
conjectures until and unless they are cross- 
yalidated. Unfortunately, most consumer be- 
havior literature has not been cross-validated. 
Accordingly, this study reports the results of 
a second study, undertaken two years later 
and employing different subjects, which 
cross-validated the original findings. , 
Subjects : 
The subjects, 104 upperclassmen at Purdue Uni- 4 
versity enrolled in consumer psychology during the be 
spring 1972 semester, were not naive; they had just K 


: 
d 


. 
2 


METHOD 


44 


2Concurrent with Jacoby and Kaplan's (1972) 
work which conceptualized risk in terms of five 
subcomponents, Roselius (1971) identified a sixth 
variety of risk: “Time loss: when some products 
fail, we waste time, convenience, and effort getting 
it adjusted, repaired, or replaced [p. 58].” 
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2 
d TABLE 1 
3 RANKING oF Propucrs sy Mean Risk VALUES ror Eacn Tyre or Pre 
Overall Performance | Financial Social | Ps Physical 
Product — — f= — — - 
|2 | sD- CE | SDR s2 X | sp | Xx t | so 
1. Sports car 7.18 | 1.64 | 7.26 | 1.75 | 7.76 | 1.43 | 7.42 | 208 | 7.32 374 | 
2. Life insurance | 6.50 | 2.10 | 6.58 | 199 | 6.99 | 199 | 4.04 | 2.31 | 47 567 | 
3. Color TV 6.17 | 1.99 | 6.35 | 2.09 | 6.04 | 1.99 | 5.82 | 2.33 | 4.95 3m | 10 
4. Suits 5.91 1.65 | 5.58 1.93 | 6.38 1.84 | 7.13 188 | 681 L93 1. 
5. Winter coats | 5.18 | 1,93 | 5,00 | 209 | 5.54 | 2.00 | 6.37 | 1.88 | 6.16 2.46 | 1 
6. Dress shoes 4.98 | 1.84 | 4.59 | 1.03 | 5.04 | 1.96 | 5.97 | 2.16 | 5.47 3.02 | 
7. Deodorants 3.66 | 2.01 | 4.25 | 234 | 3.80 | 2.07 | 3.39 | 2.34 | 3.81 3.4441 
8. Toothpaste 2.94 | 1.54 | 2.81 | 1.74 | 2.61 | 1.47 2.40 1.66 | 2.23 3.57 1 
Razor blades 2.92 | 1.54 | 3.41 | 1.91 | 3.30 | 2.60 | 2.04 | 1.43 | 2.06 111 
Vitamins 2.76 | 1.61 | 2.66 1.82 2.88 1.76 | 1.76 4 177 3.63 
Aspirin 2.33 | 1.47 237 | 1.87 | 1.94 | 1.07 | 1.49 9 | 147 84 
12. Playing cards 1,60 | 1.07 | 1.65 1.10 1.66 | 1.02 | 2.20 | 1.77 | 175 1,24 | 
a two-hour class on the topic of perceived pute the multiple correlations | the co 
5 each risk variety defined. However, they nents of risk and overall perc k for all] 
Y no information about the earlier study, or product categories. As suggest: Tosier (198 
the manner in which the five types of perceived and Lord and Novick (1965) aression d 
risk were related to each other and to overall efficients from the previous stu: y & Kapha 
perceived risk 1972) were applied to the dat: is study 
obtain predicted overall perceive lues for 
Inst: t product categories. These predict call perce 
5 risk values were then correlate the $ 
A questionnaire was developed to measure the overall perceived risk values to cross- vatima 
amount of each specific type of perceived risk as composite correlations for each n= category. 


well as the overall perceived risk these subjects 
associated with 12 different consumer products. 
Products were selected to cover a substantial portion 
of the overall perceived risk continuum. An assort- 
ent of health, recreational, and hygienic products, 
varying along an expensive- inexpensive dimension 
d including products bought for oneself and for 
others, highly visible and low-profile items, intimate 
nonintimate products, were used. The only 
A on was that the product had to be appropri- 
ate for both sexes. The number of products was 
limited to 12 to reduce respondent fatigue and to 
inimize error. 

Each page of the questionnaire examined one 
aspect of perceived risk (i.e., either overall perceived 
risk, or one of the five consequences) for one of the 
12 products. The top of each page contained the 
definition of each consequence. The interested reader 
is referred to Jacoby and Kaplan (1972) for a 
description of the considerations underlying the de- 
velopment of these specific operational definitions 
and an elaboration of the procedures employed to 
dentify the five types of consequences. 


nalyses 


Means, standard deviations, and the intercorrela- 
tion matrix were determined for these variables; 
also, stepwise multiple regression was used to com- 


RESULTS 


vidual product category were highly Smit 
to those obtained in the first study” I 
overall perceived rank order tends to com 
spond with what would be expected if ü 
products were ordered simply by price ale 
In addition, a certain degree of cons 

validity was established in that the order í 
risk components was similar within meanm 
ful clusters of products. For example, © 
rank order of the risk components for 4 
three apparel products (i.c., suit, overo% 
and dress shoes) was the same and went If 
social risk through psychological, mant! 
and performance risk to physical risk. 2 
tionally, the rank order of risk compose, 


8 Tables with data described but not P 
here because of space limitations are available : 
request from J. Jacoby, as is the earlier Jae 
and Kaplan study. 


: products (toothpaste and 
ume and went from physi- 

erformance, financial, and 
chological risk, These re- 
e obtained from the pre- 
cross-validity). Similar 

* similar risk component 


fer two of 
mpirins) w 
cal risk th 
wxial risk 
sults agree 
viows «stud 


products : 


hierarchies onstruct validity). 

Table 1 the products ranked accord- 
ing to the values on each of the risk 
componen! in the earlier study, the 
financia) r umn indicates that the mean 
values di: cradually until the value for 
dress shot < is followed by a relatively 
substanti in mean value (5.04 to 3.80) 
for deodo after which the means again 
decrease tly. While a monotonic rela- 
tionship n actual price and perceived 
financial uld be expected, the relation- 
ship appe be curvilinear. At the upper 
end of th ce continuum (foreign sports 
car and ‘nsurance) actual dollar value 
appears |: sortant than relative cost. The 
most mea ul break (for the sample of 
college stuc ots used) should occur between 
dress shoe: 5-835) and (deodorants, razor 
blades, anc other $2-$3 items.), and that 
sems to Le what happened. 


An analovous situation would seem to exist 
for social risk. The largest break in mean 
values occurs between color TV and life in- 
surance (5.52 to 4.04). A color TV and the 
products rated above it on social risk (sports 
cars, suits, coats, and shoes) are all highly 
visible, as opposed to life insurance and the 
lower-tated products. These results are similar 
to those reported earlier. 


TABLE 2 
ISTERCORRELATIONS MATRIX POR THE Five Tyres OF 
Risk AND OVERALL PERCEIVED Risk CALCU- 
LATED Across THE 12 Propvet 
CATEGORIES 


SSS r —r— 
Risk | 1 | 2 3 4 | 5 | 6 
. 
1. Financial 1.00 67 76 .23 59 76 
® Psychological 100 61 16 79 69 
y Performance 100 33 58 70 
Physical 100 11 33 
5. Social 100 .67 
6. Overall 1.00 
. ˖ 


PONENTS or Percerven Risk: A Cross-Vatimation 


TABLE 3 


Sowwany Tams row Perono Ovemass u 
Risk rao Irs Curoxexr Vatews eee, 
Aut 12 Paogect Cateouurs 


j $ 
ne | ve | r |ia 

1 Peformance | W| 2 | 

2 Nr | 25| o | 
3 | Financial “ui a 

4 | Social s| oa | 

S | Physical mj a| 

Í Q : 


Ppa 
dE 
: 
E 


REE 
i 
g 
3 
7 
4 


Multiple ved 
i ranged from a low of .53 for playing 
cards to a high of 82 for deodorants. All 


È 


variance associated with 


Table 3 summarizes the results of trying to 
predict overall perceived risk from its com- 
ponents across all 12 product categories. The 
five components accounted for 739% of the 
variance in overall perceived risk ( < 0¹ j 

Table 4 presents what is probably the 
strongest evidence for a substantial degree 


É 
` 
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TABLE 4 


MULTIPLE CORRELATIONS Usinc Five COMPONENT 
VALUES TO PREDICT OVERALL PERCEIVED Risk 
rox First AND SECOND StTupies WITH 

Cross-VALIpATED COMPOSITE 
CORRELATIONS 


Column 1 | Column 2 | Column 3 
| Cross- 
Item | R 2 R validated 
First Second composite 
study study correlation 
1. Aspirin 75 75 72 
2. Suit 7¹ .57 53 
3. Razor blades 83 60 63 
4. Deodorant 8³ 82 79 
5. Foreign sports 
car 64 54 53 
6. 23-inch table 
color TV 65 13 igh 
7. Pair of dress 
shoes 71 79 sie 
8. Winter coat A 77 76 
9. Toothpaste 77 73 70 
10. Life insurance 65 70 74 
11. Vitamins 73 72 70 
12. Playing cards 74 8 52 
13. Overall 78 86 84 


of cross-validation. The regression weights 
from the equations of the previous study were 
applied to the data of this study and pre- 
dicted criterion scores for overall perceived 
risk were obtained. These scores were cor- 
related with the actual observed overall per- 
ceived risk criterion scores of the present 
study and yielded the cross-validated com- 
posite correlations which appear in column 3 
of Table 4. Column 2 presents the multiple 
correlations actually obtained in the present 
study. 

The shrinkage in the multiple correlations 
from the first study (column 1) to the cross- 
validated composite correlations (column 3) 
is relatively negligible for aspirin, deodorant, 
toothpaste and vitamins, and moderate for 
suits, razor blades, foreign sports car, and 
playing cards. However, the shrinkage seems 
minimal when one compares the cross-vali- 
dated composite correlations for those prod- 
ucts to their multiple correlations derived 
from the optimal weights of this study (col- 
umn 2). There was actually an increase in the 
cross-validated composite correlations in com- 


L. B. Kaptan, G. J. SZYBILLO, AND J. Jacony 


parison to the multiple c. s obtained 


in the previous study TV, dress 
shoes, winter coats, and to However, 
in no case does the cross-v composite 
correlation (column 3) exceed »rrespond- 
ing multiple correlation ot the pres- 
ent study (column 2). The: vecessarily 
lower than the multiple c. ms of the 
present study, for these mu! relations 
were obtained with the opt veights de- 
rived from this study, Final e that the 
difference between the preci: olumn 2) 
and obtained (column 3) rs eached or 
exceeded .04. In most insta: ese values 
hover near .02 and suggest rkable de- 
gree of cross-validation. 
Discussion AND Co N 

These results may be co d on two 
levels: substantive and met! sical. Sub- 
stantively, there are at leas > implica- 
tions. First, the data agai icate that 
overall perceived risk can bce y well pre- 
dicted with the five types nsequences 
identified earlier (cf. Jaco Kaplan, 
1972). Second, as found in the <arlier study, 
the data clearly indicate ti erformance 
consequences are most predic of overall 
perceived risk for most kinds of products. 
Third, it reconfirms that sintar types of 
products possess similar sequences 
hierarchies. 

Methodologically, the results of the present 
attempt at cross-validation strongly 
with those of the earlier investigation. Addi- 
tionally, beta weights based on data from the 
first study were highly accurate predictors 


when applied to the data obtained in the 
present investigation. Thus, the attempt at 
cross-validation appears successful. : 
The implications of the present investiga- 
tion are straightforward. The data from both 
investigations should (a) facilitate compa” 
son of results across studies employing dif- 
ferent types of consequences as surrogates for 
overall perceived risk, and (b) provide a basis 
for understanding and perhaps resolving some 
of the conflicting findings of other studies. 
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EFFECT OF ANONYMITY ON RETURN RATE AN! 


RESPONSE BIAS IN A MAIL SURVEY 


CAROL FULLER 1 


Naval Personnel Research and Development Laboratory, Washington, D.C. 


One group of officers and enlisted men to whom the Navy Personnel Survey 
71-1 was mailed were instructed to record their service num their answer 
sheets, while another group were instructed not to identify their™gnswer sheets. 
Answer sheets were returned more frequently by officers asked to identily 
their answer sheets. Differences in answers to attitude questions were low in 
relative frequency and small in magnitude. Pro-Navy statements were en- 
dorsed by a higher proportion of the officers who identified their answer 
sheets; negative statements were endorsed by a higher proportion of the 
officers who did not identify their answer sheets. None of the anonymous- 
identified group differences for the enlisted men’s questionnaire were large 
enough to be of practical importance. 


a 


Among the potential sources of error in the 
use of self-report data is the assumed tend- 
ency for respondents to distort their answers 
in a direction favorable to the sponsor of the 
survey. It frequently has been assumed that 
an important factor in determining the mag- 
nitude of this response bias is whether the 
respondents are asked to indicate their iden- 
_ tity on their answer sheets. However, there 
are important advantages to be gained from 
the use of identified questionnaires. When 
Tesponses can be identified, it is possible to 
relate those responses to other data concern- 
ing the respondents. It is also possible, when 
respondents are identified, to follow up non- 
respondents and thereby avoid bias due to 
self-selection within a survey sample. 
As described by Guilford ( 1954) and 
Anastasi (1968), the tendency for respon- 
dents to “fake good” on self-report inven- 
_ tories is motivated by the desire to make a 
a favorable impression. This falsification re- 
5 quires both that the respondent be motivated 
E to create a favorable impression and that 
be know what responses are regarded as 
4 favorable (Scott, 1968). 
J While it generally has been assumed that 
3 respondents are less likely to fake good when 
responding anonymously, previous research 
using personality inventories and attitude 
questionnaires has revealed small or no dif- 


* Requests for reprints should be sent to Carol 
Fuller, 3300 South 5th Street, Arlington, 


Virginia 
22204. 


ferences in the response given ubjects 
under anonymous and under identi condi- 
tions. This finding is consistent í search 
in which the same individuals been 
tested under both anonymous and entified 
conditions (Corey, 1937; Fisch 1946; 
Gerberich & Mason, 1948; Olson ») and 
when equivalent groups have b tested 
(Ash & Abramson, 1952; Hamel & R 1952; 
Kosen, Kitchen, Kochen, & Siodolosky, 
1970; Rosen, 1960). 

The direction of the differences that have 


been reported between anonymous identi- 
fied conditions has been consistent with a 
faking good or social desirability interpreta- 
tion. Thus, Fischer (1946) found no differ- 
ence in the mean number of personal prob- 
lems checked but did find a greater number 
of “serious problems” checked when the 
reports were unsigned. Becker and Bakal 
(1970) found that anonymous-identiſied con- 
ditions had less effect on the responses of 
subjects who were low in defensiveness than 
subjects who were high in defensiveness. 
Rosen (1960) found that an identified grouP 
expressed more positive attitudes toward a 
particular course in which they were enrolled, 
while there was no difference between the 
anonymous and the identified groups in thelr 
attitudes toward reading. Elinson and Haines 
(1950) reported finding a higher proportion 
of pro-military attitudes among an identifie 
group of Air Force and Army enlisted men 
than among an anonymous group. In a survey 
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Er 
of attitudes 
fice, Benson 


d candidates for public of- 
1) found a greater number 


of “undecide swers in response to inter- 
views than et ballots. 

This stud conducted to assess the 
effect of an: ty and identification on the 
responses 0 l officers and enlisted men 
toa mail su Specifically, the two groups 
of officers a! two groups of enlisted men 
were comp: on (a) return rates, (b) 
demographi acteristics, (c) the number 
of blanks e answer sheets, (d) the 
number of tral responses, and (e) the 
number of ative and positive attitude 
statements sed by the respondents. 

METHOD 

The questi: ire used in conducting the study 
was the Na ersonnel Survey 71-1 (Braunstein, 
1972b, 19720 Navy Personnel Surveys con- 
sist of struct: tionnaires administered period- 
ically to co! titude and opinion data from 
Navy-wide 8. 

Two equi oups of officers and two equiva- 
lent groups O <nlisted men were randomly selected. 
One group of 00 officers and one group of 16,250 
enlisted men re asked to identify their answer 


sheets. Th viduals are referred to as identified 
officer and ed enlisted, respectively. One group 
of 6,500 officcrs and one group of 16,250 enlisted 
Men were asked not to identify their answer sheets 
in any way. se individuals are referred to as 
anonymous of and anonymous enlisted, respec- 
tively. The t office returned 3,559 questionnaires 


leaving 6,173 identified officers, 
s officers, 14,780 identified enlisted 
5 anonymous enlisted men. 


RESULTS 


Return Rates 


The return rates for each of the four groups 
are presented in Table 1. Those officers who 
identified their answer sheets were more likely 
to respond than were those who did not iden- 
tify their answer sheets. Among the enlisted 
men, the return rates were more similar for 
the two groups. 


Questionnaire Responses 


The proportion of identified respondents 
endorsing each demographic item was com- 
Pared to the proportion of anonymous te- 
Spondents endorsing the item. The (957) 
confidence interval for sampling error was 
Subtracted from the difference. Differences 


P ANONYMITY ON SURVEY RETURN Rate AnD Response Bras 
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TABLE 1 


Return Rates rox Ipentiriep-Anonymous Or- 
FICERS AND IpenTIFIED-ANONYMOUS ENLISTED 
Groups 


Group i = 
No. admin- No, re- Propor- 
istered turned | tion fe- 
| | turned 
Officer 
Identified 6,173 3,147 51 
Anonymous 6,203 2,469 AO 
Enlisted 
Identified 14,780 5,293 30 
Anonymous 14,785 5,735 3H 


| : , 
| Questionnaires 


Í 


officers I, 3 = 12.3, df = 1, p < 01 
mous enlisted = .03, 2 = 6.25, df = 1, p < 01, 


less than 5% were considered to be too small 
to be of practical significance. The only dif- 
ferences (minus sampling error) greater than 
5% found between the two officer groups 
indicated more officers on shore duty (identi- 
fied officers, 76%; anonymous officers, 59% 
and more nonpilot line officers (identified of- 

ficers, 27%; anonymous officers, 12%) among 
the identified officer group. 
ence (minus sampling error) greater than 


blank and neutral responses are presented in 1 
Table 2. While enlisted men tended to give 


s Difference in proportion returned for identified- anonymous 4 


b Difference in proportion returned for identified-anony= 


* 


* 


The only differ- 


+ 


5% between the two enlisted groups indi- 


cated that more men in the identified group 
were not rated (identified enlisted men, 28%; 
anonymous enlisted men, 19%). ‘ 


The mean number and the proportion of | 


more blank and neutral responses than did 
the officers, there were no differences between 
the identified and the anonymous officer 
groups or between the identified and the 
anonymous enlisted groups. a 
For each alternative answer to each atti- 
tude/opinion item, the proportion of identi 
fied individuals who chose that alternative 
was subtracted from the proportion of anony- y 
mous individuals who chose that alternative. 
The (95%) confidence interval for sampling 
error was subtracted from this difference. 
Officers. There were 599 alternative answers 
for the questions in the officers’ question- 
naire. The anonymous-identified difference 


TABLE 2 
O Meas Newree of Braxa axo Neoreat RESPONSES 


T Omas ibexnneo-Axoxvuoos Orrcers AND 
Iivexnnko-Axoxwvxovs H Gaovurs 


Blanks Neutral responses 
; Groep — — 
J 1 = J l x 
o: 6,088 | 2 | a7] 7 
4 Iden titted ! 
T Amaymous | s7 | 2 | 15,729 | 6 
Enlisted i 
| Wdentifed | 3835 | 7 | 43,689) 8 
| 36483 6 45,088 8 


exceeded sampling error for 118 alternatives 
(mean difference = 3%; range = 1%-14%). 
For 62 of the 118 questions, the anonymous- 
identified difference did not exceed sampling 
‘error for any alternative. 
_ Among the 599 alternatives there were 165 
hat expressed negative or positive attitudes/ 
ons (eg, “Disagree: There are no en- 
td poor performers in my immediate work 
u "). Of these 165 attitude items, 85 
positive statements and 80 were negative 
E The anonymous-identified difference ex- 
ceeded sampling error for 20 of the 85 posi- 
d statements (1%-10%). For only 2 of 
20 statements did the anonymous— 
entified difference (minus sampling error) 
d 5% (7% and 10%). Each of the 20 
Positive statements was endorsed more fre- 
quently by the identified group. 
The anonymous-identified difference ex- 
ceeded sampling error for 25 of the state- 
ments expressing negative attitudes. For 7 of 
these 25 negative statements, the anonymous- 
identified difference (minus sampling error) 
Was greater than 5% (5% or 6%). Each of 
the 25 negative statements was endorsed 
more frequently by the anonymous group. 
Enlisted. There were 635 alternative an- 
wers for the questions in the enlisted men's 
questionnaire. The identified~anonymous dif- 
ference exceeded sampling error for 51 al- 
ternatives (mean difference = 2%; range = 
1% 7%). For 88 of the 120 questions, the 
identified-anonymous difference did not ex- 
ceed sampling error for any alternative, 
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Among the 635 alternatives we )8 state- 


ments that expressed negative or tive atti- 
tudes/opinions. There were 99 ive and 
99 negative statements. 

The identified-anonymous group ifference 
exceeded sampling error for 6 of positive 
statements (1% or 2%) and í of the 
negative statements (all = 1% ) 1 state- 
ment was endorsed more freque by the 
anonymous group for all but 1 o. these 12 
statements, 

Discussion 

The difference in the return ra for the 
two officer groups could be due r toa 
heightening of the motivation sponse 
among the identified group or « pening 
of response motivation among anony- 
mous group, Comparison of these n rates 
with those for the Navy Persor Survey 
70-1 (Braunstein, 1972a; Br ein & 
Muldrow, 1972) supports the lat! terpre- 
tation. All persons were asked lentify 
their answer sheets for Navy Pe vel Sur- 
vey 70-1. Of the officers who w mailed 
questionnaires in 1970, 78% respon |: 52% 
of the enlisted men responded in While 
the return rates for both of the offic: groups 
and both of the enlisted men’s grow, in 1971 
were lower, the rates for the two enlisted 
men’s groups were depressed abou! equally. 
The return rate for the anonymous officer 
group was reduced even more than the return 
rate for the identified officer group. Two 


interpretations of the lower return rate for 
the anonymous officer group are possible. 
Certain persons may feel that their opinions 
are important to the study only when they 
are identified as their individual opinions. 
An unintended consequence of anonymity 
may be a reduction of this perceived value 
of the specific individual's responses. An 
alternative interpretation is that the officers 
tend to feel some pressure to respond and the 
anonymity instructions reduced this perceived 
pressure. 

The anonymous and the identified officer 
groups were found to differ on two demo- 
graphic characteristics, and the anonymous 
and the identified enlisted men’s groups dif- 
fered on one such characteristic. It is possible 
that there was an interaction between these 


the identification instruc- 
sons with these character- 
kely to respond when in- 
their answer sheets than 

rs were anonymous. It is 
. by chance, more of these 
and enlisted men were 


possible 
icular © 


the i ication instructions. There 
no evider vailable that provides a 
basis for chi between these alternative 


he differences for attitude 
n the anonymous and the 
groups were not large 
direction in every case was 
a social desirability bias. 
few differences for attitude 
en the anonymous and the 
men's groups, and the 
very small (1%-2%). 
th the lower response rate 
us officer group and the more 
mal responses of the identi- 
p suggests an interpretation 
terms of an interaction be- 
‘ructions and the presurvey 
officer groups. This interpre- 
that individuals with positive 
ord the Navy are more likely to 
uswer sheets when they are 
asked to identify their answers than when 
they are instructed not to identify their an- 
wers. This would lead to a higher proportion 
‘Of individuals in the identified officer group 
vho have positive attitudes toward the Navy 
ind a higher proportion of individuals in the 
Anonymous officer group who have negative 
Attitudes toward the Navy. Since the atti- 
tudes of the respondents were not measured 
Prior to the administration of the survey, it 
Not possible to determine whether the more 
favorable responses of the identified officer 
Soup are due to the influence of the identi- 
tion instructions or to a greater prob- 
ability of response among individuals with 
Positive attitudes when they received the 
entification instructions. 
The low frequency and small magnitude of 
ne anonymous-identified differences are con- 
“istent with previous research. It is possible 
2 t there was little social desirability bias 
the responses of either group; that is, both 
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the anonymous and the identified groups re- 
sponded candidly. Since the content of the 
survey was not highly personal or sensitive, 
it is possible that individuals in neither group 
felt any need to dissimulate. However, pre- 
vious research indicates little or no difference — 
in the responses of identified and anonymous 
groups even for highly sensitive information 
(Ash & Abramson, 1952; Corey, 1937; 
Fischer, 1946; Kosen et al., 1970; On, 
1936). King (1970) found no difference be- 
tween anonymous and identified groups in be 
proportion of individuals who admitted the 
use of illegal drugs. í 
A comparison of the responses given by 
an anonymous and an identified group, how- — 
ever, does not necessarily indicate the amount 
of social desirability bias in the responses of 
the identified group. It seems unlikely that 
social desirability bias is the result of a delib- 
erate, highly conscious attempt to mislead 
the researcher. Rather, it has been viewed — 
as a nondeliberate tendency of which the re- 
spondent is not aware (Anastasi, 1968; Ed- 
wards, 1957). Kosen et al. (1970) found no | 
difference in the response to neutral ques- 
tions or emotionally threatening questions be- 
tween a group which rated “possible question- — 
naire items” and a group which responded 
orally to the questions in the presence of the 
experimenter. Items relating to sensitive in- 
formation may be more likely to elicit social 
desirability tendencies but these tendencies — 
are apparently evoked equally for anonymous 
and for identified respondents. If one were 
interested in determining the amount of social 
desirability bias in particular questionnaire 
responses, it would be necessary to employ 
additional, indirect methods of measurement 
that are less susceptible to this potential 
Source of error (Campbell & Fiske, 1959; 
Cook & Selltiz, 1964). 
Guarantees of anonymity frequently may 
be ineffective. Becker and Bakal (1970) sug- 
gested that for certain subjects, anonymity 
instructions may make subjects more con- 
cerned about the favorability of their re- 
sponses. Also, it is possible that those sub- 
jects who are more predisposed to social 
desirability bias may be more prone to be 
suspicious regarding assurances of anonymity. $ 
If subjects in the anonymous group are not 
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r return of questionnaires. 


eported method (Dillman, 
ccessful in stimulating high 
questionnaires, depended on 
pulations of virtually all as- 
cedure such as questionnaire 
of appeals, and follow-up 
. The use of this method twice re- 
-response rate (75%) to a 12- 
item) questionnaire from the 
traditionally a population from 
d to elicit high response. One 
t of this concerted application of tech- 
i was personalized communications. 
However, the extent to which it alone con- 
ributed to the high response was not deter- 
mined. One purpose in the present study is 
t0 present results from two experiments which 
ested the effect of personalization when all 
4 the other factors of the mail technique 
Noted above were controlled. 
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TRIBUTION OF PERSONALIZATION TO MAIL 
ESTIONNAIRE RESPONSE AS AN ELEMENT 
OF A PREVIOUSLY TESTED METHOD 


DON A. DILLMAN* axp JAMES H. FREY 
Washington State University 


periments reported the effects of varying degrees of 
rates to mail questionnaires. In Experiment 1, half of am alumni 
received three personalized cover letters with a university question- 
while the other half received three un 
ıs higher in the personalized condition, although the later introduction 
ichly personalized method in both personal and not personal conditions 
ot increase response. In Experiment 2, nearly a methodological replica- 
ialf of a sample of the general public received a different questionnaire 
while the other half received the questionnaire after 
ct by telephone. The call did not increase response rates but did stimulate 


letters. The return 


dent that he is receiving the researcher's in 
dividual attention. More specifically, 
alization refers to how one is communicate 
with and not just whether the communica: 
tion occurs. Ways frequently used to accom 
plish personalization include individ: 
typed letters, handwritten letters or poste 
scripts, individual saluations, hand- applied 
signatures, telephone calls to each respondent, 
and some types of follow-up“ The experi- 
ments reported in this study deal with the 
following types of personalization: (a) indi- 
vidually typed names, inside addresses, and 
personal salutations in conjunction with indi; 
vidually signed letters (versus preprinted 
salutations and signatures), and (6) prior con- 
tact (versus no prior contact) by telephone. 
A number of researchers have argued 
personalization is one of the important 
ments of a method that increased 8 
(Reeder, 1960; Roeher, 1963; Tallent 
Reiss, 1959). However, experimental da 
were not available to support their c 
tions. Experiments by others have resulted 
increases of five to eight percentage points as 
a result of personalization (Linsky, 1965; 
Longworth, 1953; Simon, 1967). Two other 
experiments have shown no significant posi- 


— ee eer 

3 Follow-up letters may or may not be a form 
personalization. Tf a follow-up is sent only to tho 
not responding to an earlier mailing, and that 
is indicated, for example, “we have not yet receiv 
your questionnaire,” some degree of personalizati 


js achieved. 
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Ford, 1947; Kawash & Aleamoni, 1971), 
Finally, there have been some instances in 
which personalization actually produced a 
t response than did an impersonal 
Procedure (Andreasen, 1970: Simon, 1967). 
These mixed results from past research, 
plus the fact that the studies differed in many 
Ways (e.g., topic and length), make it diffi- 
cult to determine what the effect would be 
itt a researcher added a personalization pro- 
cedure to his study. A second problem associ- 
" ated specifically with the studies in which 
Personalization had a positive effect was that 
Fesponse rates ranged from only 20% to 60%. 
Unless response rates of a magnitude as 
great or greater than that achieved in the 
Dillman (1972) research can be consistently 
obtained, the mail questionnaire technique 
will be of limited value to social scientists. If 
bersonalization is not found to push rela- 
tively high response even higher, then it can 
be discarded in favor of mass mailing proce- 
“dures with considerable savings of time 
E money. 
, Our tests of Personalization procedures are 
concerned with the effect of varying degrees 
Of personalization rather than with just the 
effects of its Presence and its absence. The 
‘first experiment tests the effects of mail per- 
‘Sonalization techniques, and the second ex- 
periment tests the effects of using personaliza- 
tion by telephone in addition (rather than as 
an alternative) to personalization by mail. 


EXPERIMENT 1— PERSONAL SALUTATION AND 
__ REAL SIGNATURE Ix COVER LETTERS 


€ personalization procedure was implemented jn 
the following fashion: 


very comparable to the one whi 
originally typed. (b) Names, 
lear M.... Were individually placed on each 


2 
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tive effects from personalization (Clausen & letter using the same typen 
+ 
; 
§ 


c) The Project 
director’s signature was indisi signed in bls 
ink (for contrast) with a ba! en to leave an 
impression on the paper as > of a “real 
Signature. One half of the received cover 
letters produced in this man control group 
received entirely multilithed | th the saluta- 
tion of “Dear Alum” and wi: ick preprinted 
signature. The experiment inued with a 
follow-up postcard (an applic ture versus a 


preprinted signature), and a s 
produced in the same manne: 


low-up letter 
original cover 


letter, Thus, one group receiv personalized 
communications, the other gro 

A subexperiment was included ourth mailing 
which consisted of a replacement -- ionnaire and 
a cover letter sent by certified Half of the 
nonrespondents from each of tha groups re- 
ceived a letter done in identical. to the pre- 
vious one they had received. TI, nder of each 
group received a “highly persé letter that 
was individually typed. Thus, two original 
Personalization treatments were c- d into four: 
(a) personal throughout all fou: igs, (b) per- 
sonal in three mailings and hig: onalized in 
the fourth mailing; (c) imperso oughout all 


four mailings; and (d) impersonal hree mailings 


and highly personal in the fourth iling. A dis- 
cerning respondent could distinguis between the 
“personal” and “highly personal” ir part because 
of the indentations produced by the vewriter. The 
individually typed letters were produced on a mag- 
netic tape selectric typewriter (MT After being 
placed on tape, the letters could be cificiently pro- 
duced at the rate of 20 per hour by the operator 


who added inside addresses and salutations to each. 
Results 


The personalization treatment used in the 
first three mailings did. make a difference in 
the results. The proportion of questionnaires 
returned after these mailings was nearly 

“Percentage points greater for the personal- 
ized group (x? = 8.29; p<.01). On the 
basis of these findings we conclude that the 
personalization procedures are an important 
part of the method and their omission would 
adversely affect response. 

The results of the subexperiment included 
in the fourth mailing were mixed. The addi- 
tion of the highly personalized method did 
not increase response. The response rates 
were virtually the same for both the group 
receiving the nonpersonal treatment through 
all four Mailings and the nonpersonal sub- 
group receiving the switch to the highly per 
sonalized treatment in the fourth mailing 
(45.9% versus 45.3%). The personal sub- 


™ 


Con 

poup given \igh-personalization treat- 

ment in the h mailing actually had a 

r respons i did the rest of its group 
Which receiv: e personalized treatment 
through all mailings (50% versus 
73.3% ). 

These find upport the notion that one 
Cannot recov: n a low-response rate with 
a last-ditch at personalization. One 

ible expl: n for the findings which 
how no diffe in response rates is that 
the time : on had received his fourth 


ry little note of the specific 
ving guessed what it was 
he letters were different in 
cation procedures, the ques- 
irn envelopes were not. His 
id or not to respond at that 
cen based on the carry-over 
alization or the nonperson- 
res through previous mail- 
‘ourth mailing per se. If so, 
even a stronger argument 
entation of personalization. 
e but unexplored effect of 
ight be to increase the speed 
“ai tionnaires are returned. This 
s desirable for two reasons. First, if returns 
ome in faster, follow-up is less expensive be- 
lise fewer people need to be contacted. 
g the retrieval time lessens the 
Mance of an uncontrolled event affecting the 
Ponse quality. Surveys need to be imple- 
Mented as rapidly as possible on issues for 
Which attitudes are changing (e.g., candidate 
Preferences in an election) or issues which 
Subject to the intrusion of some major 
ent (e.g., the assassination of a public 
sire during a study of opinions on gun 
*ntrol). Thus, increasing the speed of re- 
Mms should, in the long run, improve data 
ality. A plot of the cumulative day-by-day 
urns for each of the experimental groups 
vealed that the respondents who received 
~ Personalized communication did return 
ei questionnaires more quickly, but only to 
icht extent. A difference of two to three 
j oa was consistent through the fourth 
“ng. This was hardly enough to justify 
Personalization procedure for that rea- 
one, 


bout. Altho 
{ of pers: 
aire and 


early im) 
Another po 


TION OF PERSONALIZATION TO MAIL QUESTIONNAIRE RESPONSE 


TABLE 1 


Response Rates FOR LETTER PERSONALUATION Ex- 
PERIMENTS APPLIED TO ALvuxt Srupy, OVERALL 
AND By TREATMENT 


| No. of returns | 
Treatment = —| %& 
No. mailed | No, completed 
— . —ͤ — — 
Response to first three mailings 
Personal 45° 353 
Not personal 409 322 68.7 
Unidentified = 3 2 
Totals | 927 678 731 
Response to fourth mailing 
Personal 45 33 73.3 
Highly personal 40 23 50.0 
Subtotals 91 56 01.5 
Not personal 61 28 45.9 
Highly personal 64 29 45.3 
Subtotals 125 57 45.6 
Totals 216 113 $2.3 
Totals 

Personal 44 409 92.1 
Not personal 47 379 84.8 

Unidentified — 3 = 
Grand total 891 | 791> 88.8 


a This represents the number of questionnaires which could 
have been Completed. It is somewhat lower than the number of 
questionnaires mailed because of incorrect addresses, deaths 
and because of the inclusion of persons on the original mailing 
lists who were “supporters” of the university, but not alumni. 
It was further adjusted downward, as reflected on the totals 
column; due to the receipt of information on additional incli- 
gible respondents after the fourth mailing had been sent. | 

b This figure includes respondents whose personalization 
treatment could not be identified because they had torn the 
identification number from their questionnaire, 


EXPERIMENT 2—A Prior TELEPHONE CALL 
Method 


The telephone experiment builds on the mail ex- 
periment previously reported in Experiment 1. A 
full set of mail personalization procedures, from 
the first mailing to the last, was applied to all the 
respondents. The telephone was used, experimen- 
tally, to increase the dose of personalization received 
by part of the respondents. The object was to deter- 
mine the effects of the increased increment of 
personalization. : 

A sample (n= 696) of the general public of the 
state of Washington was systematically drawn from 
all telephone directories in the state. The question- 
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naire was 12 pages long and required 89 responses. 
It was believed that the topic, similar to that of 
the alumni study, had little salience to most of these 
persons, and it would be difficult to secure a high 
response to the lengthy and detailed questionnaire. 

Telephone contact was designed to encourage mail 
response. The procedure followed in each contact 
was to (a) explain the nature of the study; (b) 
create interest by asking a few questions on the 
topic, selected because of their presumed interest to 
the respondent; (c) explain that we wanted to send 
them a mailed questionnaire; (d) emphasize that 
they, individually, were of great importance to the 
study; and (e) answer their questions. Clearly a tele- 
phone conversation can convey the fact of individual 
attention to the respondent. 

The experimental procedure was as follows: The 
sample was divided into two halves—one group to 
receive a mail questionnaire without prior contact, 
the other to be telephoned before receiving the 
questionnaire. Only two thirds of the group to be 
telephoned was actually interviewed. Twenty- 
eight could not be reached even after 8 to 10 at- 
tempts in addition to checks with the operator to 
determine if the numbers were working and correct. 
Of those contacted, nearly 23% refused to be 
interviewed. 


Results 


Response rates for each group were as fol- 
lows: (a) those interviewed by telephone, 
75.2%; (b) those who refused over the tele- 
phone, 50%; (c) those who could not be 
reached by telephone, 45.2%; and (d) the 
control group, 64.5%. People interviewed on 
the telephone were significantly more likely 
than were the control group to return com- 
pleted questionnaires (x? = 7.43; p< .05). 
However, the overall response rate for the 
complete experimental group was virtually the 
same (66.9%) as that for the control group. 
Thus, we must raise the question as to 
whether the telephone made any difference at 
all. It seems plausible that those people most 
likely to return a questionnaire are also likely 
to submit to a telephone interview. We may 
have talked by phone with a selective sub- 
group of the sample who would have been 
just as likely to return a mail questionnaire 
without the call as with it. 

The “high personalization” subexperiment 
applied to the fourth mailing of the alumni 
study was also run on the sample of the 
general public. Half of the nonrespondents in 
each experimental group were randomly se- 
lected to receive the individually typed 
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letters. The remainder receive: less per- 
sonal multilithed letters. Ag he results 
were not significant. Of thos: eiving the 
highly personalized letter, 24.: turned the 
completed questionnaires con to 21.5% 
who received the less persona! d version. 
These findings reinforced the lusion of 
the mail only experiment tha ist-ditch- 
high-personalization attempt i urranted. 

However, the speed of was in- 
creased rather dramatically. percent 
of those interviewed by telepi vho even- 
tually returned the questionna nad done 
so prior to the second follow-uy attempt. 
This compares to only 52% of {he control 
group. The group interviewed |» telephone 
returned 90% of all questionnaires within 25 
days of the first mailing. The e elapsed 
before the control group had pa the 90% 
mark was nearly double, or 48 «ys. Even 
when the no-contact and refuse! subgroups 
who responded more slowly il did the 
group interviewed were include the ex- 
perimental group, the speed of onse was 
still much faster than that for ‘ne control 
group. Seventy-five percent of total ex- 
perimental group respondents hs’ returned 


-d mailing 
group. 


their questionnaires before the > 
as compared to 52% of the contr 


Discussion 


Personalization effects may have substan- 
tial significance for making the mail question- 
naire method a viable technique for the 
collection of social science data. The experi- 
ments were with questionnaires of consider- 
able length under conditions of already high 
response. Other procedures thought to be im, 
portant on the basis of previous research had 
already been applied. The results showed that 
the application of personalization raised high 
response rates even higher. An economically 
produced multilithed letter with the salute 
tion in matching type and a real signature 
which together give the “look” of an indi- 
vidually written letter, increased response 
rates significantly. Although a return rate o 
nearly 85% may be acceptable, one of i 
over 92% is even more gratifying, partici; 
larly inasmuch as the number of “hard-core 
holdouts has been reduced in this particulat 


Conti 
study by one An additional dose of 
personalization, telephone call, did not 
increase the ise rate for a separate 
sample but did ficantly increase the speed 
of reply. 

Implication: iese experiments extend 
beyond the pí lization procedures used. 
A further test basic method previously 
reported by w (1972) also has been 
provided. The eral public is among the 


most difficult 
high response 
is because of 
Specific appea! 
ings of recipro 

a role in ot! 

designed to in 
Another facto 
such an extren 
_ to generate re 

characteristics 
belief structure 
creating furthe 
On these grou 
of stimulating 


roups from which to get a 
mail questionnaires. This 
difficulty of forming group 
taking advantage of feel- 
which have so often played 
udies reporting techniques 
se responses (Scott, 1961). 
making the general public 
iy difficult group from which 
nse is that the differences in 
ch as age, education, and 
are at nearly the maximum, 
ficulties in communication. 
, it is expected that methods 
, ponse should be more effec- 
tive on specialized homogeneous groups than 
the general public. The fact that the response 
tate for the group of alumni receiving the 
complete treatment (92.1%) was higher 
than that received by the three samples of 
the general public (75% in the previously 
Mentioned two response rates reported by 
Dillman (1972 ) and 65% in the response 
tate reported in this study) seems to confirm 
this observation.“ 


This should not be construed as meaning per- 
Sonalization should reduce by half of the Saber 
of holdouts in any study where personalization is 
i rather it underscores the fact that personaliza- 
tial made a considerable dent in a group of poten- 
oth respondents who were resistant to all of the 
ae techniques applied as a means of increasing 
ie It is as important to note what happens 
res e number of holdouts as well as how much the 

ae rate per se is increased. 
cation a be pointed out, however, that the edu- 
quite a level of the alumni group was obviously 

e igh, and that this factor has been shown to 

associated positively with response (Longworth, 

1 Secondly, as previously noted, the question- 

05 sent to the alumni was slightly shorter than 
e sent to the general public. 
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The goal of efforts to improve mail ques- 
tionnaire response is to increase the viability 
of the technique for research. It is note- 
worthy to observe that the response level 
obtained in the alumni sample approaches 
what is generally a very satisfactory level for 
personal interview studies. The geographical 
dispersion of the Washington State Univer- 
sity alumni (43 states and nine foreign coun- 
tries) would have made personal interviews 
impossible. Thus, instead of considering the 
mail questionnaire to be cheaper but an unde- 
sirable alternative, it may be considered in 
some instances as a cheaper but desirable 
alternative. Its use in such situations can add 
significantly to our capabilities for surveying 
difficult to interview populations. 
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Impressive evidence has accumulated regard- 
ing the validity and reliability of peer ratings 
in many different situations (Hollander, 
1965; Kubany, 1957; Roadman, 1964; Tupes, 
1957; Weitz, 1958). Two methods of obtain- 
ing peer ratings have been widely used. Most 
common is a nomination procedure in which 
raters are required to choose, from among 
their peers, a specified number of persons 
who are “highest” on some performance 
continuum and an equal number of persons 
who are lowest.“ Because of its simplicity, 
this method is especially suited to situations 
in which a large number of persons are to be 
tated. The second method is a complete or 
partial ranking of peers that may be accom- 
plished by several procedures such as simple 
rank ordering, alternation ranking, or paired 
comparisons. Ranking methods are typically 
used when relatively few persons are rated. 

In either case, a scale is usually formed 
by summing algebraically the highest (posi- 
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Requests for reprints should be sent to Gary 
Kaufman, Room 1513, Internal Revenue Service, 
1111 Constitution Avenue, N.W., Washington, D.C. 
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SCALING PEER RATINGS: 


AN EXAMINATION OF THE DIFFERENTIAL VALIDI | 
OF POSITIVE AND NEGATIVE NOMINATIONS 


Peer ratings obtained by the nomination technique are typically scaled 

algebraically combining the top nominations with the bottom nominati 
It is thus implicitly assumed that top and bottom ratings are equally v+ 
measures of performance. This assumption was tested in two studies of Res 

Officer Training Corps (ROTC) cadets (Vs = 113, 87) by correlating se\ 

kinds of peer rating scales with a number of criteria. The results suggest t 
negative nominations contribute little to peer rating scales and that t 
frequency of positive nominations may be the most useful index. 
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tive) and the lowest (negative minations 


received by each ratee or by “aining an 
average of the combination of itive and 
negative ratings. These proced: ire based 
on the implicit assumption ombining 
positive and negative evaluati o form & 
single scale best reflects the rel: informa- 
tion contained within a set 0“ ngs. The 
present study tested this hypoth: , for the 
case of the nomination method, by comparing 
the correlations between peer rating: scaled by 
different methods and several other measures 


of performance. 


Srupy 1 
Method 


Subjects in the initial study were 113 members of 
the 1969 senior class of the University of Tennesse 
Army Reserve Officer Training Corps (ROTC). Each 
cadet was asked to nominate, from a list of names 
provided him, the five persons in the class who were 
most effective as officers and the five who were least 
effective. The nominated names were to be listed in 
rank order, with the most effective listed first, the 
next most effective listed second, and so on. 
rank ordering was required to provide weights neces- 
sary for several of the scaling procedures under 
investigation. An individual rated most effective Was 
given the score +5, the next most effective, +4 an 
so on. Among those nominated least effective, the 
lowest ranking individual was given the score ~$ 
the next least effective, —4, and so on through J. 

The ratings were then scaled by seven different 
methods: (a) the algebraic sum of the weighted 
nominations; (b) the total number of nominations 
received; (c) the frequency of most effective (Post 
tive) nominations; (d) the frequency of least effec- 
tive (negative) nominations; (e) the algebraic um 
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it at .05 level. 


ations divided by the number of 
ations receive; (% the algebraic sum of 
d positive nominations divided by the num- 
Positive nominations received; and (g) the 
fic sum of the weighted negative nominations 
| by the number of negative nominations 


nce criteria were then collected from the 
“Past performance records in the university 
Program and at a six-week summer camp. 
“Criterion measures were: (a) distinguished 
ry student, an award made to about 20% of 
based on ROTC grades, academic grades, 
mmendations from several Army officers; 
u camp leadership ratings, a graphic rating 
“combined with a trait checklist completed by 
fs! summer camp platoon leaders; (c) mili- 
2 ncy, an objective composite measure 
on performance at the rifle range, in compass 
Teading courses, physical fitness, and à 
examination on military knowledge; (d) drill 
nce, a rating of the cadets’ abilities to con- 
E drill and leadership in field situations 

Cadet and regular Army officers; (e) ROTC 
Standing, a ranking of cadets by ROTC grade 
average; (f) original rank, military rank 
d at the beginning of a cadet’s senior year; 

Present rank, the cadet's military rank at the 
* Senior year. 


correlations between the peer rating 
1 the performance criteria are shown 
1. These relationships, by visual in- 
of the scatter plots, appeared to be 

linear and homoscedastic. Inter- 
ions among the criteria are shown in 
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TABLE 1 
< BETWEEN Peer RN Scares AND ROTC ano Sumere Cawr Carrexia 
(N = 113) 
vigebraic | No, rat- 5 = f * | 4 
sum ings OY TA y i F 
| Positive | Negative | Positive l Negative 
ae a = 
50 50 60 —.17 K a | 40 — oO 
Fb .06 42 — 44 48 35 -07 
30 10 31 —.25 20 10 | — 07 
40 38 4 —.13 42 4 -17 
20 2¹ . 8 —.14 28 3 —.0¹ 
62 59 11 —.19 63 42 ~ 18 
55 58 60 —. 18 55 38 -143 


— — — E 


rom f positive at .O1 level (Hotelling, 1940). 


Table 2. The frequency of positive nomina- 
tions emerged as the best predictor of the 
various criteria, predicting six of the seven 
criteria at least as well as, and generally 
better than, did the other scales. The criteria 
are reasonably independent; initial and final 
military rank, however, have an extremely 
high intercorrelation. Taken as a whole, these 
data suggest that the negative nominations add 
little relevant variance to peer evaluations. 
These results, however, required replica- 
tion to ascertain that differences between the 
correlations are reliable, and that the pattern 
can be generalized to other criterion mea- 
sures. A second study was therefore under- 
taken, employing a new sample and different 
performance criteria. It was also possible, in 
the second study, to collect peer ratings prior 
to the subjects’ participation in summer camp. 


Stupy 2 


Method 

A nomination procedure identical to that of the 
first study was used, with the exception of the 
instructions to rank order the nominees, the results 
of which contributed little to the validity of the 
ratings in the first study. The subjects were 87 
cadets in their junior year; performance criteria 
were obtained the following year after their attend- 
ance at summer camp. The summer camp criteria 
had been completely revised, and consisted of three 
measures: (a) performance, a graphic rating of the 
cadets’ leadership abilities in various summer camp 
situations by the platoon leaders completed at least 
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TABLE 2 


È 


j 
f 


i 


— 


Drill per- ROTC ¢ 
formance standir 


Spearman-Brown formula. Thes: 
were based on data from the fir 
The third column of coefficien 
lated using rate-rerate correlat 0 
six-month period. The data for these 
tions came from a study 1 
not described in this article (Każin 
The split-half and ra Grate «stim 
reliability were very similar des 
of different raters and different 
calculating the coefficients. The intrad 
estimate was surprisingly divergent from | 
other estimates. The intraclass coeffcien 
however, were calculated on weig/ted no 
nations, and the split-half coefficients 
calculated on the frequencies of nomi tio 
thus, the intraclass coefficients reflected 
ferences between raters in assigning rel 
Positions among the top five and the bottai 


TABLE 3 


CORRELATIONS BETWEEN PEER RATING 
(N = 89) axp Revisep Susorer Caur CR 


Peer rating scales 


Performance 
Persona! char- 
acteristics 

Situations 


Nole. r = .21 significant at .05 level. Sol 
` level (Hot 
inane ntly different from 7 positive at .01 I ** 
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ive, while the uf reflected rater agree- 
ovat in assign lividuals to one of two 
uses, the to; or the bottom five. 
USSION 
The results wo investigations clearly 
watradict the ption that an algebraic 
mm or aver positive and negative 
tmeminations > 1c most valid prediction 
fedex. Of the procedures investigated 
È this study st useful was simply the 
frequency of ive nominations. These 
wults are sin those obtained by Webb 
(1987), who : i a biserial correlation of 
between raic sum of the highest 


"three lowest 
thotomy of th 


peer ratings and a di- 
ceiving highest ratings and 


these receiving tings, suggesting that the 
rative non ns were of little value. 
| Correlations peer ratings with other 
Performance ires, however, were not 
tained. 

Va possible « planation of this phenome- 
ma, Doll an ngo (1962) offered the 
hypothesis tha: ow ratings are contaminated 
by reactions of :aters to antisocial behaviors 

are unrelated to performance criteria. 
To test this h othesis, they experimentally 
femoved the low ratings which were given 
for antisocial reasons and found that the 


validity of the peer rating was enhanced. 
t procedure, however, was fairly com- 
ed and inapplicable for most practical 


Our conclusion is also similar to one 
Ratched by Berkshire and Highland (1953) 


TABLE 4 


Reuaniines or Peer RATING SCALES 


Correlations 
Rating Intraclass | 
for weighted ee Rate-rerate 
nominations — 
( — 113) | 110 
1 
l 64 82 
Motto —0² 59 
Tota 90 64 


complaint that the peer rating asks a rater to 
“cut his buddy's throat” (Webb, 1955), and < 


tially d 
on iP Hollander’s (1965) study, also car- 


ried out in a military setting, suggest 
peer ratings may be usefully obtained as early 
as three weeks after hiring a trainee, and thus 
provide an inexpensive alternative to more 
costly methods of assessment. 
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mpany, Portsmouth, Virginia 


mpts have been made to 
nse style problems inherent 
g on vidual rate another by using 
(e.g., forced-choice, paired 
on) or various raters (€g. 

ough each proposed innova- 
‘ost one or more rater errors, 
it will give rise to other seri- 
‘at are unique to the proposed 
Innovations in rating, although 
e likely to result in robbing Peter 


Tesponse style problems, as well as the 
al benefits inherent in rating proce- 
can be discussed by breaking down the 
| rating variance in an analysis of vari- 
ework (Guilford, 1954). In a situa- 
re several raters rate several ratees 
al categories, one obtains the follow- 
Mrces of variance: 


S X Categories 
ees X Categories 
S X Ratees X Categories 


sts for reprints should be sent to Robert 
haska, Corporate Personnel Research, Gen- 
lectrice Company, College Boulevard, Ports- 
Virginia 23705. 

W at General Electric Company, Fort Wayne, 


A \IPIRICAL COMPARISON OF THE RELATIVE 
FECTS OF RATER RESPONSE BIASES ON 
THREE RATING SCALE FORMATS 


THOMAS D. HOLLMANN * 
Wayne State University 


Kendall type behaviorally anchored scales for derived performance 
ons (Format 1), scales for the same dimensions but without the 
al anchors (Format 2), and scales for dimensions selected on an 
i basis (Format 3) were compared on the basis of susceptibility to 
sponse biases. Leniency error and composite halo error were present 
itings; there was no evidence of relative or absolute halo errors in any 
There was some evidence that the use of scales for derived dimensions 
l leniency error and increased the amount of variance attributable to 
differences. The scale reliabilities of the three formats were also deter- 
A discussion of the feasibility of obtaining relatively independent 
for several job performance dimensions was included. 


The rater main effect is indicative of leni- 
ency error (Guilford, 1954). Leniency, a mis- 
nomer for this response bias, is analogous to 
what Cronbach and Gleser (1953) refer to as 
an elevation when discussing profile compari- 
sons. This error occurs when different raters 
use the scales differently in an overall way. 
Some raters consistently give relatively high) 
ratings, while others consistently give rela- 4 
tively low ratings. These differences are, ſor 
the most part, consistent across scales as well 
as across ratees. While the intrarater differ- 
ences may be valid, they are meaningless — 
without assumptions and corrections for com- d 
parisons of persons rated by different raters. 

The ratee main effect is composed of true 
ratee differences across scales and an overall 
bias which Guilford (1954) calls absolute 
halo” and Blumberg, DeSoto, and Kuethe 
(1966) call “consensual halo.” The “absolute” - 
or “consensual” halo bias occurs whenever 
raters erroneously, and en masse, rate certain 
ratees higher and others lower. There is no 
way to statistically separate the valid and 
biased components. In a situation where the 
raters are familiar with the ratees’ relevant 
behaviors, however, rater agreement is more 
likely to represent convergent validity rather 
than consensual halo error. 5 

A Rater & Ratee interaction is evidence for 
what Guilford (1954) calls “relative halo” 
and Blumberg et al. (1966) call individual 
halo.” This error occurs when some ratees are 
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given high ratings across categories by some 
raters and low ratings across categories by 
other raters, but there is no agreement or 
consensus among raters. 

A category main effect represents 4 com- 
posite of true category differences for the 
ratees and an overall rater bias for rating 
some categories high and others low. The 
latter component has the effect of restricting 
the range of at least some of the scales and 
thereby reducing the scales’ reliabilities and 
potential utility for correlational purposes. 

The Rater X Category interaction describes 
variance due to raters’ idiosyncratic tenden- 
cies to use category scales at different levels. 
This has never been shown to be a serious 
problem for the utility of rating scales. 

The Ratee X Category interaction repre- 
sents the extent to which the raters, as a 
group, make ratee-category discriminations. 
This source of variance is large when each 
ratee's category profile differs from the other 
ratees' profiles and when each ratee’s cate- 
gory profile is similar across raters. This 
Source of variance should be high since the 
agreement among raters adds convergent va- 
_ lidity to the ratings and the differential pro- 
_ files of ratees yield maximum information 
about their relative strengths and weaknesses. 
Ihe three-way interaction is, of course, 
error variance, in the analysis and in concept. 
I The desirable sources of variance, then, are 
the ratee main effect and the Ratee X Cate- 
gory interaction. They provide information 
as to how the ratees differ, both quantita- 
_ tively and qualitatively, and this is the pur- 

pose of all rating scale procedures. Any other 
sources of variance contribute response style 
errors of one type or another. 

The behaviorally anchored rating scales 
developed by Smith and Kendall (1963) have 
been proclaimed as a major stride toward the 
improvement of rating scales. They were re- 
portedly acceptable to raters, conscientiously 
used, and it has been felt that the method- 
ology involved in the development of the 
scales reduces the danger of unwanted error 
variance attributable to rater response styles. 

The reliabilities of the scales have been 
determined by examining interrater agreement 
on the assignment of behavioral anchors to 
scale positions of categories. High-scale reli- 


Roßkkr F. BurnaskA AnD THOMAS D. HOLLMAN 


abilities have been re; Smith and 
Kendall (1963) and Fol: n, and Blood 
(1971), but only mod bilities have 
been reported by Zedec! iker (1971), 
Moderate interrater rí have been 
reported (Hakel, 1971 & Guion, 
1970; Maas, 1965) wh: mith- Kendall 
scales have been used t dividtals. In 
addition, several inv (Dunnette, 
1970; Hakel, 1971; La Guion, 1970: 
Zedeck & Baker, 1971 found the 
Smith-Kendall scales to | tible to halo 
error, and Zedeck and (1971) found 
that they yielded little for construct 
validity. Maas (1965), found that 
interrater agreement on ied-expectation 4 
rating scales was signif superior to 
agreement on adjective y scales, and 
Campbell, Dunnette, Ar nd Hellervik : 
(1973) found that bel: lly anchored - 


iance, less halo 


scales yield less method 
than do Likert- 


error, and less leniency e 
summated rating scales. 

To date, no systematx 
has been conducted by ev: 
variance to determine if 
chored scales are both 
rater response biases and 
more conventional ratin 
fills the void by compar 
rating scale formats to < 
the three yields the most le ratings W! 
the fewest rater errors. The three formats 
are (a) the Smith-Kendall type, pehaviorally 
anchored scales (Format 1), (b) scales for 
the same dimensions as Format 1 but with 
adjective rather than behavioral anchors 
(Format 2), and (c) a set of scales measur 
performance categories selected on an a pro 


parative tes 
g sources of 
vehaviorally an- 
susceptible to 
re reliable than 
s. This study 
> three types of 
mine which of 


e shortage of 
les is that? 


is difficult to find situations in which a largë 
number of potential raters have had an & 
quate opportunity to observe t 
a group of ratees, since it is 
which constitutes sample size. © ity 0 
by necessity, compromises the desitabi 0 
a large number of raters for the suitab! ail 
the overall situation and, therefore, ‘ual 
of the results are suggestive rather 
conclusive. 


EFFEC 


IETHOD 
Constructing ! ales 


cales for Format 1, a slight 
smith-Kendall procedure was 


To construct 
modification oí 
followed. 

First, 33 und 
university were 
(Flanagan, 194° 
very ineffectiv: 
played by coll 
they had enco: 
Bach student 
critical incident 
behaviors) and 
tlssroom-teach 
dent reported, 


antes at a large Midwestern 

to provide critical incidents 
trative of very effective and 
voom-teaching behaviors dis- 
tructors and professors whom 
| during their college careers. 
ked to provide three positive 

effective classroom-teaching 
negative ones (very ineffective 
haviors). For each critical inci- 
n its were asked to name and 
define the beh category of classroom-teaching 
behavior of wi he critical incident was an €x- 
ample; for ex: if the ineffective critical incident 
pertained to an ructor failing to compensate for 
bis foreign acci he behavioral category might be 


“verbal prese Behavioral categories of 
dassroom-teacl i behaviors were also obtained 
independently critical incidents from another 
group of 15 un iduates. 

For each crit incident reported, a single sum- 


s derived to reflect the essence 
vior for that incident. Terminol- 
as originally reported were re- 
mary statements composed the 
l statements from which scale 


mary statemen 
of the central | 
ogy and meani 
tained. These 
pool of beh 
anchors were 
Next, a grou 


sisting of 17 undergraduates, 10 
students, and 5 psychology pro- 
inci ited with the 135 derived critical 
incident sum statements and the 13 behavioral 
Categories. A bricf definition of each behavioral cate- 
— was provided. This group was asked (a) to 
5 each of the summary statements to the behav- 
state category which they thought the summary 
9- 1 was an exemplar, and (b) to rate, on a 
Shae: horizontal scale, the teaching effectiveness 
inci e behaviors represented by each of the critical 
cident summary statements. 
8 were tallied, and for each behavioral 
55 . for which there was high agreement (65% 
Sy of the respondents) as to category place- 
ing efi e mean and standard deviation of the teach- 
eit ectiveness ratings were computed. Those criti- 
ee summary statements within each behav- 
E se that had differing means and small 
uc deviations were then chosen for use as 
those ts for the scale values of Format 1. Choosing 
assur peas with a wide dispersion of means 
levels f coverage of different scale aonan 
eviati or each behavioral category; small standar 
level SE assured agreement on the effectivencss 
standard each statement chosen. The means An 
lor care deviations of the chosen behavioral anchors 
Enco: ch category were similar, and no problem was 
untered in obtaining anchors for the midpoints 
e scales, 
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Each of the critical incident summary si : 
was then reworded into a behavioral expectation. 
statement. For example, if the critical incident sw 
mary statement was “The instructor lectured © 
from the text,” the behavioral expectation stateme! 
would be “could be expected to lecture only f 
the text.” The behavioral expectation statements 
were then applied as anchors to the appro 
scale positions of a 9-point vertical scale for 
behavioral category to which they pertained. 
behavioral categories used were those that had 
tively large numbers of behavioral expectation 
ments. If few behavioral expectation statem 
were assigned to a given behavioral category, 
category was considered to be either unimportant 0 
nonratable. The resultant behavioral categories 
Format 1 were “knowledge of material,” “rappo 
with the students,” “verbal presentation,” “social 
consciousness,” and “student evaluations.” 2 

Format 2 consisted of 9. point vertical scales for 
the same behavioral categories as used in Format iy 
but the brief definitions of the bebavioral categories 
and the scales’ behavioral expectation anchors were 
omitted. These were omitted to determine the extent 
to which such anchors produce differences in the 
susceptibility of Formats 1 and 2 to rater response | 
biases. The two extremes and midpoint of these 
scales were anchored by “very effective,” “very 
ineffective,” and “average effectivencss,” respectively. 

Format 3 consisted of behavioral categories ob- 
tained from a teaching effectiveness rating ſorm 
published by Stoops and Rafferty (1961). The be- 
havioral categories Were “knowledge of subject," 
“understanding pupils,” “fairness in marking.“ 
“interest in pupils, and “human qualities.” The 
number of behavioral categories was the same as 
used in Formats 1 and 2, a 9-point vertical scale 
was again used for each behavioral category, and X 
the extremes and midpoint of the scales were an- 
chored by “very effective,” “very ineffective,” and 
“average effectiveness,” respectively. This a priori i 
format is representative of many graphic rating scales 
currently in use, and was used in the present study x 
so that the ratings, rater errors, and reliability of 2 
this format could be compared with those produced 4 
by the other two systematically derived formats. 


Administering the Scales 4 
The raters were first-year graduate students of 
psychology- The ratees were three associate profes- 


sors of psychology 
cession during their first year 
The rating was conducted in 
on the last class day of the third quai 

lar academic year. 3 
Vaters were AN assigned to use one of the 
three rating formats. Because of unforeseen circum- 
stances (e.g; illnesses, dropouts, etc.) that prevented 
equal number of raters for each format, 12 rates 


used Format 1, nine 
Format 3. The order o 
was counterbalanced for the formats, ang 
were made independently. Confidentiality © 
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TABLE 1 
Two-Way ANALYSES OF VARIANCE 
Format 1 Format 2 t3 
Source of variance — le ae — 
F Estimates F Estimates Estimates 
w? w? at 
Replications across ratees 
Rater 4.09** 07 3.5455 e 40 
Category 13.49% ll 22.79** 23 14.1 19 
Rater X Category 38 00 39 00 71 00 
Replications across categories 
Rater 1.61 0⁴ 1.20 3 17 
Ratee 26.02** 27 19.30% 27 183.2 15 
Rater X Ratee 1.62 04 1.51 04 | 1.27 02 
Replications across raters 
Ratee 29.92** 28 19.87 27 22 JS 
Category 6.10* 10 8.32 21 19 
Ratee X Category 1.42 .01 2.04** -03 | 7 00 
*p <.05 
* <.01 
ratings and the individual raters was maintained at of variance for each format shown in 


all times. 

For the purpose of determining the effects that the 
behavioral expectation anchors and brief definitions 
of the behavioral categories had on the ratings, 
those who made initial ratings with Format 2 were 
contacted three weeks later and asked to rate the 
professors again, using Format 1. This was done to 
determine whether the presence or absence of the 
behavioral expectation anchors and brief definitions 
of the behavioral categories would effect the sus- 
ceptibility of these two formats to rater response 
biases. 

The reliability of the scales of each format was 
estimated by following the analysis of variance tech- 
niques set forth by Lindquist (1953). By ignoring 
differences between raters, the test-retest reliability 
coefficient of the mean rating of each format was 
obtained. 


RESULTS 


Following the method of analysis set forth 
by Guilford (1954), the ratings of the three 
formats were analyzed by treating them as 
replications across ratees, categories, and 
raters, respectively, and thus obtaining three 
two-way analysis of variance factorial designs 
for each format in which each source of vari- 
ance is treated as a random factor. Following 
this procedure, the three two-way analyses 


Table 1. 

For Format 1, the significent sources of 
variance are the category efi ad the ratee 
effect, plus the rater effect n collapsing 
across ratees. Estimates of a-squared 
(Hays, 1963) reveal that the ate effect 1s 
the largest source of variance, accounting for 
approximately 27% of the total. The fact 
that the rater effect is significant in one situ- 
ation and nonsignificant in the other 18 in- 
consequential since this effect accounts for a 
relatively small proportion of variance. 

The significant sources of variance for 
Format 2 are the three main effects; ttt, 
ratee, and category, plus the Ratee X Cate- 
gory interaction. The category and ratee er 
fects are the largest, each accounting for 
more than 20% of the total variance. 
interaction effect is relatively unimportāth 
accounting for only 3% of the variance. : 

The three main effects are all significan 
for Format 3; there are no significant p 
actions. In each case, these significant sourt 
account for just less than 20% of the to 
variance. 


Ey oF RATER RESPONSE BIASES ON THREE RATING SCALE FORMATS 


By treatir h main effect as a variance, 
or a measu the extent to which the 
means for t irious levels of the factor 
vary around and mean of the factor, one 
can test th vificance of the differences 
between the effects of the formats by 
homogeniet vriance F tests. While none 
of these te as significant, yielding no 


t for discussing differential 
ny main effect in the three 
ra-squared values of these 
e suggest that there are im- 


statistical sı 
contribution 
formats, the 
sources of v 


portant difi between the formats which 
do not en as statistically significant 
solely beca the relatively small sample 
sizes. The c ice suggests that using For- 
mats 1 and Juces the variance attributa- 
ble to the r ffect and increases the vari- 
ance attrib = to the ratee effect, and that 


also reduces the variance 
e category effect. 


using Form 
attributable 


The relia’; of the scales of Format 1 is 
78, the reli y of the scales of Format 2 
is .81, and reliability of the scales of 
Format 3 i Unfortunately, because of 
the small ni) er of raters for each format, 
the standard . rors of the reliability coeffi- 
cients are sc ige that it is relatively im- 
Possible to rcaningfully demonstrate signifi- 
cant differences between the reliabilities of 
the scales. 

Correlations between similarly named 


Seales for Formats 1 and 2 were obtained to 
yield additional evidence of the effects that 
the presence or absence of the behavioral 
_ “pectation anchors and brief category defi- 
hitions had on the ratings of these formats. 
The overall correlation between the two for- 
mats is .61 (p < .01). The Pearson product- 
moment correlation coefficients between the 
category pairs, which are all significantly 
diferent from zero at alpha = 01; are: 
Knowledge of Material, +.74; Rapport with 

© Students, +.62; Verbal Presentation, 
e den Consciousness, +.54; and Stu- 

valuations, . 54. 


Discussion 


The ratings from all three formats con- 


tai 8 
a leniency error and composite halo er- 
none contain absolute or relative halo 
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error. While there is some indication that 
the systematically developed scales reduce 
the amount of leniency error and increase the 
amount of variance attributable to ratee dif- 
ferences, there is no evidence for the superi- 
ority of any one format. 

On the basis of the omega-squared esti- 
mates, both Format 1 and Format 2 reduce 
leniency error, suggesting that the use of the 
scales which emerge from the methodology 
employed are more meaningful to raters than 
are the scales selected on an a priori basis 
and, therefore, that rater differences are 
reduced. It is very likely, in fact, that this 
reduction in the rater component accounts for 
the increase in the ratee component. As scales 
become more meaningful to raters and more 
related to observable behaviors rather than 
interpretations of behaviors, the behavioral 
differences between ratees account for more 
variance in the ratings, and the interpretative 
differences among the raters account for less. 

The one distressing finding is the over- 
whelming presence of composite halo error. 
It is inconsequential that the Ratee X Cate- 
gory interaction, the absence of which indi- 
cates the presence of this error, is significant 
for Format 2. This effect accounts for only 
3% of the variance and can hardly be con- 
sidered meaningful. The ubiquity of com- 
posite halo both in the present study and in 
other research (e.g., Dunnette, 1970; Landy 
& Guion, 1970; Zedeck & Baker, 1971) is 
alarming. The purpose of obtaining perform- 
ance ratings on several dimensions is to 
gather information concerning individuals’ 
patterns of strengths and weaknesses. To the 
extent that the several scale ratings are cor- 
related, another indicant of composite halo, 
this objective of using the scales is not met. 

Maybe it is time to raise the question, How 
realistic is the objective of obtaining rela- 
tively independent scales for several dimen- 
sions of job performance? It is impossible to 
escape the fact that each individual scale of 
almost any set of rating scales is influenced 
by some type of pervasive element. Halo er- 
ror, most would agree, is the result of a global 
evaluation biasing the ratings on each per- 
formance category in the same direction, the 
result of which is inflated scale intercorrela- 
tions. A major proposed benefit of the Smith- 


ee 
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Kendall behaviorally based scales is that they 
focus the raters’ attention on behaviors rele- 
vant to each performance dimension and away 
from a global evaluation. However, by in- 
creasing the focus on behavior, the global eval- 
uation may be replaced by something just as 
global: a pervasive motivational component. 
If behavior is a function of a multiplicative 
_ relationship between undifferentiated drive and 
Specific habit strengths for a given stimulus 
situation, then drive, or motivation, will have 
~~ constant effect on all behaviors at a given 
me. If work motivation is relatively con- 
a stant over time, we should expect a correlation 
between behaviors on a variety of perform- 
ance dimensions, Perhaps efforts to measure 
absolute ability levels independently of actual 
performance, plus a separate assessment of 
the pervasive motivational component, will 
prove a more fruitful source of usable infor- 
tion. Perhaps a pervasive motivational 
dimension, like Spearman’s “g,” is a fact of 
life and has to be tolerated. 
The test-retest reliability reported in this 
study for the Smith-Kendall scales is some- 
what larger than previously reported (Hakel, 
1971; Landy & Guion, 1970; Maas, 1965) 
interrater reliabilities for these scales, Al- 
though the reliabilities of the three formats 
could not be meaningfully compared for sig- 
nificant differences, perhaps a repetition of 
the present design with larger sample sizes 
will provide more conclusive results. 
Thus we see again that format modifica- 
tions and improvements in scale development 
techniques are not the final remedy to the 
criterion problem. Other approaches to im- 
prove ratings, such as rater training, may 
prove to be equally or more fruitful alterna- 
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‘NITUDE ESTIMATIONS OF EXPRESSIONS 
OF FREQUENCY AND AMOUNT’ 
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ide estimation was employed to fied the numerical equivalents of 
i’ of frequency ranging from sever to Swan, and 44 exprenions 

inging from sone to all, The remalts were generalieable ecru 
ipation levels and unaffected by topic importance, 
‘opriate variance measures are provided for each 
stions for anchors for 4- to -point sakes. The percentage 
nents for adjacent points on ales are allo gives, Rewis 
+ work on scaling, and the utility of present approach i 


1111 
elites 


In appli wlogy, we make heavy use Thurstone (1927) in 
of judgme: equency and judgments of ing 
amount. T) : rater judges how much or Quantitative values were assigned to 
bow often : wioral statements apply to behavior, attitudinal statements, 


1 


the attitud: tionnaire judges how much of equal intervals with an abh 
or how fre . the statements reflect his For purposes 

feelings; u nality assessee judges how this procedure made possible the 
Much or h cn the statements are true of 

about hims: vertheless a limited number the goodness of performance of 
of adverbi) oc adjectival modifiers are cupant. These median weights of 
available for essing the range of extensiv- performance were relatively invariant 


always) and mount (none to all). establish the weights (Ubrbrock, 


or experience un be described, such as their Coons (1930) 5 
foxdness, strength, and potency (Osgood, quency or ex Leader Behavior 
Suci, & Tannenbaum, 1957) in terms of fre- velopment of the 


quency and amount. Thus, a scale evaluating tion Questionnaire (LBDQ). A 
“goodness” can be couched in terms of fre- of staff members completed paired 


currence suc “fai ae * 4 
Er ce such as “fairly good” or “very adverb was noted. In any 


relation between stimulus and sensation, judged greater of 


ge research was supported by the Personnel 

Training Research Programs, Psychological with other ad 
to investigate alternatives that make work 

s Requests for reprints should be sent to Bernard signiñed by ea 


a 15 Rochester, Rochester, New York 147 
Now at Florida International University, Miami, 
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the employ s rating; the respondent to or experiences in the hope of achieving scales 


ity of such nents in frequency (never to particular sample of judges employed 
Many othe: aspects of behavior, sensation, At about the same time, K 


oa. adverbs the 
Qvencies of occurrence such as “usually good” sons among thee Er requency 
ör “rarely good” or in terms of amounts of terion of how much each : ench 


Following Fechner’s logarithmic law of the adverbs, . he's 1 Bak much it 
— expressed frequency or extent. “Never” was 


œs Division, Office of Naval Research. It was Some years earlier, Simpson (1944) had 
in connection with the development of scales asked 335 high school and college students tom 
Pre iv i te the “number of times in a was 
cet o 28118 ch of 20 adverbs and adverb 
~ Bass, Graduate School of Management, Univer- phrases. They were asked to give a range ol 
Pimat for each answer. Hakel (1958) re. 
peated Simpson’s work with 100 Minnesota — 
He noted a high degree of agree- 


— -- p? 
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ment between the medians of his samples and 
those of Simpson as well as large individual 
differences in response to many of the ad- 
verbs. Nevertheless, such individual differ- 
ences varied considerably from one adverb to 
the next. 

Sheppard (1954) determined for 78 British 
subjects the mean distance on a 7-inch line 
represented by the phrases and adverbs, very 
bad, bad, rather bad, and so forth, But much 
more comprehensively, Cliff (1959) looked at 
adverbs such as slightly, somewhat, rather, 
3 very, etc., as modifiers of a variety of adjec- 

tives such as evil, immoral, nice, lovable, 
2 etc. Reliability of paired comparison judg- 
ments was found to be .999. The multiplying 
Falues of “intensity” of each adverb for three 
college student samples from Wayne, Prince- 
ton, and Dartmouth universities were ob- 
tained. The high degree of stability across the 
three samples was striking evidence of the 
relative invariance of such adverb modifiers. 
Cliff's success with a multiplicative model 
for adverbs of intensity argues strongly for 
both the theoretical as well as practical utility 
of a multiplicative rather than an additive 
approach to adverbs as modifiers of frequency 
and amount (Hakel, 1958; Hemphill & 
Coons, 1950; Sheppard, 1954; Simpson, 
1944). As a consequence, magnitude estima- 
tions using Stevens’s (1966) procedures 
seemed to be a more accurate way of stan- 
duardizing expressions of frequency and 
amount, a way which better fitted the true 
nature of relations between stimulus and sub- 
jective experience. Such standardization, if 
sufficiently invariant for pools of different 
individuals, could then be used by investi- 
_ gators to build any kind of behavioral rating 
scale. Objective quantities of frequency and 
amount could be readily associated with sub- 
jective modifiers according to Stevens's 
(41971) law that equal stimulus ratios pro- 
duce equal sensations. 


_ 5 The claims for the superior validity and power 
of magnitude estimation scaling have been ques- 
tioned due to the fact that if judges are asked -to 
se equate differences rather than make ratio judg- 

ments, a logarithmic transformation of magnitude 
; estimation scales emerges. Intuitively, we counter 

that the evidence is less likely to hold for psycho- 
. logical judgments such as word meanings, compared 
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Bass (1968) initiated effort for a 
questionnaire survey. Th: y scale he 
used was based on a n estimaties 
study of 28 adverbs of fr: Each of 71 
undergraduate students asked to 
assign a number of hi hoosing te 
“sometimes,” then to in at number 
would best fit each of th r adverbs. 
On the average, if “som was set al 
1,000, “always” was seen 3 times al 
frequent; “very often” w 3 times af 
frequent; “fairly often“ w 3 times af 
frequent, “seldom” was es as fre 
quent and “never” was hus, these 
particular 6 adverbs of t died bon 
an approximate relation other of 
5:4:3:2:1:0 and were sel as the re 
sponse alternatives for a qt Maire. 

The purpose of the pre investigation 
was to obtain reliably esta | geometric 
means and standard deviat į magnitud 
estimations of reasonably < ive lists 0 
expressions of frequency an int used d 
modify attitudinal ratings. e set out të 
see if such results were ins: e to rating 
of important, as opposed nimportan 
issues for the raters. It v ypothesi 
that judges generate one set tios of el 
tensivity when rating issues onsequence 
to them and a different set hen judging 
issues of little importance to them. Only # 
the hypothesis was rejected would a Singe 
standardized list be possible. 

METHOD L 

The method employed was tha! of ratio scaling: 
The distinguishing characteristic oi a ratte scale 
that a true or absolute zero point can be defined 

derived. Be 


meaningful ratios can therefore be i 
scaling uses magnitude estimation“ (Stevens, 


weights. Moree 


to physical stimuli such as brass 
Xz) and (*. 


comparing two differences (Xi — 8 
involves judging four things, While magnitude 0 
mation reduces the problem to assigning NU * 
against a standard anchoring number. 

6 The equal interval status of magnitude 
has been questioned and we have to agree that 1 
nitude estimation and partition scaling Si" 1 
cordant results on prothetic intensity continua. 
ever, on metathetic qualitative dimensions 
approaches give results that are in good agree = 
The question then seems to be: “Do our scales 
frequency and amount represent prothetic or 1 


tathetic continua?” If the latter, no problem eee 


. 88 


gies or intensity dimensions the answer is less 
on Stevens (1971) still argues that the magnitude 
has equal intervals while others point out 

— lack of linearity with partition scales 
the inequality of its intervals, We cannot 
this iscue since it’s not clear how one draws 

ae line between prothetic and meta 
bum ut, The data we have collected is not in a 
ki which lends itself to giving concrete testimony 
© where our scales of frequency and amount 
ae this line. Furthermore, even if they are 
i etic dimensions, it’s still not clear how to 
ve Stevens disagreement with others over 


Which technique leads to equal interval scales. 


Mac: £ ESTIMATIONS OF EXPRESSIONS OF FREQUENCY AND AMOUNT a 
piti) in which iue is assigned to a referent a value of $0 to “sometines,” you might amiga 
asipi, and U other stimuli are judged in the value 100 to any otber womi or pòrre 
sitios to the concept. For example, if an which you feit represested teten the freqerecy 
ee atta e value 30 to the word of “sometimes” and a valor of $ to any word 

"h isign 15 to the word “sel- or phrase that semed to repremmt only 1/10 
dm” If be felt dom represented one half as the frequency of “sometimes.” 
mah as some kewise he might assign the You may we any whole or decimal member 
vabe 300 to vd “always” if be felt that greater than or equal to mero (0), just as long 
abways expres imes as much as the word as you feel it represents the sumerical vakse of 
sometimes. a word or phrase when that word or phrase 

is compared with “sometimes.” 
pane Instructions used the referent “sosse” insted 
The subject 75 males and females drawn eee ons ree 
from three po night school master of busi- made instead of judgments of frequency. The re- 
pes administ: IBA) students (most of whom ferent concepts “sometimes” (frequency and 
were working undergraduate students, and “some” (amount) remained constant throughout the 
hgh school j experiment. 
Desi The 48 other expressions of frequency and 4% 
gn expressions of amount were presented 1 — of five 
The subje: 1 30 expressions of frequency  ‘ifferent orders 20 28 tog 2 
aad 44 expr: i amount. Instructions to each she subjects were randomly assigned to ose of 
subject read vs: two conditions: 3 
People í y use words such as some- Condition 1: expressions of frequency with 
times, rare! om and never. Assume some- important topic 
one has ju ! to you, “I sometimes think expressions of amount with 
the Viet > r should continue until we unimportant topic 
win.” Condition 2: expressions of amount with 
On the a | form, please assign a number important topic 
to what yo. .onceive “sometimes” to mean. expressions of frequency’ with 
You may number greater than or equal unimportant topic. 
to zero (( vse place the number on the 
line next to “sometimes.” To ensure agreement with what the experimenters — 
Then, usin. the number you have assigned considered topics of importance (air pollution and 
1 as a standard, please assign a the Viet Nam war) and unimportance (the amount 
number to . of the other words on the form of rainfall in Nepal, worms in the street after a 
indicating c.c: word’s value relative to some- rain storm), the subjects were asked to rate cach 
times. Again, use any number greater than or topic on a S-point Likert scale of importance from 
equal to zero (0). The number placed next to “extremely Important to me to “extremely unka- 
each word or phrase should reflect what you portant to me.” To make more meaningful experi- — 
1 word or phrase means when compared mental 8 each 3 —— 4 
“sometimes.” i i a 
eee om e LAN e batan i ttn si 
since all agree that for these dimensions the parti- importance-unimportance i Be nge 4 
tion scale represents a straight linear segment of jects who ranked topics in a = — 
the magnitude scale and that the in are portance-unimportance were deleted from the analy- 
og equal (Stevens, 1971). If, on the other ss- 
our scales of fi amount are 
f frequency and RESULTS 


and Table 2 presents the means and standard 
deviations of the 44 expressions of amount. 

In the first analysis a 2 * 3 repeated mea- 
sures multiple analysis of variance (MAN- 
OVA) was performed using the 39 expres- 
sions of frequency as dependent variables. 
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Two levels of topic importance (important 
and unimportant) were compared across 
three groups (college students, MBA students, 
and high school students). Using the un- 
transformed raw data, the multivariate F 
(1.79) representing the comparison of scale 
values assigned the 39 expressions of fre- 
quency when these were imbedded in im- 
portant and unimportant contexts, respec- 
tively, was statistically significant (df= 
78/55, p < 05). Univariate Fs comparing 
important with unimportant topics for the 


TABLE 1 


MEANS AND STANDARD DEVIATIONS 
or EXPRESSIONS OF FREQUENCY 


Expressions of frequency * SD 
Always 58.01 3.52 
Continually 50.16 3.18 
Constantly 49.70 3.31 
Frequently if not always 45.24 3.06 
Very often 42.45 3.08 
A great deal of the time 41.37 3.08 
Very frequently 40.02 3.33 
A great many times 39.28 3.09 
Usually 39.18 3.13 
Often 37.64 3.09 
Frequently 36.07 3.29 
Quite often 35.39 3.93 
Rather frequently 34.44 3.18 
Commonly 32.97 3.14 
Fairly often 32.64 3.30 
Fairly many times 30.65 3.00 
Sometimes 19.42 2.86 
Some of the time 18.01 3.01 
To some degree 15.52 2.92 
Now and then 15.19 3.04 
Occasionally 14.92 3.06 
Once in a while 10.22 2.89 
Not often 7.18 2.55 
Not very often 7.23 2.56 
Fairly infrequently 6.99 2.72 
Infrequently 6.47 2.61 
Rather seldom 6.42 2.66 
Very seldom 4.72 2.64 
Rarely 4.56 2.23 
Very infrequently 4.54 2.47 
Seldom if ever 3.69 2.42 
Hardly at all 3.47 2.38 
Hardly ever 3.34 2.23 
Very rarely 2.99 2.11 
Almost never 2.63 2.10 
Seldom 33 2.60 
None of the time 17 1.49 
Not at all 15 1:53 
Never 08 1.41 
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TABLE 
MEANS AND STANDA ONS 
OF EXPRESSIONS © 

Expressions of amount * | SD 
All ».12 | 432 
An exhaustive amount of “0.27 | 471 
Almost entirely 7.61 | 576 
Completely 7.35 590 
An extraordinary amount of 4.46 | 438 
Almost completely 1.38 | 471 
An extremely abundant amount 18.80 | 4.52 
An extreme amount of 8.20 | 3.59 
A great amount of ‘1.56 | 2.74 
A great deal of 1.36 | 2.83 
Very much 9.59 | 2.94 
A full amount of 9.50 | 3.79 
A lot of 7.10 | 2.77 
Much 35.14 | 2.64 
Quite a bit of 4.24 | 271 
A good bit of 32.65 | 3.13 
A considerable amount of 31.44 | 2.90 
Pretty much 30.04 | 2.87 
Fairly much 27.70 | 2.42 
An ample amount of 26.22 | 2.81 
An adequate amount of 14.07 | 2.60 
A moderate amount of 21.80 | 3.42 
Some 18.63 | 2.92 
To some extent 13.42 | 2,95 
To some degree 13.10 | 2.72 
Somewhat 11.75 | 3.03 
A limited amount of 9.57 | 285 
A little 7.81 | 250 
A small amount of 7.51 2.52 
Comparatively little 7.22 2.11 
A little bit of 7.20 | 267 
Not much 7.02 | 2.70 
A small degree of | 5.27 | 2.50 
Very little | 5.21 | 2.48 
A slight amount of 5.09 2.58 
A meager amount of | 4,28 | 2.66 
A scanty amount of j 3.68 | 24 
A minimum amount of | 3.64 | 271 
A trifling amount of | 3.13 2.59 
Scarcely any | 2.98 2.20 
A trivial amount of 2.85 | 2.60 
An insignificant amount of 2.48 2.11 
Hardly any 2.28 2.20 
None 15 1.65 
words “rather seldom” (4.19), “frequently, 


if not always” (3.97), “very seldom” ce 
and “a great many times” (4.57) were @ 
statistically significant (df = 1/ 132, 
.05).7 
for 

1 Calculation of multivariate degrees of i of 
repeated measures MANOVA: Let K = 1 
measures on each subject; M, = total number Of 5. 


5 ` jates/» 
jects; P = number of concomitant vectors (covyari 
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re then transformed in 
ested by Stevens, 1966, 
e different subjects might 
nt anchor values for the 


The raw d 
two ways (as 
1971). First, t 
have chosen d 


referent conce e.g., one subject might 
assign the val to “sometimes”, while to 
another “som: might mean 100), the 
data were st lly transformed so that 
tach person’s or value was the same, 


this, each subject’s origi- 

ıs divided into 50, and all 
multiplied by the result. 
rmation involved the use 
raw data. 


namely, 50. 1 
nal anchor va! 
other values 

The second t: 
of logarithms < 


Both trans! tions yielded identical re- 
sults. Neither multivariate Fs nor any 
of the univar Fs were significant. The 


ta were then analyzed by 
there were no significant 
mean scale values assigned 
irequency when these ex- 
added in important as op- 
ant contexts. This conclu- 
e for the MBA students, 
and high school students 
ur sample. 
sis considered the effect of 
ind group membership with 
expressions of amount in a 
2% 3 repeated measures MANOVA. In con- 
trast to the uency analysis, neither the 
multivariate Y nor any of the univariate Fs 
Was significant in the analysis of the raw 
data. The analysis performed on the loga- 
t rithms of the raw data revealed that neither 
the multivariate Z of .84 (df = 88/41) nor 
k of the univariate Fs was statistically sig- 
A In addition, there were no signifi- 
15 differences when the data were trans- 
med so that each subject had the same 
anchor value. 


log-transforme 
subsample. A 
differences in 
to expression 
pressions were 
posed to unin 
sion was the 
college stude: 
that constitute 
A second an 
topic importance 
Tegard to the 


T DISCUSSION 
10 he main hypothesis of the study, that 
ges assign different scale values to ex- 


and let 7 7 z 3 
Partialed fee portion of experimental space being 


Multivariate ap nok- Pi 
55 


For ` 
example, in the frequency analysis: 
S K SA 39x2 _ 8 
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pressions of frequency and amount as a 
function of topic importance, was generally 
not supported. Analysis of the raw data of 
the frequency expressions, however, did yield 
significant differences as a function of topic 
importance. This result is easily explained, 
however, because different subjects might 
have chosen different anchor values for the 
referent concepts. When either of the appro- 
priate transforms was made (Stevens, 1966, 
1971), namely, equal anchor values for all 
subjects or the use of logarithms of the raw 
data, these apparent differences disappeared. 

Although this hypothesis was rejected in 
this particular experimental instance, it 
should be pointed out that acceptance of the 
null hypothesis (i.e., no significant differences 
in scale values assigned expressions of fre- 
quency and amount as a function of topic 
importance) does not necessarily prove it to 
be so. The present results do not provide a 
sufficient condition for concluding that a 
single list, though possible, is equally applica- 
ble for important and unimportant issues. 
However, the onus is on the skeptic to un- 
earth evidence to reject the null hypothesis 
and to state a more plausible one. Neverthe- 
less, researchers should be aware of these 
cautions when using the data from Tables 1 
and 2. 

Tables 3 and 4 present, respectively, 4- 
point through 9-point scales of expressions of 
frequency and amount as well as the per- 
centage overlap in distribution between ad- 
jacent scale points. The expressions chosen 
to represent the various points on each scale 
are statistically optimal in the sense that 
their mean scale values came closest to the 
exact mathematical values necessary to es- 
tablish a scale of any given ratio. Also the 
observed variances about their respective 
means are small. The means and standard 
deviations of all the expressions have been 
presented in Tables 1 and 2 in the hope that 
the complete lists can more adequately serve 
particular needs of individual investigators. 

Tilton’s (1937) overlap statistic, 0, was 
computed in order to provide an index of 
the amount of separation present between 
adjacent scale points. It is clear from an €x- 
amination of Tables 3 and 4 that as scale 
fineness increases (i.e., as the number of scale 
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TABLE 3 
STATISTICALLY OPTIMAL SCALES OF FREQUENCY (WITH PERCENTA‘ 
OF OVERLAP BETWEEN SCALE POINTS) b 
No. points in scale 
9 8 7 6 5 4 
8 Always 7 Always 6 Always 5 Always 4 Always ways 
24% 24% 24% 5% 2% <1% 
7 Continually 6 Continually 5 Constantly 4 Frequently, if | 3 Very often ten 
21% 21% 4% not always 12% <1% 
8% 
70 
6 Very often 5 Very often 4 Often 3 Quite often 2 Fairly mar ometimes 
24% 13% 25% 1% times <1% 
<i% 
5 Quite often 4 Rather fre- 3 Fairly many 2 Sometimes 1 Occasional Never 
42% quently times 10% <1% 
2.5% 6% 
4 Fairly many 3 Sometimes 2 Sometimes 1 Once ina while | 0 Never 
times 45% 10% 2% 
6% 
3 Sometimes 2 Now and then | 1 Once in a while | 0 Never 
45% 16% 2% 
2 Occasionally 1 Not often 0 Never 
16% 1% 
1 Not very often |0 Never 
1% 
0 Never 
Note. Each percentage shown represents the overlap in distribution between the accompany en point and the 
scale point directly below that entry, For example, there is 24% overlap in response distributions betw sponse at Pov 


and response at Point 8 in the 9-point scale. 


points increases) so does the percentage of 
overlap between the distribution of adjacent 
scale points. As the scale becomes more coarse 
(fewer scale points) there is correspondingly 
less overlap between distributions. 


CONCLUSION 


We have demonstrated, at least, for three 
samples of judges at different age, educa- 
tional, and occupational levels, and for im- 
portant and unimportant issues, that it is 
possible to fix quantitative meanings that are 
associated with verbal judgments of fre- 
quency and amount. Furthermore, we have 
specified expressions of frequency and amount 
that bear integer relations with each other 
for 4-point to 9-point scales. Also, we have 
been able to specify the percentage of over- 


lap of judgments between adjacent points om 
a given scale. 

What we provide in this study are à rela- 
tively stable set of positive numbers, begu 
ning at zero, for summarizing the increase 
associated with modifying expressions of fre- 
quency and amount. ‘The results shoul 
useful to scale developers. Scales using be 
expressions listed should be more compare a 
quantitatively. If the judgment “someting 
is set at 19, the judgment “always” 
reliable numerical equivalent of 38 ue e 
ple’s minds. Many summary operations, 0 


5 : un 

ously, are easier to perform using such i 1 

bers, as opposed to the words thems 10 
in a Ia 


particularly if the numbers are 
scale with an absolute zero. 


i int scale 
Another interesting use of the point 
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: TABLE 4 
STATISTICALLY OPTIMAL SCALES OF AMOUNT (WITH PERCENTAGE 
OF OVERLAP BETWEEN SCALE Pornts) 
No. points in scale 
9 8 7 6 | 5 | 4 
8 All All 6 All 5 All 4All 3 All 
44% 39% 18% 10% 2% <1% 
TAnexhaustiv: \lmost 5 An extraordi- |4Almostcom- | 3Anextreme 2 A great amount 
amount of ntirely nary amount pletely amount of of 
18% 31% of 16% 3% <1% 
1% 
6Anextreme \n extreme 4A great 3 Very much 2 Quite a bit of | 1A moderate 
amount of amount of amount of 2% <1% amount of 
29% 8% 17% <1% 
5 A great dea i A lot of 3 Quite a bit of | 2 Fairly much | 1 Some 0 None 
20% 70% 50% <1% <1% 
4 Quite a bit « 3 Fairly much | 2A moderate 1 To some degree | 0 None 
5% 9% amount of <1% 
12% 
3 An adequate Some 1 Somewhat 0 None 
amount of 0% 2% 
32% 
2 Some 1 A limited 0 None 
5% amount of 
4% 
| 
1A little 0 None 
0% | 
0 None | 
7 = 3 y *a scale point and th 
Note. Each pe e shown represents the overlap in ditribution e ees Se gah on EN Rein 7 


šale point direc 
and response at 


8 for the 9-point scale. 


and their numerical equivalents would be to 
locate the verbal expression most closely 
matching a mean result that fell between two 
scale points, for example, suppose the 5-point 
scale of frequency (Table 3) had been used 
in a study of some substantive issue, and the 
ame result for a group of respondents came 
7 3.5, halfway between “always” and “very 
alten.“ Table 1 indicates that the expression 
continually” (50.16) lies just about halfway 
etween “always” (58.01) and “very often” 
(as). Therefore, “continually” would rep- 
368 the mean position of the respondents. 
anaes errors of labeling alternatives in 
Oia nt developed scales can also be lo- 
scripti For instance, the Leader Behavior De- 

ption Questionnaire (LBDQ) is a popular 


elow that entry. For example, there is 44% overla 


behavior checklist which uses a 5-point scale 
ranging from virtually always to virtually 
never, with the scale midpoint labeled “occa- 
sionally.” Note in Table 3 that “occasionally” 
is one whole interval away from the mid- 
point of the 5-point scale based on magnitude 
estimation. f 

More precision can be obtained for lin- 
guistic translations. Just how equivalent are 
the French “toujours”, Spanish “siempre” 
and English “always”? For cross-language 
contracts, treaties and agreements, the possi- 
bility arises of using the universal language 
of mathematics to locate zones of disagree- 
ment between what were purported to be the 
same statements in different languages. 

It has been generally understood that the 
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number of anchor points does not make much 
difference in most measurement properties of 
a scale; that is, whether one uses, four points 
or eight points does not affect the internal 
consistency, stability, predictive or concur- 
rent validity of the scale (Bendig, 1954; 
Komorita, 1963; Matell & Jacoby, 1971, 
1972). Our study, however, demonstrates one 
effect that has been previously overlooked. 
The percentage of overlap in adjacent judg- 
ment increases as the number of anchor points 
increases. When we move from three to nine 
points, we pay the price for the increase in 
scale fineness. The overlap between adjacent 
point judgments can run as high as 44%. 
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M! URING ORGANIZATIONAL COMMUNICATION * 


KARLENE H. ROBERTS? anp CHARLES A. O'REILLY IH 


University of California, Berkeley 


esearch on organizational communication has not included systematic 
pment of a measurement method that can be used to compare dimen- 
communication within and across organizations. This research reports 
tial attempts to develop such instrumentation. Seven samples, with a 
number of over 1,200 respondents, were used to develop a 35-item 
onnaire measuring 16 facets of communication. 


Communi has been the subject of 
considerable ntion in organizational re- 
search. Ho investigators often attach 
diverse labe seemingly similar communi- 
cation vari and similar labels to what 
appear to fferent variables (Porter & 


Roberts, in s; Roberts, O'Reilly, Bret- 


ton, & Port n press). There has been no 
systematic « pment of instrumentation to 
measure co nication variables in orga- 
nizations. “í present research begins to 
develop a dardized questionnaire that 
can be used ompare communication across 
organization: 

A review the organizational literature 
teveals that 


among the communication facets 
iy discussed and investigated 

es ity of information flow (eg, 
110 8, 19 „ sccm and distortion of 
the 0 Bead, 1962; Wilensky, 1967), 
: alities used in transmitting information 
1 Porter, & Tenenbaum, 1068), and 
OR, 1 8 5 of information (Davis, 1968; 
ia. & Roberts, in press; Rosen & Tesser, 
aay The communication literature not di- 
x 515 concerned with human behavior in 
ae suggests additional aspects of 
ge that should be given greater 
e in behavioral organizational re- 
1969), hese include overload (Porat & Haas, 
ae and satisfaction with one’s communica- 
(Lawler, Porter, & Tenenbaum, 1963; 


Wson, 1965). Desire to interact with others 
a, 


most freque 
are directio 


1 A 
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e N000314-69-A-0200-1054. 
meee for reprints should be sent to Karlene 

versity 57 School of Business Administration, Uni- 
ente e, California, Berkeley, 380 Barrows Hal, 

ey, California 94720. 


is suggested by the interpersonal literature to 
be an important progenitor of actual inter- 
action behavior (Rogers & Bhowmik, 1970). 
Finally, three noncommunication variables— 
trust in superior, perceived influence of 
superior, and mobility aspirations of the 
respondent—have been repeatedly shown to 
influence individual communication in orga- 
nizations (e.g, Cohen, 1958; O'Reilly & 
Roberts, in press; Read, 1962). 

Because communication is a process rather 
than a static variable it is one of the most 
difficult organizational variables to measure. 
Behavioral observations are often impossible 
to make and self-report forms are usually 
phrased in such a way that they take snap- 
shots of process variables. The attempt in this 
study was to develop a measurement device 
that allows respondents to summarize their 
own behavior over time. Aggregation of indi- 
vidual responses should minimize many of the 
inherent individual response errors. 


METHOD 


Based on a literature review, 189 Likert-type 
items were created to assess respondent perceptions 
of the communication dimensions previously indi- 
cated. Communication purpose, content, importance, 
and speed, which are also frequently mentioned in 
the organizational literature, were considered in 
initial instrument development. They were later dis- 
carded because of the difficulty respondents had 
answering items relevant to them. Items were also 
developed to measure desire for interaction and to 
communication variables: trust in 
superior, perceived influence of the superior, and 
respondent mobility aspirations (Likert, 1967; Read, 
1962). While the number of researchers who have 
measured the latter three dimensions is extensive, 
only Read (1962) reports intercorrelations of items 
supposedly assessing the same facets. 

The 189 items in the pool were administered to 
70 graduate students, all of whom had prior work 
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TABLE 1 


Variax Factor Loapincs FOR ITEMS ASSESSING 
Trust IN SUPERIOR, PERCEIVED INFLUENCE OF 
SUPERIOR, AND SUBORDINATE’S MOBILITY 
ASPIRATIONS FOR A SAMPLE OF 
95 Mıtırrary SUBJECTS 


Varimax factor loadings 


(n = 95) 
Item 
1 | 2 3 
1. Trust 68 .22 13 
2. Trust 76 08 Al 
3. Trust 79 2 OL 
4. Perceived influence 28 30 SI 
5. Perceived influence 14 16 87 
6. Mobility aspirations 12 90 21 
7. Mobility aspirations -23 88 .20 
Proportion of common 
factor variance 26 20 24 


experience. These students were asked to respond to 
the questions based on their experience in some past 
or current job, considering their communication be- 
havior in a typical work week. Items were retained 
that had high intercorrelations with other items 
ostensibly measuring the same facets and low inter- 
correlations with the rest of the item pool. A 60- 
item questionnaire resulted and was further tested. 
This version of the questionnaire (51 communica- 
tion and 9 noncommunication items) was adminis- 
tered to 86 mental health workers, at all job levels, 
in five health-care-delivery teams working in a large 
medical center. Their responses were subjected to 
a V-type cluster analysis to further reduce the num- 
ber of questionnaire items. This technique does not 
seek orthogonal item groups. Instead, using an ob- 
jective index of collinearity (P?) and the correla- 
tion matrix among variables, the pattern of correla- 
tions is compared, and subsets or clusters of variables 
are identified that have collinear correlation profiles. 
Each cluster of variables may be thought of as 
occupying generally the same space, that is, are 
measuring the same construct (Tryon & Bailey, 
1970). Two single communication items were in- 
cluded with the clusters (one measuring overload 
and the other measuring general satisfaction with 
communication) after it was determined that these 
questions behaved appropriately with regard to the 
rest. The question concerned with satisfaction is a 
modification of the GM faces scale (Kunin, 1955). 
When this question was related to the clusters 
it showed relationships to each cluster without load- 
ing heayily on any one, suggesting that satisfaction 
with communication in an organization is perceived 
to be partly related to (a) interaction, (b) accuracy, 
(c) directionality of information flow, etc. 
Conceptually, trust, influence, and mobility should 
be more independent of one another than are the 
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communication items. Consec 
communication items 
principal-components analysi 
ity estimates in the diagon 
multiple correlations (R*) and 
tion. Three separate factor 
were included in a test b: 
officers and enlisted men in 
unit. The results are reported 

Based on these analyses th 
reduced to 35 items which w: 
multi-item indexes and 6 sing! 
viously mentioned, questions 
load, satisfaction, and modalit 
individual items. 


were 


RESULTS 


Reliability 
The 35-item version of he“ 
administered twice to 24 
students (all with previot 
and to 42 managers for tesi 
Tn both cases the adminis 
weeks apart. The test-ret« 
the 10 indexes and 6 indi: 
sample question for each « 
ported in Table 2; 9 of the 10 indexes 
reasonable test-retest relia! tities. Ho 
the small sample sizes inn 
kept in mind. 
In addition to the mental 
the 35-item version of the 
administered to the military 
ously referenced, 54 emergency room 
patient hospital personnel, 52 managers 
various firms, 179 respondents at varou 
levels in six United Kingdom locations” 
large financial institution, 148 subjects 
six comparable United States offices © 
same organization, and 579 pe 
another military organization. 
among the means and standard de 
across the seven samples (7 = 1,218) 
that within-group variance is less 
tween-group variance. This suggests th 
feasible to discriminate among 018a 
using these communication indexes. 
Maximum, minimum, and median 
alphas (a measure of the internal cons 
of the items measuring a dimension) are 
ported in Table 3 for each of the 
samples on the 10 multi-item inde 
alphas appear relatively consistent 4 
except the last index. 


al items 
cension are 
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TABLE 2 
COMMUNICATION DIMENSIONS AND Test-Rerest Reuiasitsrins 
= = enn meet ——— ee 
‘Test-retest 
correlation 
Index Definer — 
Students | Managers 
(n = 24) | (a = 42) 
- — — 
1. Trust low free do you feel to discuss with your immediate 
superior the problems and difficulties you have in your 
job without jeopardizing your position or having it 
“held against” you later? 3 71 a 
2. Influence in general, how much do you feel that your immediate 
superior can do to further your career in this 
organization? 2 A9 6l 
J. Mobilit ilow important is it for you to progress upward in your 
present organization? 2 77 w 
4. Desire Í How desirable do you feel it is in your organization to be 
intera in contact frequently with others at the same job level? 3 46 76 
5. Directi While working, what percentage of the time do you spend 
—up in contact with superiors? 3 M ** 
6. Directi While working, what percentage of the time do you spend in 
do contact with subordinates? 3 2 B 
7. Directi While working, what percentage of the time do you spend 
lat in contact with others at the same job level? 3 82 76 
8. Accura When receiving information from the sources listed below 
(superior, subordinate, peers), how accurate would you 
| estimate it usually is? 3 2 ói 
9. Summariz:- | When transmitting information to your immediate superiors, 
tion | how often do you summarize by emphasizing aspects 
| that are important and mi imizing those aspects that 35 37 
are unimportant? y 3 
10. Gatekee; Of the total amount of information you receive at work, 78 
how much do you pass on to your immediate superior? 3 * < 
11. Overload Do you ever feel that you receive more information than k — 
you can efficiently use? 1 5 z 
12, Satisfaction | Put a check under the face that expresses how you feel 
about communication in general, including the amount 
of information you receive, contacts with your 1 60 73 
and others, the accuracy of information a eic. ? b " 
Modalities Of the total time you engage in communi while on 
the job, about what percentage of the time do you use 
the following methods: 
13. Written Written j 1 5 4 
14. Face-to-face | Face-to-face 1 30 “38 
15. Telephone Telephone i 2 8 
16. Other Other 


scored on 7 7 
9 A revision of the questionnaire including additional items and indexes is available on request. Items are on J. in 
les, except for indexes 5, 6, and 7 which use 10-point scales. } 


Validity construct, face, and a TEN attempt 
Objective criteri i Jerat convergent-discriminant validity. 

vised for prea a ae 1 com- First, evidence for the construct validity a 
munication facets discussed in this study. the 10 dimensions Is Pro ded by a 
However, difficulty in obtaining objective cri. of the cluster analyses previously | à ees 
terion measures and their potential inappro- and increases confidence that even similar ap. 
Priateness once obtained warrant considera- pearing facets such as trust and influence, 5 S 
tion of more indirect validity assessments: interaction and desire for interaction, can 
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TABLE 3 | | Í 
Mumu, Moxarum, AND MEDIAN CRONBACH ALPHAS | 


vor THE 10 Muttr-Irem Inpexes ACROSS 
SEVEN SAMPLES (# = 1,218) | 


1. Trust 
2. Influence 
‘ 3. Mobility 
4. Desire for interaction 
S. Directionality—upward 
6. Directionality—downward 
7. Directionality—lateral 
8. Accuracy 
9. Summarization 
10. Gatekeeping 


' 

_ successfully differentiated from each other. 
The intercorrelations among indexes from a 
representative sample, shown in Table 4, 
confirm the notion that communication 
dimensions are differentiable. 

5 Second, individual questionnaire items ap- 
pear to have face validity. During the item 
i development respondents indicated which 
4 questions were confusing or not differentiable 
in their minds. Thus, questions attempting to 
measure task versus social information, 
Peed of transmission, and the purpose of the 
communication were eliminated. 

Third, in the spirit of convergent - discrimi- 
nant validity, intercorrelations of nine com- 
munication and three noncommunication di- 
_ mensions with other similar and dissimilar 
measures are presented in Table 5 for the 

first military sample. The other measures 
include assessments of overall job satisfac- 
tion (Kunin, 1955), supervisor’s considerate 
leadership style (Stogdill & Coons, 1957), 
organizational competence and flexibility 
(Campbell & Beaty, 1971), organizational 
commitment (Porter & Smith, 1970), and 
respondent's age and tenure. 
Examination of Table 5 reveals some of 


TABLE 4 


INTERCORRELATIONS AMONG THE COMMUNICATION INDEXES FOR U.S. BANK EMPLOYEES (n = 148) 


the expected convergence and discrimination. 
= Education and tenure may be surrogates for 
rank in military units. Thus, it was hypothe- 
sized that only communication facets that 
change as a result of rank are related to edu- 


scales scored so that a high score equals a high value for that measure, Decimal places are omitted. 


Index 


Trust 

Influence 

Mobility 

Desire for interaction 

Directionality—upward 

Directionality—downward 

Directionality—lateral 
8. Accuracy 

Summarization 

Gatekeeping 

Overload 

Satisfaction 

Modality—written 

Modality—face-to-face 

Modality—telephone 

Modality—other 


1 
2. 
3. 
4, 
5. 
6. 
7 
9. 
10. 
12. 
13. 
14. 
15 
16. 
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TABLE 5 


LLLATIONS OF Nowe Couverts axo ‘Times Noxooweriocanor hoeses 
wr Orues IxsrRusexTs rom 95 Mirrary Reoososmwis 


* 


t | pirr oth e e 
(Kunin, 1955) | Ce 80 
Tres eee bie 
Inthe tar is 32 
Mobilit S53 or 
& Dnie f ao eT 
4 — — ~.22° 0t 
6 Durrer 
pe * n 
J. Direc 
O -06 D 
& Accur ae Al 
4, Summ. 27° -19 
W Amou: n 
* a 
11, Overt | -22° Vi 
. Satis 
comr „370 1 
Note. AN oo that a Aiak score represents high values fot the eat. 
1 84 
cation an re. Of the 18 correlation co- jobs dependent on information 
efiicients ny education and tenure, only (e.g., dispatchers 
the three logical ones are significant. the military units) with those of ad oong 
Other nships are also of the type pying jobs not as sensitive gt ergs 
expected. in superior, for example, is transmission (e. g., mechanics). a type * 
related to respondent’s overall job satis- these comparisons (not report t 
faction, pe d considerate leadership style, study) suggest that communication facets 
perceived ~izational competence and flex- measured by this 
ibilty, an mitment to the organization, eral, discriminate ‘ ca 
but not to | cure and education. The amount cations for occupants © jobs Spean The 
of informa:ion an individual passes is sig- 0 from those not so deg > 
niñcantly related to his overall job satisfac- former are more d 
tion and to his perception of his superior as communication — (they think more 
being considerate, but not to other variables. frequently about th 5 
Desire for interaction also shows appropriate 
relationships with responses to other percep- Discussion 


tions about the organization. The negative 
relationship between information overload 
and overall job satisfaction and the positive 
Correlation between communication satisfac- 
tion and considerate leadership are also 
understandable. Finally, the large number of 
Significant relationships among overall job 
_ Satisfaction and communication and noncom- 
munication indexes seem logical. In sum, the 
results presented in this study are in no way 
definitive, but provide a crude approxima- 
tion of convergent-discriminant validity and 
Merease confidence in the indexes. i 

Further examples of convergence and dis- 
“timination are obtainable by comparing per- 
_ “eptions about communication by people in 


based on eam 23 

ication. Wo 
1 1 Groups of individuals (egos 
workers and first-line supervisors) may also 
display unique communication behaviors. For 
even greater differentiation of communication 
problems, practitioners might find it miei 


ls y * 


beneficial to use individual items rather than 
communication indexes. 
The complexities of human behavior in or- 
ganizations, including communications, should 
be better understood when they are more 
rigorously related to other organizational 
Variables. For example, if there is a strong 
_ relationship between the degree to which or- 
ganizational members perceive that the infor- 
mation transmitted at work is accurate and 
their performance, it should be advantageous 
for organizations to devote more time to cor- 
recting problems of accuracy. This suggests 
the obvious need to relate communication 
‘measures to performance criteria. Deter- 
mining the relationships of communication 
variables to objective and behavioral criteria 


developing a standardized measure that 
describe various facets of communication 
a variety of organizations. The data 
reported in the present study are exploratory 
nd an initial attempt in this direction. The 
value of an easily completed descriptive mea- 

of organizational communication seems 
vious. If the case often made by organiza- 
theorists is correct—that “good com- 
nication” makes a difference—then an 
rstanding of what is “good communica- 
tion” and its correlates should increase our 
knowledge of organizational behavior. 


2 REFERENCES 
CAMPBELL, J. P., & Brarx, E. Organizational cli- 
mate: Its measurement and relationship to work 
_ group performance. Paper presented at the meeting 
_ of the American Psychological Association, Wash- 
___ ington, D.C., September 1971. 
_ Conex, A. Upward communication in experimentally 
4 created hierarchies, Sociological Review, 1958, 15, 
342-351. 
Davs, 55 Success of chain-of-command oral com- 
munjcation in a manufacturing management f 
D gioi of Management Journal, 1585 in ye 


Karcexe H. Roperts AND CHARLES A. OREULTx III 


Graves, D. Reported communicatio 
formal status in managerial work 
Relations, 1972, 25, 159-170. 

Kus, T. The construction of a : 
tude measure. Personnel Psyc: 
65-78. 

Lawter, E., Porter, L., & TENES 
ager’s attitudes toward interactior 
nal of Applied Psychology, 1968, 

Lawson, E. Change in communi 
formance, and morale. Human 
18, 139-147. 

Likert, R. The human organisat: 
McGraw-Hill, 1967. 

O'Renry, C., & Roserts, K. Infor 
in organizations: Three experim 
tional Behavior and Human Perfor: 

Porat, A., & Haas, J. Informat 
decision-making. Behavioral Scie 
98-104. 

Porter, L. W., & Roperts, K. Orga 
munication. In M. Dunnette (Ed.) 
industrial and organizational psych 
Rand-MeNally, in press. 

Porter, L. W., & Smoru, F. E. 
organizational commitment: A lon 
of the initial stages of emplo) 
reactions. Unpublished manuscript. 
California, Irvine, 1970. 

Reap, W. Upward communication in j: 
archies. Human Relations, 1962, 15 

Roserts, K., O’Remry, C., Bretron, & 
L. Organizational theory and orga 
munication: A communication failure? 
Porter & K. H. Roberts (Ed.), Com /unication in 
organizations. Middlesex, England: Penguin, 
press. (Human Relations, in press.) 

Rocers, E., & BHowmix, D. Homophily 
Relational concepts for communication rese; 
Public Opinion Quarterly, 1970, 34, 523-538. 

Rosen, S., & Trsser, A. On reluctance to communi- 
cate undesirable information: The MUM effect 
Sociometry, 1970, 33, 253-263. 

Stocpmt, R., & Coons, A. Leader behavior: Is 
description and measurement, Columbus: Ohio 
State University, Bureau of Business Research, 
1957. 1 

Tryon, R. C., & Bary, D. E. Cluster analysis. 
New Vork: McGraw-Hill, 1970. 

Witensky, H. Organizational 
York: Free Press, 1967. 


(Received April 30, 1973) 


-heterophilys 


intelligence. New 


Applied F ty 
e Vel. 59, No. Sh 


FAC 


BENSON ROSEN asp THOMAS H. JERDEE 
0 School of Business Administration, University of North Carolina 


luence of individual and organizational factors on disciplinary judg- 
is studied by means of a disciplinary exercise administered to 222 
duate business students, In four separate discipline cases, the severity 
nizational consequences for a rule infraction and various characteristics 
en the same rule violation resulted in 
compared to mild organizational consequences, greater personal respon- 
was attributed to the offender and harsher disciplinary measures were 
nded. Offender characteristics such as status, talent, performance, and 
disciplinary judgments. Results 
í attribution and in terms of practical 
ations regarding fair and consistent administration of discipline. 


offenders were manipulated. Wh 


sation equity also influenced 


in terms of current theories o 


Manageri: ments regarding the sever- 
ity of partic: isciplinary problems and the 
determinati ippropriate disciplinary ac- 
tions have i aut organizational implica- 
tions, An of vs willingness to change his 
behavior an iform to company policy in 
the future r>- depend to a large extent on 


that he has been treated 
_ the reactions of other em- 


his percepti 
fairly. More 


ployees to va ‘ous disciplinary actions may 
have long-ter!) consequences for organiza- 
tional morale and future employee behavior. 
Accordingly, systematic study of factors that 
influence managerial disciplinary judgments 
is needed. 

Research by Boise (1965) examined em- 


Ployee characteristics that have a bearing on 
disciplinary action. Boise surveyed Phila- 
delphia city employees and found that a 
Worker’s value to his department influenced 
his supervisor’s choice of penalty for rule 
infraction, According to Boise, supervisors 
pete hesitant to impose penalties on a sub- 
ordinate when his skills were in short supply- 

Kipnis and his associates (Goodstadt & 
Kipnis, 1970; Kipnis & Cosentino, 1969; 
Kipnis, Silverman, & Copeland, 1973) inves- 
tigated the relationship between the super 
visors emotional state and his choice of 
disciplinary actions. ‘These investigators 
showed that negative emotional states result- 


mg from i í i= 
mg from uncooperative or hostile subord 
2 
Requests for reprints should be sent to Benson 
Unis, Graduate School of Business Administration, 
niversity of North Carolina, Chapel Hill, North 
arolina 27814. 


S INFLUENCING DISCIPLINARY JUDGMENTS 


are dis- 


nate behaviors were associated with more 
coercive disciplinary measures. 1 
Although there are very ſew investiga 
specifically concerned with disciplinary judg- 
ments, it seems reasonable to assume that 
theory and research into attribution of caus- 
ality and responsibility (Jones & Davis, 1965; 
Kelley, 1967) is relevant to an unders 
of the judgmental processes of managers 
faced with disciplinary decisions. Attribution 2 
theory asserts that greater responsibility will 
be attributed to an actor, rather than to cir- 
cumstances beyond the actor’s control, in 
situations where the consequences of the ac- J 
tor’s behavior are more severe (Kelley, 1973). f 
Kelley’s (1971) discussion of the processes 
underlying moral evaluation is also relevant 
to understanding judgments of responsibil- i 
ity. Kelley argues that judgments of what 
works“ or what produces favorable outcomes — 
tends to influence judgments of what is mor- 8 
ally right. Studies by England (1967) ind 
cate that this pragmatic evaluative frame- 
work is especially characteristic of American 


managers. “ 

The purpose of the present research is to 
assess the influence of the organizational 
harm resulting from a rule violation and of 
various individual characteristics on judg- 
ments regarding the severity of disciplinary 
problems and the appropriateness of various 
disciplinary actions. Specifically, we predicted 
that subjects would attribute greater respon- 
sibility to an actor and recommend more sē- 
vere punishment when the unacceptable be- 
havior results in more serious organizational 
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consequences. Also investigated was the hy- 
pothesis that an employee's value to the 
organization (depicted in terms of talent, past 
performance, and status) and his psychologi- 
cal health influences judgments of the serious- 
ness of his offense and severity of punish- 
ment. Finally, it was hypothesized that 

underpaid employees are treated less harshly 

than are equitably paid employees. 


k METHOD 
Subjects 


The subjects were 222 undergraduate business 
students at the University of North Carolina. 


Procedure 


Experimental materials were presented in the form 
of a discipline exercise. Subjects were asked to 
4 assume the role of a personnel manager for a large 
manufacturer and to consider a series of incidents 
involving employees who had committed a variety 
4 of rule infractions. Each case was written in four 
or more versions to manipulate the independent 
Be variables. From the subjects’ perspective, however, 
| the manipulations were completely unobtrusive, 
since each subject received only one version of 
each incident and was unaware that other versions 
existed. Assignments of subjects to treatment condi- 
tions were completely random. 

The discipline incidents and the experimental 
variables of interest were as follows: 


I. Illegally using the company car: Extent of 
Damage X Status of Employee. Four versions of this 
incident were written to depict an employee (de- 
cribed as the vice president or janitor) who bor- 
rowed the company car for personal use under false 
_ pretenses. The car was subsequently involved in an 
accident described as very minor or as completely 
_ demolishing the car. 

2. Taking hard drugs: Department Performance 
-X Employees Talent. Four versions of this incident 
were written to depict a designer (described as mar- 
-ginally talented or exceptionally talented) who often 
worked under the influence of hard drugs and en- 
couraged fellow employees to use drugs. Depart- 
mental performance was described as either high 
or low. 

3. Extended lunch hour: Consequences for Man- 
ager X Employees Psychological Adjustment. This 
incident depicted an administrative assistant (de- 
scribed as well adjusted or suffering from family 
problems) who frequently extended his lunch hour. 
The consequences of the extended lunch resulted in 
failure to keep up with the work or in no noticeable 
© decrement in productivity. 

44. Expense account padding: Salesman’s Salary 
X Salesman’s Performance. Four versions of this 
incident were written to describe a salesman who 
was guilty of padding his expense account. The 
company’s sales commission rates were described as 
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among the highest or amo 
try; and the salesman’s p 
as either up 10% or dow 
previous year. 


Dependent Variables 


Subjects were asked to dk 
priate disciplinary action fo 
made decisions írom the 
(0) no action; (1) oral w 
record; (2) oral warning no 
record; (3) written repriman 
record; (4) one week sus; 
(5) discharge. In addition, 
fenses on three dimensions 
offender’s personal responsibi'i 
of unethical behavior. 

In summary, for each of 
incidents, each subject read on 
consequences of the rule infract\> 
and read only one version of 
stances—designed to increase « 
of responsibility to the offend 


Resutts AND Di 


Effects of Organizational £ 
Rule Injractions 


It was hypothesized that m 
ciplinary action is recommen: 
fractions that result in sev: 
mild organizational consequ 
shows mean severity of disc 
ations of the offender and the ove 
high- and low-consequence-sev cri ty 
in each of the four incidents. 

Results supporting the hypothesis v 
tained in each of the four incidents. 
cantly greater punishment was reco 
when illegal use of the company car 
in a major accident compared to a 
accident; when illegal use of drugs 
in lower departmental performance com 
to satisfactory departmental performal 
when an employee’s extended lunch 
caused work to fall behind compared l 
decrement in work; and when violatio 
expense account regulations were as 
with decreased sales compared to mer 
sales. In addition, significant differences 
also obtained in the predicted directio 
evaluations of the seriousness and ethi 
of the rule infractions. 

These findings provide strong ang 
sistent support for the influence of sev 
of consequences on disciplinary deci 


x 


Stronger discipline was advocated when viola- 
tions of organizational policy resulted in 
greater harm to the organization. These find- 


ings also support Kelley’s (1971) theory of 


moral evaluation and imply that judges lose 


sight of organizational standards or “ideals” 


in situations where policy violations have 


minimal consequences for the organization. 
These findings held for rather blatant viola- 
tions such as dishonesty and working under 


the influence of drugs as well as for a lesser 


infraction of extending the lunch hour. 


Greater responsibility for the offense was 
attributed to employees in the severe com- 


_ pared to the mild organizational consequences 
conditions in two of the incidents. These 
findings are consistent with current theories 
of attribution processes (Kelley, 1973) and 
recent research by Phares and Wilson (1972), 


Walster and Shaw and Sulzer 


(1964). 
Effects of Characteristics of the Offender 


It was hypothesized that an employee’s 
value to the organization, depicted in terms 
of talent, past performance, and status, influ- 
ences disciplinary judgments. Table 2 shows 
the mean severity of discipline and evalua- 
tions of the offender and the offense for the 
various employees described in the discipli- 
nary incidents. 


(1966), 
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TABLE 2 
Ervects or CHARACTERISTICS ov THE OFFENDER ON Disci 
AND EVALUATIONS OF OFFENSE AND OFFENDE» 
Incident 1 Incident 2 tS 
Dependent Excep- i ud 
variables Janitor Vice pres- Marginal | tionally a | 
( = 109)| ident F designer | talented | F „ 
( = 113) (n = 104) | designer pr 104) 
(n = 113) n p 

Severity of | i 

discipline 2.98 1.70 | 147.02°°| 2.54 2.05 |612| 1 10 m 
Seriousness of 

offense 3.02 1.80 92,00°* 2.73 2.40 |560 l f 
Unethicalness | 

of offense 2.92 2.13 34.06** 2.70 2.24 7.75* i 2 18 
Personal re- | | 

sponsibility 

of offender 3.64 3.22 14.22** 3.38 3.20 ns 2.41 ) | 16.57%" 

bd 05. Fy l 

2 855 


An employee's value »rganization | 


significantly influenced re dations for 
disciplinary action in two e incidents, | 
The janitor was disciplin e severely 
than was the vice president re identical 
tule violation, and the de of marginal 
talent was disciplined more ely than was 
the exceptionally talented dé r, for work- 
ing under the influence of dri In addition, 
evaluations of the seriousness and the ethical- 
ness of the behavior and the attributions 0 
the offender's responsibility were in the prë- 
dicted direction in both of these disciplinary 
incidents. Offenses committed by employees 


with high job status or creative talent were 
perceived as significantly less serious and sig- 
nificantly less unethical, and the offender was 
perceived as significantly less responsible for 
the rule infractions. It appears that 
status and exceptional talent were associated 
with greater leniency for rule infractions. 
These findings provide additional experimen- 
tal support for Boise’s (1965) survey. finding 
that supervisors are reluctant to discipline 
“highly valued” employees. saad 
A second offender characteristic exam i 
was the psychological adjustment of the ros 
ployee. In the present context it was hypo 15 
sized that an offender is treated with oily 
sympathy and indulgence because of 1 
problems partially beyond his control. No 


re obtained for disci- 
endations for the well- 
c assistant compared to 
tant with family prob- 
ile violation was per- 

less serious and less 
intly less personal re- 
muted to the assistant 


is considered the rela- 
mity and discipline. 
terms of severity of 
n to the poorly com- 
ympared to the well- 
n, and this difference 
lificance (F = 4.27; dj 
loreover, the dishonest 
I ed as significantly less 
=5 = 1/218, p < 0s) and 
thical (F = 9.71, dj = 
the underpaid salesman 
paid salesman. Equity 
nfluenced attributions of 
e offense, The equitably 
perceived to be more 
behavior than was the 
. (F = 32.80, df = 1/218, 


cresting to determine in 
h the pervasiveness of the 
ions, In the present sample, 
; can be viewed as restoring 
) search would be required 
mine whether rule infractions unre- 
10 restoring equity, such as deviations 
Ormal work procedures or violation of 
Standards, would also be downgraded 
der-paid employees. A related question 
$S the extent to which negative out- 
Other than low pay, such as dehuman- 
Working conditions, temper judgments 
ate disciplinary action for various 
ions. 
on to providing support for cur- 
es of attribution processes, the 
findings have some implications of 
al importance to managers. While it is 
“sonable to expect that greater disci- 
F action would be taken for very serious 
Mractions compared to small violations, 


ciplinary indulgence as a means of 
equity is also fraught with difficulties, 
it piles one form of inequity atop another. 
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EFFECTS OF INCENTIVE, FEEDBACK, AND 
PRESENTING THE FEEDBACK ON LEADER 
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U.S. Military Academy 


R OF 
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CABOT 
Florida Techr:o: tute 


The purpose of this study was to determine if the content of a lead 
behavior could be altered by manipulation of the type and manr 
back and incentive. Ninety-six groups of four people, with one 
group, served as subjects in a 2 X 2 X 2 design. During a case disc 
experimenter varied the three independent variables and recorded tl 
of the leader’s verbal behavior by means of the Bales category 
was found that: (a) positive feedback made the leaders more task 


whereas negative feedback increased negative social-emotional beh: 


the incentive decreased the number of questions asked; and (c) th 
presenting the feedback (publicly or privately) produced no changes l 
behavior when analyzed as a main effect, but significant interactic 


found. 


In trying to identify the causes of leader- 
ship-attempting behavior in small groups, 
many recent studies have focused on situa- 
tional rather than personal determinants of 
this type of behavior. Jaffee (1968), in devel- 
oping his empirically based theory concerning 
the determinants of leadership-attempting be- 
havior, as measured by verbal behavior, 
exemplifies this approach. He concluded that 
the more readily accessible method of modi- 
fying leadership-attempting behavior of indi- 
viduals within a group may be the manipula- 
tion of the characteristics of the situation 
rather than the long-term “personality” of the 
individual. Jaffee further noted that three of 
the prime causes involved in whether an indi- 
vidual attempts to lead and the amount of 
his leadership-attempting are: the availability 
of both incentive and feedback, and the man- 
ner in which the feedback is given. However, 
it was not known if these variables affected 
the content or kind of verbal behavior that 
an individual emits when attempting leader- 
ship behavior. It was the purpose of this 
study to assess the effects of the aforemen- 
tioned variables on the content or quality of 
a leader’s verbal behavior. Specifically, the 
variables of incentive, feedback, and the man- 
ner in which the feedback was given (either 
privately or publicly) were manipulated, and 


1 Requests for reprints should be sent to Richard 
P. Butler, Office of the Director of Institutional 
Research, U.S. Military Academy, West Point, New 
York 10996. 


the resulting kinds of vehavior were 


recorded. 


METHU 
Subjects 


teers who wert 


The subjects were 384 mai 
a ourse. In all, 9 


taking an introductory psy 
groups of 4 were observed. 


Experimental Setting and * paratus 


room separated 
se-way mirror. A 
e four persons 
table, facing the 
was placed a bot 
ven, The box had d 


The setting was an 8 X i? 
from an observation room by =< 
table large enough to accommo 
was placed in this room. 
subject designated as the le 
through which feedback was 
total of four lights, two of h were located on 
the inner side and visible only to the leader, d 
two of which were placed atop the box and in 5 
view of all the subjects. Eac! ir consisted of 3. 
red light and one green light. A microphone 15 
positioned in the middle of the table and was oa 
nected to an amplifier with earphones 80 that 1 
subjects’ discussion could be monitored in Si T 
joining room. The switches for operating the 700 
of the leader's feedback box were located in 
adjoining room. 


Experimental Design 


A 2 * 2 * 2 complete factorial design with on 
replications per cell was employed. Twelve k Š 
of four served in each of the eight rand 5 
signed treatment conditions. An explanation 

ctors follows. 5 
Re For one half of the 96 groups th 
leaders were told that they would receive 3. 
did an effective job in leading the ensuing 10 
No mention of the $5 incentive Was made 
leaders of the other 48 groups. 


ual 
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Feedback. | half of the groups the leaders 

received prir sitive feedback via the flashing 
» of lights, th ht green lights indicating that 

the leaders w g well and two red ones mean- 
ing that they »t doing well. For the remaining 
48 groups t! rs were given mainly negative 
feedback or d lights and two green lights. 
For each lea ieedback lights were presented 
in a random subject to the ratio constraints 
previously m i 

Manner o ting the feedback. Forty-cight 
kaders receiv: r feedback publicly, that is, the 
red and the lights on top of their feedback 
box were fli thus enabling the other group 
members t ce how well their leader was 
doing. For vaining 48 leaders the feedback 
was given p so that only the leader himself 
knew how s was doing. 

The Bales y system (Bales, 1950) was used 
to record th: ative or content verbal behavior 

T of the lead: 

Procedure 

The four + of each group were read gen- 
tral instructi: - the experimenter. In essence, the 
instructions that the purpose of the experi- 
Ment was to ve and study how a small group 
Roes about so two separate problems that often 
Occur in the lay working world. The first case 
Problem conc i a salesman who constantly vio- 
lated compaii les but nevertheless was high in 
Sales. The gr: task was to come to some kind 
of Solution to case in the allotted time of five 
Minutes. The è purpose of this first discussion 
Was to identity ihe duration of speech for each 


subject so that the individual who spoke the most 
could be appointed as the leader for the second 
fase discussion. if has been shown that duration 
OÍ speech by individuals in small groups is highly 
correlated with a group member being selecte 27 70 
the leader of the group (Simpkins & West, 1966). 
€ experimenter recorded duration of speech from 
ee the one-way mirror in the control room. 
At the end of the five minutes he reentered the 
perimental room, appointed the individual with 
e longest duration of speech as the leader, and 
tead instructions to the second case problem. This 
fase involved the distribution of funds to the various 
partments of a small town for the coming year. 
the task of the group was to find a solution. 
5 to the leader varied according to which 
of th ent he had been assigned. During the course 
1 ie: ee discussion the experimenter observed 
e Bal er’s behavior, recorded his verbal remarks on 
ace = category system, and delivered feedback to 
jects a, ty two and one-half minutes, When the sub- 
Sn ove a solution to the case or when the 
entered penne limit had elapsed, the experimenter 
0 as. e room and ended the discussion. $ 
the 855 ss the experimenter’s reliability in scoring 
eder tatesorp system, a second observer, who 
Perience in the use of category systems, 


0 


333 


simultaneously observed five groups and recorded 
with the experimenter, 


Resutts 


For each of the 12 Bales’s categories the 
frequency of behaviors considered within its 
domain was recorded for each of the 96 lead- 
ers. However, for purposes of data analyses, 
the 12 categories were not treated separately, 
but, following a suggestion by Bales (1950), 
were collapsed into four major areas: (a) 
Categories 1-3 (positive social-emotional 
acts), (6) 4-6 (attempted answers), (c) 7-9 
(questions), and (d) 10-12 (negative social- 
emotional acts). 

To assess scorer reliability, Pearson prod- 
uct-moment correlations were computed be- 
tween the experimenter’s scores and those of 
the other observer for five groups. All co- 
efficients for the four major areas were .90 or 
above. Thus, high reliabilities as reported by 
Bales (1950) were obtained in this study 
as well. 

By far, the largest average proportion 
(.70) of the total number of acts falling 
within each of the major areas for the 12 
leaders in each of the eight treatment combi- 
nations was in the attempted answer area. 
Positive social-emotional acts and the ques- 
tion area were in a virtual tie for second 
place with average proportions of .12 and .13, 
respectively. Negative social-emotional be- 
havior accounted for an average proportion 
of only .05 of the acts. Examination of the 
proportions, using Bartlett’s test for homo- 
geneity of variance, indicated that hetero- 
geneity of variance across treatments was 
present. To remove this difficulty so that sta- 
tistical assumptions could be met, the arcsin 
transformation was employed. Similar analy- 
ses on the transformed data showed that the 
transformation succeeded in obtaining homo- 
geneity of variance. 

To analyze the effects of incentive versus 
no incentive, positive versus negative feed- 
back, and public versus private presentation 
of this feedback on each of the four major 
Bales’s categories, four 2 X 2 X 2 analyses of 
variance were made. Table 1 presents the re- 
sults when the F values were significant. If 
an F value was not significant, it does not 
appear in Table 1. Thus, for Category A and 


8 
0 


* 
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TABLE 1 


ANALYSES OF VARIANCE OF PROPORTIONS ON EACH OF 
BALES’ Major CATEGORIES 


Behavior category df | MS F 
C. Questions 
Incentive 1 |.15 | 4.62* 
D. Negative social-emotional 
acts 
Feedback 1 | .47 | 16.79*** 
Incentive X Manner of 
Feedback 1238.04 
A-+D. Social- emotional acts 
Feedback 1 | 418 | 6.84** 
Incentive X Manner 
of Feedback Te ei oo Uy i 
BC. Tasks related units 
Feedback 1 | .26) 7.94** 


Note. Proportions were transformed by x’ = 2 arcsin 


W. Only significant F values are shown, All others were non- 
significant: 


Category B there were no significant main or 
interactive effects; and therefore their analy- 
ses do not appear in the table. For Cate- 
gory C, the only significant effect was for the 
incentive versus no incentive treatment (F 
= 4.62, df = 1/88, p < .05). Inspection of 
the means showed that the no incentive con- 
dition elicited significantly more questions 
than did the $5 incentive treatment. It can be 
conjectured then that without the pressure of 
a $5 incentive the leaders were more prone 
to ask for the opinions, suggestions, and 
information of other group members. 

For Category D, a highly significant differ- 
ence was obtained between the primarily neg- 
ative and primarily positive feedback con- 
ditions (F = 16.79, df= 1/88, p < .001). 
Examination of the means revealed that when 
the mostly negative feedback treatment was 
administered there was a strong increase in 
the number of behaviors falling within the 
negative social-emotional area. It appears 
then, that when negative feedback is com- 
mon, the leaders tend to become more dis- 
agreeable, tense, and antagonistic, since these 
are behaviors listed under Category D. It 
should be noted that virtually all the groups 
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needed the full 30 
solution to the prob 
of feedbacks prese! 
10. Another significa 
order interaction bei 
ditions and the man: 
feedback, that is, pri 
=8.04, df = 1/88, 7 
the means showed that 


to come to a 
the number 
1 leader was 
was the first- 
ucentive con- 
resenting the 
publicly (F 
Inspection of 
incentive was 


present with public prs n of the feed- 
back, the greatest amou gative social- 
emotional behavior was d. In contrast 
to this, the smallest a vas produced 


when no incentive was 
feedback. 

A second method of 
category system is to 
combinations produced & 
proportions of acts direc 
(B +C) and to the 
(A + D). Two addition 
ance were performed on 
dices. The results of 
A D combination also 
Two significant F ratios 
first was for the feedback 
df = 1/88, P < .01). 
back, a significant inc 
acts in the social-emotiona: 
The second significant © atio (F = 135, 
df = 1/88, p < 01) was the first-order inter 
action between incentive the manner 0 
administering the feedback. In this study; the 
least amount of social-emot nal behavior oe. 
curred when no incentive 
back were employed; the 
curred under conditions ot 
feedback. y 

The analysis of variance f oe 
ssi ton Cable 1) indicated a signi 
effect for the feedback conditions (f= 19 
df= 1/88, p< 01). The inference | 
looking at the means was that i 
ily positive feedback was given, the 
tended to be more task oriented. 


with private 


ng the Bales 
he treatment 
rences in the 
the task area 
notional area 
lyses of vari- 
summary in- 
alysis of the 
ir in Table 1. 
obtained, the 
ble (F = 6.84, 
negative feed- 
the number of 
ea Was elicited. 


Tas 
Ubu 


atest amount oc- 
no incentive-public 


or the B+C 


eaders 


Discussion 
s t0 
The purpose of the present study, “jich 
see if, by contriving the Eion 
the leader of a small group perform. 


3 jd be 
content of his verbal behavior cou 


ad private feet 


, 


when prime 
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altered. It w nd that it was possible to 
T modify some not all, types of verbal 
T behavior. TI that some types were sub- 
ject to modi n is important in that it 
adds a new asion to the modification of 


r in small groups, namely 
he sheer quantity of speech 


leadership b 
that not only 


be altered, been shown by several 
other resear see Jaffee, 1968), but also 
that the con leader verbal behavior can 
be modified | 

Specificall the three independent vari- 
ables that y anipulated to create differ- 
ent environ: it was found that the feed- 
back facto the greatest effect of any 
single fact sitive feedback made the 
leaders mor: oriented, whereas negative 
feedback in: d their social-emotional re- 
marks. App y, positive feedback had the 
effect of p ng leaders who were more 
willing to into the main issue at hand 
and more wi to give and ask for informa- 
tion, suggest! and opinions. Negative feed- 
back, on the «her hand, caused leaders to be 
more disagre de and to show tension and 
antagonism 2: . higher rate. Thus, the results 
of this study indicate that in a production- 
oriented organization, positive feedback is to 


be preferred io negative feedback, and nega- 
tive feedback might have very little to offer 
if no specific suggestions for changing one’s 
behavior are civen. In fact, Deutsch (1949), 
House (1966), and Schutz (1955) all have 
shown that conflict and antagonistic behavior 
Often lead to a decrease in production. 
Content verbal changes in the social-emo- 
tional area were also noticeable as a result of 
the first-order interaction between the incen- 
tive factor and the manner of presenting the 
feedback. However, the significant interaction 
Pertaining to A + D was probably caused by 
4 difference in the proportion of acts falling 
1 D, since no differences were 
=e in Category A. Therefore interpreta- 
8 limited to the interaction that Pro- 
ti Significant differences in Category D. 
Bon: important to note that neither factor 
Bios was powerful enough to produce sig- 
z 1555 changes. This means that to produce 
7 5 it might be necessary to administer 
ni of them simultaneously. It is also impor- 
to consider the combination that elicited 


the low and high levels of negative social- 
emotional behavior. Lack of incentive led to 
fewer remarks under conditions of private 
feedback than under public feedback, whereas 
incentive resulted in just the opposite effects. 
When analyzing the results that were ob- 
tained when the $5 incentive was paired with 
public or private feedback, the fact that the 
public feedback combination resulted in fewer 
negative remarks indicates that it produced 
the more socially desirable behavior. 

A third area in which changes in verbal 
content were produced occurred in Cate- 
gory C, questions. It was found that incentive 
was able to decrease the number of times that 
a leader asked for information, opinions, and 
suggestions from his subordinates, This find- 
ing is somewhat consistent with Reilly and 
Jaffee (1970), who found that under condi- 
tions of incentive, individuals were less will- 
ing to be “led” by others. Apparently, then, 
leaders prefer to take things into their own 
hands, that is, to become less democratic when 
there is a chance at financial remuneration. 

The two remaining Bales’s Categories, A 
and B, were not subject to modification by 
any of the independent variables. It may be 
that these categories are extremely difficult 
to modify under any laboratory conditions or 
that the independent variables were just 
irrelevant for them. 

In summary, it appears that the variables 
involved in Jaffee’s (1968) theory concerning 
the amount of leadership-attempting behavior 
have at least partial relevance to the modifi- 
cation of the content of a leader’s verbal be- 
havior. More specifically, the feedback vari- 
able seems to possess some generality to both 
the amount and content of a leader’s verbal 
behavior, the incentive factor somewhat less, 


and the manner of presenting the feedback, — 
none at all to the 
when combined with 


publicly or privately, 
content area, except 
incentive in certain circumstances. 
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‘ARTICIPATIVE MANAGEMENT IN THE 
COLLEGE CLASSROOM 


hypothesis. 


Several sti 
ences betwe 

structor-cent: 
and motivat 

Anderson, 

examination 
the two type: 
consistent di 
thinking” ( 
analysis) an 
dent-centere: 


have investigated the differ- 
tudent-centered versus in- 
teaching vis à vis learning 
for an extensive review, see 
In general, with objective 
as an indicator of learning, 
‘eaching do not indicate any 
ices. However, for “critical 
essay exams and problem 
r motivational criteria, stu- 
aching seems more effective 
(McKeachie, 1969). A systematic under- 
Standing of == and how student-centered 
methods contribute to the goal of education 
that goes beyond acquisition of knowledge 
could lead to useful changes in classroom ac- 
tivities, This study assumes that the theories 
developed to improve managerial and organi- 
ation effectiveness are relevant to this under- 
Standing of the classroom situation. 
k S in industry has found that giving 
in viduals greater influence in decision mak- 
x increases the effectiveness of their per- 
mance and their levels of satisfaction (€.8-, 
pS . 1963). It seems reasonable 
an if the decision-making process in the 
Wales iin be designed to increase the 
0 of learning and motivation of stu- 
a all activities and circumstances of the 
9295 om, as in any life area, are subject to 
a . in decision making. For 
=e school administration may expect 
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anus for reprints should be sent to Ralph H. 
nn, Graduate School of Business, University 
urgh, Pennsylvania 15213. 
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hypothesized that students encouraged to provide information and 
making efforts for classroom procedures of importance to them give 
.vorable ratings to various aspects of the instructor's teaching effec- 
and to their perceptions of what they learned than do students not 
this encouragement and opportunity. Two graduate classes designed 
ourse within the latitudes of administrative constraints. As controls, 
aduate classes were presented the identical course design the following 
r as dictated by the instructor. Results indicated reasonable support 


that the class will meet according to some 
time schedule, that formal grading will be 
performed by the instructor, and that there 
will be some distribution of grades. Even 
within such constraints, however, there is 
latitude in the ways the constraints are satis- 
fied. Much of the day-to-day actual class- 
room activity is not specified by the admin- 
istration. 

The present study hypothesized that when 
students are encouraged to participate in 
decision making regarding classroom pro- 
cedures, they give more favorable ratings to 
various aspects of the instructor's teaching 
effectiveness and to their perceptions of what 
they learned. 


METHOD 


Subjects 

Subjects were graduate business students in four 
classes (each of about 25 students) of a required 
course. The same instructor taught all four classes. 
Two daytime classes, taught in the winter semester, 
were assigned to the experimental treatment; two 
night classes in the spring semester were used as 
controls. Consequently, it was unlikely that students 
in these two treatments would interact and, there- 
fore, discover differences between the instructor's 


methods for each class. 


Procedure 

On the first day of class for the experimental 
groups, the instructor outlined the constraints on 
the course. Within broad limits, he stated that the 
students, if they so desired, could determine the 
basic design of the course. Basically, the instructor 
permitted the students a voice in classroom issues 
they felt important to them. 


In the two control classes, the instructor dictated 
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the course design that was identical to that devel- 
oped by the students in the experimental groups. 

One week before the end of the semester, cach 
class was administered a teacher and course evalua- 
tion form (University of Pittsburgh’s Student Opin- 
ion of Teaching Questionnaire) by a student repre- 
sentative. This form contains 22 items concerning 
the student’s perception of what he learned, as well 
as his reactions to various aspects of the instruction. 

For each item (e.g., “The instructor has given me 
new viewpoints”) a rating is made on a 5-point 
scale (1 = instructor ranks below most of the teach- 
ers you have known, 2=is only fair, 3= is compe- 
tent and compares well, 4=is well above average, 
5 is one of the most outstanding). 


RESULTS 


Of the 22 variables tested, the experimental 
subjects ( = 54) rated all items higher than 
did the controls (n = 45), except for contri- 
bution to learning via lectures. According to 
the sign test (Siegel, 1956), this overall re- 
sult is statistically significant well beyond the 
.001 level. In addition, for eight separate 
items the difference in favor of higher mean 


ratings for the experimental group were sta- 


tistically significant by a two-tailed ¢ test 
(p < .05). These items were the following: 
the instructor has increased my interest, pre- 
sents worthwhile material, encourages initia- 
tive, has stimulated thinking, shows respect 
for opinions, and is sensitive to student diffi- 
culties; the class discussions contributed to 
learning, and would recommend the course 
as an elective. 


Discussion 


The only variable which was not rated 
higher for the experimental condition con- 
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cerned the contribution 
by the lectures. Since bot 
control condition were tau; 
the experimental condition, 
the instructor’s lecturing ‘ 
improved with practice for 

A limitation in the de 
differences between the d: 
dents may account for th 
ever, in the graduate prog 
the University of Pittsbure 
in class composition and m 
seem significant. Whether th 
effect is informational (e 
have a better understandin: 
is not possible to establish 
study. Further, a limitatio: 
that only one instructor 
though this helped to con 
differences, the results may 
able to the teaching styles ©: 
The study does suggest th 
on participative managemen: 
may be fruitful. 
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(NSIC, EXTRINSIC, AND RISK PROPENSITY 
\CTORS IN THE WORK ATTITUDES OF 

| “HERS, EDUCATIONAL ADMINISTRATORS, 
AND BUSINESS MANAGERS* 


IN 


CECIL MISKEL °? 


University of Kansas 


I | versions of Borgatta, Ford, and Bohrnstedt’s Work Components Study 
í nnaire were used to compare the job orientations of college-educated 
ces in educational and business organizations. The sample was com- 
432 teachers, 118 educational administrators, and 192 business mana- 
\iultiple- and single-classification analysis of variance procedures were 
oò explore relationships and group differences across the attitudinal sub- 
The hypothesis that the attitudes of educators differ from the attitudes 


During sc! 
that schools 
basis” becon 
plication of 
administered 
model as aré 
seemingly pla: 
sity and intri: 
value on hygi: 
norms regard! 


board elections the opinion 

id be “put on a businesslike 

articularly evident. One im- 

is that schools should be 

n a competitive efficiency 

siness enterprises. This model 
a high value on risk propen- 

c factors, while placing a low 
e factors. Consequently, the 
risk taking in the decision- 
ad the level and type of work 
ially are fundamental varia- 
aing the usable range of ad- 
models for different organiza- 


ministrative 
tions. 


This potential importance of employee 
attitudes toward administering organizations 
and 2 lack of sufficient comparisons between 
organizational types invite further investiga- 
tion. Consequently, the purpose of this study 
A to compare the orientations of college- 

ucated teachers, educational administrators, 
5 business managers toward risk propen- 
sity, extrinsic factors, and intrinsic factors 
their jobs. l 
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iness managers toward intrinsic-extrinsic rewards and risk orientations 
upported. Finally, a continuum of differing group 


profiles was proposed. 


RATIONALE AND HYPOTHESIS 


The conceptual foundation of work atti- 
tudes for this study was developed from the 
works of Herzberg, Mausner, and Snyderman 
(1959) and Blum (1961). Herzberg found 
that positive events are dominated by ref- 
erences to intrinsic aspects of the job itself, 
while negative events are dominated by rei- 
erences to extrinsic aspects of the job situa- 
tion. Sergiovanni (1967) concluded that the 
results of replicating Herzberg’s study in the 
educational organization supported the find- 
ings in industry. 

However, Blum’s findings modify these 
conclusions, showing that the desire for se- 
curity can be a deciding factor in vocational 
choice. A person might select a job where 
elements of security and other hygiene fac- 
tors are high. Other persons, less concerned 
with security, choose jobs in which motivators 
or satisfiers are high. Williams’ (1965) find- 
ings that low-risk takers are more concerned 
with extrinsic work characteristics and that 
high-risk takers are more concerned with 
intrinsic characteristics support Blum’s con- 
clusion. ; 
Pavalko (1971) places the work orienta- 
tions of educators and business managers at 4 
opposite ends of an occupation-profession 
continuum with the norm in business occupa- 
tions favoring personal aggressiveness. 
Krause (1971) supported this position, noting 
that the ability to handle uncertainty 1s 4 
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basic career contingency for business mana- 
gers. Further support is provided by the 
findings that individuals scoring highest on 
the Work Components Study Questionnaire 
subscale measuring competitiveness desirabil- 
ity were perceived as contributing the most to 
the company (Ford, Borgatta, & Bohrnstedt, 
1969). Conversely, the norms in educational 
organizations appear to place a premium on 
a more reserved, less aggressive orientation. 
In a simulated problem-solving situation, 
Brown (1970) found that business managers 
have a higher risk propensity and achieve- 
ment motivation than do educational admin- 
istrators. Based on these positions, the hy- 
pothesis was derived that the attitudes of 
educators differ significantly from the atti- 
tudes of business managers toward Herz- 
berg’s motivation factors and Blum’s security 
orientations as measured by parallel forms of 
the Work Components Study Questionnaire. 


METHOD 


The instruments used to measure the attitudes 
were the Work Compenents Study and the Educa- 
tional Work Components Study questionnaires. 
Borgatta (1967) developed the Work Components 
Study to merge and operationalize Herzberg’s two- 
factor theory with Blum’s findings regarding the 
security orientation among workers. Essentially, the 
items ask the respondents to judge the desirability 
of jobs with varying amounts of intrinsic factors, 
extrinsic factors, and intrinsic combined with risk 
factors. 

Miskel and Heller (1973) modified the Borgatta 

instrument to suit the educational organization by 
replacing words relating to industrial work situations 
with words pertaining to an educational work 
situation. They used three statistical techniques to 
test the instrument’s adaptability to the educational 
situation. Principal-components, varimax orthogonal, 
and maxplane oblique R-factor analysis procedures 
were used to determine the factorial stability of 
the Educational Work Components Study, while 
Cronbach’s (1951) alpha coefficient was calculated 
to estimate the internal consistency of the items. 

The criteria for determining the number of fac- 

tors were the following: scree test, discontinuity of 
eigenvalues (Cattell, 1963), interpretability (Rum- 
mel, 1970), the findings of earlier research on the 
WCS (Borgatta et al., 1968), and Kaiser’s (1960) 
eigenvalue of one. In addition, five criteria were used 
in selecting items: (a) minimum factor loadings of 
40 after varimax rotation, (b) minimum pattern 
~ loadings of .40 after maxplane rotation, (c) mini- 
mum structure loadings as product-moment cor- 
relations of the items with the oblique factors 
(Harmon, 1967) of .40, (d) items loading on the 
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factors indicated by the orig velopers (Bor- 


gatta et al., 1968), and (e) | cross-loading 
on two or more factors. No ori preferences 
were made regarding the ice of these 
criteria. Consequently, when t teria were in 
conflict, a judgment was made > which made 
the most overall sense. 

The result was parallel meas »mposed of 56 
Likert-type items representine factors, whose 
forms are identical except for es to company 
or school. A description of the ctors based on 


the work of Ford et al. (4% id the highest 


orthogonally loaded item folloy 


and Develop- 
tains items to 
responsibility 
g was .63 for 
to further my 


1. Potential for Personal Cha 
ment (eight items). This fact, 
measure the desire for creativi 
in the job. The highest factor 
the item “I would have a cl 
formal education.” 

2. Competitiveness Dbi 
Success (seven items). These ite, 


d Reward of 
asure whether 


an individual seeks job situat in which the 
salary is determined by merit e competition 
is keen. The item “salary inc would be @ 
matter of how much effort in“ had the 
highest loading of .64. 1 

3. Tolerance for Work Pre e items). This 
factor contains items measu tudes toward 
situations in which the work | ight be exces- 
sive. The highest factor loadin 65 for the 
item “the work might build up res on me. 

4. Conservative Security (1! „ These items 


security with 
job routines. 


measure the individual’s desire 
well-defined promotion guidelines 
The item “the work would be 
hard to do” had the highest f: 

5. Willingness to Seek Rev 
certainty versus Avoidance 
items). This factor contains i 
individual’s willingness to do 
though it might be a temporary 
factor loading of .75 was found for 
is little permanency of positions.” f 

6. Surround Concern (11 items). These items 
measure the person’s concern with the hygien 
aspects of the job. The item “the ventilation 1 
modern” had the highest factor loading of -3 


The intercorrelations and Cronbach’s alpha ot 
cients as estimates of reliability for the facon i 
presented in Table 1. With the exceptions of Fa 
tors 3 and 5 correlating at .55 and Factors 1 0 7 
correlating at . 30, the six Educational Work 12 
ponents Study factors are relatively indepen 1 
The reliability estimates of the original seven fig 1 
66-item Work Components Study have a range billy 
68 to 8s (Borgatta et al., 1968). The h 
estimates of the six-factor, 56-item Educational ably 
Components Study Questionnaire compare 7 5 
with the earlier results, ranging from . 73 to 8 605 
reliability estimates of the 56-item Work aie 
ponents Study with the present sample was 
what lower with a range from .61 to .84 


Ept 

The Work ( vents Study and Educational 
Work Compon idy were administered as self- 
report forms. T pondents read: “How desirable 
would YOU co: each of the following items in 
a job for YO job where. . . . The items 
followed, each a five-category, Likert-type 
response vary rom “Completely undesirable, 


job” to “Extremely desirable, 
greatly.“ The response cate- 
scending values of one to five. 


would never t 
would favor 
gories were as 


Sampling Pr es 


tified random sample of 683 
trators was selected from three 
combined staffs totaling 4,200 


Educators. 
teachers and 
school district 


in a midwest: tropolitan region. The following 
three propor \ cere used in selecting subjects: 
(a) of the z staffs, 25% in the smallest 
district and | in the two larger districts, (b) 
50% of the trators, and (c) a separate sam- 
ple of 50 ele y male teachers. Using an origi- 
nal and two up mailings, analyzable returns 


ra 118 administrators and 432 
of 550 returns or 80.5% of the 


were receive’ 
teachers for 
selected sam; 
Business „ rs. A sample of 265 business 
managers wa iomly selected from seven indus- 
trial concerns be same metropolitan area, The 


representation un each type is as follows: 140 
from compu: md communication service com- 
panies, 60 fr: loihing, steel, and tool manufac- 


ad 65 from insurance and brok- 
g an original and one follow- 


S 


turing compa 
erage compan 


up mailing, 4% of the questionnaires were 
returned for 2 
Data Analysis Procedures 

Multiple-c ication analysis of variance was 


used to explore the overall relationships among the 
Variables and groups, while single-classification 
analysis of variance was used to establish those 
Variables that contributed to group discrimination. 
Where significant F ratios were found using the 
univariate procedure, the Tukey (a) and, in one 
Case, the multiple contrast post hoc procedures 
(Winer, 1971) were used to compare all pairs of 
group means. The omega-squared was calculated as 
outlined by Halderson and Glasnapp (1972) to 
estimate the strength of the variable relationships. 
ibe analyses were computed for four groups— 
15 ers, principals, central office administrators, and 
pe managers. Since the programs required equal 
48 ple sizes and the smallest group had 48 subjects, 
t subjects were randomly selected from each of 
e remaining groups. 


RESULTS 


ve initial step was using a multivariate 
analysis of variance procedure with the four 
ale groups as the independent variable 
the six Educational Work Components 
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TABLE 1 


INTERCORRELATIONS OF EpucaTionaL Work 
Components Stupy Factors AND ALPHA 
COEFFICIENTS FOR EDUCATIONAL Work 

Components STUDY AND PARALLEL 


Factor 1 | 2 | 3 | 4 | 5 | 6 
1. PPCD | .80 13 13 —AT 17 39 
69 
Pepes |» 2.78, 230 e e 
61 

3, TWP — 7 —,23  .85\ “=108 
78 j 

4. C8 = 81 —½ 20 

B4 
5. WSR E ree 
80 

6. SC SES 
32 


Note. Data for Educational Work Components Study are 
for 683 educators and for parallel Work Components S udy 
are for 192 business managers. Cronbach's alpha coefficients 

with the Educational Work Com- 


Study. PP Cha 
velopment; CDRS = Competiveness Desirability and Reward 


of Success; TWP = Tolerance for Work Pressure; CS = C-. 


servatiye Security; WSR 
spite of Uncertainty versus Avoidance of Uncertainty; sc 


= Surround Concern. 


Study-Work Components Study factors as 
the dependent variables. In the present con- 
text an approximate multivariate F statistic 
of 5.77 (df= 18/518, PS 001) was ob- q 
tained. Based on this significant F statistic, 
tests for simple main effects became neces- 3 
sary across each of the Educational Work 
Components Study-Work Components Study 
subscales. These further analyses are pre- 
sented and discussed in the following para- 
graphs. 1 
A summary of the data for the four groups 4 
across each of the six Educational Work $ 
Components Study-Work Components Study | ; 
factors comprises Table 2. It shows that the 
groups differed significantly (df = 3/188, 5 < È 
.05) across the following subscales: Competi- 
ability and Reward of Success, 


tiveness Desir „ 
Tolerance for Work Pressure, Conserv i 


Security, and Surround Concern. E 
Using the aforementioned post hoc pro- 

cedures to compare all pairs of group means, 

the following differences were obtained. First, 


s 


r 


mean for the business managers appeared to 
be much lower (see Table 2). Consequently, 
the multiple contrast post hoc procedure was 
used to explore these relationships further. 
The calculated contrast weights furnished 
evidence that the business group’s mean is 
significantly lower (p S. 01) than a combi- 
nation of the educator groups’ means on this 
subscale. Fourth, on the Surround Concern 
measure, the Tukey(a) procedure revealed 
that the business group has a significantly 
lower mean than has each of the three edu- 
cator groups. 

To assist the interpretation of these dif- 
ferences, the average item means and stan- 
dard deviations by subscale for each group 
also are presented in Table 2. From these 
descriptive data, three systematic trends in 
the relationships become evident. First, the 
three subscales composed of attitudes toward 
intrinsic rewards and risk—Competitiveness 
Desirability and Reward of Success, Toler- 
ance for Work Pressure, and Willingness to 
Seek Reward in spite of Uncertainty versus 
the Avoidance of Uncertainty—have means 
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TABLE 2 
818 SULTS $ Four í N THE 
NS, STANDARD DEVIATIONS, AND ANALYSIS OF VARIANCE RESULTS FOR THE fou 
8 —— Work Components Stupy-Work Components Stupy Fa th 
— — — f 
Teachers | Principals Administrators Managers | a 
Factor aS —— — 
* SD * | SD x SD x | SD | Bie 
a BP eel ere * í 00 
PPCD 4.2 5 4.0 4 4.2 A 1 ; | 
CDRS 33 7 3.4 3.9 1.3 4.0 A | ? 8 tk 5 
Twe eee | 4 | 96 1 
cS 28 6 2.9 6 2.9 6 2.6 | „5 | sith : 
WSR 2.3 5 24 5 2.5 1.0 2.6 | 6 2. L & 
sc 40 4 4.0 3 4.0 7 3.7 | A | 4.8 3 | d 
j i Pr EAT, j ; CDRS -C jetiveness De ind Reward a 
Susces; TWP 3 te Work — . WSR a Willingness to ard in spite df 
Uncertainty versus Avoidance of Uncertainty; SC = Surround Concern. 
on the Competitiveness Desirability and Re- that increase directly from t ers to . 
ward of Success subscale, central office ad- cipals to central office admin tors to busi- 
ministrators and business managers scored ness managers. Second, the t actors mea- 
significantly higher than principals and teach- suring attitudes toward extrin job a 
ers. Second, on the Tolerance for Work Pres- — Conservative Security ancl ee * 
sure factor, teachers scored significantly lower cern are approximately the me 15 i 
than the other three groups. Third, on the educator groups but marked wer ps 
Conservative Security subscale, the Tukey (a) business group. Finally, only on the su am 
procedure did not indicate any significant measuring attitudes toward sic 10 
differences among the groups even though the — Potential for Personal Ch. nge an 


velopment did all four group have approx 
mately equal mean scores. 
The omega-squared value: r 
sented in Table 2. These values generally 
support the F values and expiain 2% to 14% 
of the total attitudinal variance due to group 


differences. 


iso are prê- 


Discussion 


Based on these findings, the hypothesis that 
educators’ attitudes differ from business miu 
agers’ attitudes toward intrinsic-extrinsi¢ s 
outcomes and risk orientations was 2 15 y 
supported and now can be elaborated. 155 
educators and business managers are 25 4 
in their attitudes toward intrinsic work 1 
tors since no differences were found on J. 
Potential for Personal Challenge and w 
opment subscale. Apparently all of pase 
equally desire opportunities for DERE., 
responsibility. However, when a risk 1 al 
sion is added to the intrinsic orientation 
as in the Competitiveness Heir 
Reward of Success, Tolerance for 


in 
Pressure, and Willingness to Seek Reward 


„ factors—significant dif- 
nt trends emerge. The 
had the highest mean 
e subscales, while the 
vest although not signifi- 
ngness to Seek Reward. 
administrator groups 
teacher and manager 
atral office administrator 
her only on the Competi- 
and Reward of Success 
den attitudes toward ex- 
are compared in the Conserva- 
? Surround Concern factors, 
are similar, but the busi- 
is significantly different. 
g these results yields a concep- 
í differing group profiles. 
acers evidence attitudes with 
ity and less concern for 
\t the opposite pole, the 
w-risk propensity with a 
trinsic factors. The two 
trator groups, appearing 
‘middle of ‘he continuum, are similar 
attitudes to teachers in having a 
mcern for extrinsic factors and se- 
but when risk is attached to intrinsic 
ey resemble the business managers. 
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e K Turn, 
CLUSTER ANALYSES OF INTEREST INVEN‘ 


SCALES AS TESTS OF HOLLAND'S 
OCCUPATIONAL CLASSIFICATION ` 


DEAN H. NAFZIGER? axp SAMUEL T. HELMS 
Johns Hopkins University 


This study compared Holland's occupational categories with groups o 
pations that resulted from the application of McQuitty’s iterative 
columnar correlational analysis to the scales of the Strong Vocational 
Blank (SVIB), the Minnesota Vocational Interest Inventory (VII 
§ Kuder Occupational Interest Survey (OIS) for men and women. The 
8 indicated that clusters of occupations exist that are internally consistent 
y these usually agreed with the groups of occupations in Holland’s cla 


en 
* 


tion. The hierarchical structure of the clusters followed the hexagonal o 


5 
of Holland 's occupational categories suggested in other studies. In additi: 
usefulness of all three letters in Holland's occupational classificati: 


supported 


Considerable research has been conducted 
by Holland and his colleagues in testing 
and revising a classification of occupations 
(Holland, 1973; Holland, Viernstein, Kuo, 
_ Karweit, & Blum, 1972; Holland, Whitney, 
Cole, & Richards, 1969). The classification 
‘contains six main categories—Realistic (R), 
Investigative (I), Artistic (A), Social (S), 
Enterprising (E), and Conventional (C)— 
and 72 subcategories within the main cate- 
gories, such as Realistic-Investigative-Enter- 
prising (RIE), Realistic—Investigative-Social 
(RIS) and so forth. All occupational cate- 
gories were derived from a single set of co- 
ordinating definitions, six scale scores from 
the Vocational Preference Inventory, VPI, 
olland, 1970). 

A hexagonal ordering (see Figure 1) of the 
six types based on intercorrelations of the 
_ VPI scales was proposed by Holland et al. 
(4969). On the hexagon, the adjacent cate- 
gories are the most closely related types; the 
nonadjacent and nonopposite categories are 
the next most closely related types; and the 
posite categories on the diagonal are the 


This study was conducted under the Careers and 
Curricula Program at the Center for Social Organiza- 
tion of Schools, Johns Hopkins University, with 
funding support from the U.S. Office of Education. 
Requests for reprints should be sent to Dean H. 
Nafziger, Center for Social Organization of Schools, 
5 2 Hopkins University, Baltimore, Maryland 
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least related types. For 
tional types are most high!) 
istic or Enterprising types, 
to Investigative or Social 
related to Artistic types. 

This report investigated t 
the theoretical occupational 
tem developed by Holland. ‘I 
(a) to obtain occupational d=! 
independent of the informatii 
velop Holland’s classification, 
these occupational data usin 
technique, and (c) to compare ine 
the empirical grouping with Hol! 
retical grouping. The correspon 


CONVENTIONAL 
te) 


ENTERPRISING SOCIAL 
tE) (s) 


Ficure 1. A hexagonal model for defining the psych 
logical relationships among the Holland typ 


all 


rn 
The dita ux the analysis of the Strong 
Vocations! Inte: * (SVIB) were the inter- 


occupational scales of the 
pp. 456-488). Analysis of 


curmetetions for 
WTR (Campi 


ñe Minnesota \ | Interest Inventory (MVII) 
wm done wing < 21 intereorrelation matrix 
of the MVII oc |! scales found in the manual 
Kun & Cam Data used for the analy- 
th of the Kude: ional Interest Survey (018) 
were the inter: ns of the 23 core scales for 
ma and the |: ations of the 21 core scales 
ter women (K 1, pp. 56-59). 

Because this ı concerned placing objects 
. variables int: categories using multivariate 
information, a malytic teci was appro- 
piste (Borger s, 1971). Clustering of the 
wake was ac d by using iterative, inter- 
telamnar corr analysis (MeQuitty & Clark, 
148). The c at result from this analytical 
technique arc lly consistent types. In the 
technique, th ive division of the original 
istercorrelatio proceeds first by grouping 
many object latively few characteristics in 
Common and v grouping few objects with 
Many characte n common, Misclassifications 
curring at h vels are necessarily carried on 
te lower group : is useful to cross-validate the 
Gaiters with o iata, but this was not possible 
i the present 

To compare ting clusters and subclusters 
fe the grou; £ the Holland classification, 
Weighted sum: odes were obtained for each 

: ester and su ter. To accomplish this task, 

Holland code sted in his classification, were 
timed to th upations in the cluster or sub- 
Geter. In th sification, every occupation is 
Meigned a three-lciter code, which describes an 
Serupation’s chars-ieristics. Taken in order, the three 
ktters of the cod: indicate the primary, secondary, 
ted tertiary orientation of a job. For example, 


n 1 
Sdysician (ISA) is primarily an Investigative occu- 
pation, secondly Social, and thirdly Artistic. The 
333 derivation of the occupational codes has 
i summarized by Holland (1973). 

Alter assigning three-letter codes to the occupa- 
Sens in the clusters or subclusters, the number of 
a letter appeared in the first, second, or third 
of an occupational code was counted and 
à weight of 3, 2, or 1, respectively, to account 
e relative importance of a letter's position. 
— the weighted values for each letter and 
— ne all letters provided a group summary 
„ oct the cluster or subcluster. For large groups, 
x a in the first divisions of the occupations 
Cluster analyses, the group summary code 
5 the diversity of the individual occupational 
energy ever, it provided information about the 
At po Orientation of even heterogeneous clusters. 

Save = division points, the group summary 5 
the described the occupations within 
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wie ersteren 1 


“as 
Selestitic - Literary -Ar ristie 


* „ 


Biclegist ISR 
Chemist IAR 
ene JAR 
Architect ALR 


Poyehiotrist 184 
Physicice 184 
Dentist 18 


hich I and A interests (summary code IAS); 
the other contained natural scientists with 
predominantly I characteristics (summary 
code IA R/S). 

84 directed: Working with people. Occu- 
pations in this cluster were clearly oriented 
toward working with people. The demand for 
a high degree of interpersonal activity was re- 
flected in the strong S orientation of the 
codes, which mainly had SA, AS, or SI in the 
first two positions. The summary code for the 
cluster was SAI. The E category, which is 
an adjacent hexagonal category to S, occurred 
frequently in the codes of this cluster. In 
fact, two predominantly Enterprising occupa- 
_ tions—lawyer (EAS) and investment manager 
_ (ESC)—fell in this cluster. The second of 
these occupations was incongruent with the 
 SA-directed cluster. 

The SA-directed cluster broke into two 
_ subclusters, the basis for division apparently 
having been the level of personal involvement 
required. The first subcluster contained occu- 
pations that observe human behavior in a 


Ficure 2. Main Cluster I for Strong Vocational Interest Blank (SVIB) occupat 
abbreviated group summary codes and descriptive labels. 


sa 


Potitiec! 
Scientist 


detached manner, such as political scientist 
journalist, sociologist, and investment mam 
ager. The summary code for this subcluster 
was SAI. Occupations in the second sub- 
cluster—for example, social worker, priest 
lawyer, and music teacher demand di 
involvement with people. Its summary CON" 
A/S E/I, reflected a relatively more divers® 
composition. 


Main Cluster 2—Residual Occupations 


Main Cluster 2, shown in Figure 3, was 
more heterogeneous than the first large cluster ö 
It appeared to be a residual set of oc a 
that did not belong with the relative 
more homogeneous I/A S occupations 
Main Cluster 1. ie 

The two large clusters resulting from © 
subdivision of Main Cluster 2 were an © 75 
data-oriented cluster and a business of 
management-oriented cluster. The 1 
clusters appeared to be IR directed an 
second, SE directed. 0 pul 

IR directed: Data and object mm 


él 


Fone ©. .. a Cluster II for Strong. Vocational Interest Blank (SVIB) cccupetional cake with 
descriptive labels. 


m. The 18 occupations in this cluster were 
ted toward the manipulation of data and 


one Every occupational code but one began 
+ either R or I, and most of the codes had 
or IR in the first two positions. The sum- 
Mary code of the cluster was IRS. The joint 
1 I orientation of this cluster was con- 
a in view of the relatedness of the types- 
2 R and I types are relatively task ori- 
3 and nonsociable, preferring occupa- 
* situations with low-interpersonal in- 
That differences do exist between 
S 8 I categories was demonstrated 
“divided gamba cluster was further sub- 
Shih e resulting split left one cluster 
8 directed toward the I vertex of the 
. a a second cluster which was di- 
(RIA) oward the R vertex. Cartographer 
5 dn u (IRS) were the only 
upations that fell i 

a ell into an unexpected 


jects, which was reflected in their Holland opposi 


abbreviated group summary codes and 


It is noteworthy that a subcluster involving 
occupations occurred. R and S are y 


ent. The 


code for he 


managemen| 

cluster was SEI. More than any of the other 
clusters, the occupations in this group re- 
quired a high level of competence in inter- 


personal relations. 


Interpersonal involvement 
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was an important dimension in the cluster, 
as is typical in all E and S occupations. 

Subdivision of the SE-directed cluster re- 
sulted in two large subclusters. The first was 
a business, finance, and sales cluster, Combi- 
nations of E and C in the codes were typical 
in this subcluster, as expected from its busi- 
ness orientation. However, the S and I dimen- 
sions also appeared in these occupations as 
may be noted from the summary code EC 
S/I. Further division of this subcluster re- 
sulted in one group of occupations that ap- 
peared to be a pure set of ECS permutations 
(beginning with banker and ending with 
business education teacher, finance and sales 
occupations), and another group of occupa- 
tions that was difficult to interpret (beginning 
with Army officer and ending with pharma- 
cist). 

The second large subcluster of the SE- 
directed cluster was composed of occupations 
that involved the management of people in 
institutional settings. Consistent with Hol- 
land’s theory, the occupations in the sub- 
cluster were dominated by S and E codes, 
and the subcluster had a summary code of 
SEI. Three occupations with somewhat in- 
congruent codes appeared in the cluster: 
rehabilitation counselor (SIA), elementary 
teacher (SAI), and chiropractor (ISR). 

A further division of this second subcluster 
resulted in one group of occupations that 
involved sales and administration and an- 
other group that involved contact with people 
in educational and recreational settings. Most 
occupations in the sales and administration 
group had a first-letter code of E. With one 
exception, the occupations in the educational 
and recreational subcluster had a first-letter 
code of S. The Holland codes for these groups 
were consistent with the general orientation 
of the occupations in the clusters. 


MVII Clusters 


The MVII was developed to measure the 
interest patterns of men in nonprofessional 
occupations. When Holland occupational codes 
were assigned to each of 21 occupations in 
the inventory, 16 were classified as Realistic, 
two as Social, two as Enterprising, and one 
as Conventional. In Holland’s terminology, 
then, the MVII measures interests that are 
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characteristic of primar 
cupations. In general, 
occupations would be 
intragroup differences 
homogeneous group. 
The clusters resultin 
are summarized in Fig) 
division of the MVII 
yielded two large group: 
occupations) and Main 
pations). A people versi 
appeared to account fo 


ealistic type oc- 
ilysis of these 
vhich examines 
ests of a rather 


this analysis 
The first major 
ipational scales 
in Cluster 1 (9 
er 2 (12 occu- 
ect orientation 
grouping, For 


Main Cluster 1, the sum: ode was RSE; 
for Main Cluster 2, RIC 
Main Cluster 1—Orientec ard People 


e MVIE occu- 
zupations that 


The first large cluster 
pational scales consisted 


require interpersonal intei n, often in a 
salesperson-customer relat‘: ip. The sort of 
work done in these occupa is of the light 


duty, clean hands type. ne orientation 
toward people was reflecies in the Holland 
codes of the occupations. Al of the MI 
occupations that have codes beginning with 
8, E, or C were found in Main Cluster J. 

Main Cluster 1, having à summary code 
of RSE, branched into two subdivisions of 
three and six occupations each. The summary 
codes were SR I/C/E for the former, and 
RES for the latter. This grouping divided 
those occupations with largely a SR orienta- 
tion from those occupations with relatively 
less of S but with more E and C components. 
That a unique set of characteristics appeaft 
to accompany RS codes was again SEE e 
by the separation of warehouseman, 
only RS occupation, from the larger R 
subcluster. 


Main Cluster 2—Orientation Toward Objects 


The second large cluster was oriented 
toward object manipulation. In contrast Ta 
Main Cluster 1, this cluster was typified if 
occupations involving heavy work, manipU 
tion and modification of materials, and ee 
one’s hands dirty, Generally the occupatio 


151 1 Se; 
were blue-collar jobs requiring little 1 
interpersonal contact, and this was le A 
he codes 


by a paucity of an S dimension in ue 
Without exception the occupations in 


5 
| 


Test or HOLLAND’S OCCUPATIONAL CLASSIFICATION 


Ficure 4. M 


11 2 as the first letter of the 
and code. The summary code for the 
luster was RI“ 
At the first dis 


i ision of Main Cluster 2 a 
3 lean split occurred. One subdivision 
ae of five occupations that had identi- 
= à and second letter codes of RI fol- 
The 195 either S or E as the third letter. 

be er Lanne contained seven occu- 
7a ag either RC or RI as the first two 
E Mo appeared to be an RI-ori- 
Ei a and the latter an R C/ Tori- 
E e cluster further divided into 
Eo of two and one group of three 
tode. Th, s, each with the same RI summary 

fice F ae subdivision, the R C/I 
= dert vided into two groups of three and 
ae 9 each. The division occurred 
A 10 115 letter (a tie), with RC occupa- 
uping 05 together and RI occupations 
Expte mong themselves. There was one 

n to this: truck driver (RCE). 


K; 
uder DD OIS Clusters for Men 


Figu 
iim the : summarizes the clusters resulting 
analysis of the OIS core scales for 


3 Clusters I and II for Minnesota Vocational Interest Inventory (MVII) occupa- 
tic cales with abbreviated group summary codes and descriptive labels. 


men. The first major division yielded two 
groups: Main Cluster 1 (8 occupations) with 
an RIE summary code, and Main Cluster 2 
(15 occupations) with a summary code of 
ISA. Within Main Cluster 1 were occupations 
that typically deal with objects and data. 
Those within Main Cluster 2 may be de- 
scribed as quantitative and life science occu- 
pations combined with a subcluster of people 
and business-oriented occupations. This clus- 
tering in terms of object-data versus scien- 
tific-social-business was reflected in the Hol- 
land group summary codes of RIE and ISA 
for the two clusters, respectively. 


Main Cluster 1—Object, Data and Thing 
Oriented 


The occupational titles for the scales in 
this first primary cluster mirrored the orienta- 
tion (of this group) toward manipulation and 
modification of objects and data. Subdivision 
of Main Cluster 1 resulted in a group with 
five occupations and another with three 


occupations. In the subdivision with three 


occupations, the engineering occupations and 
engineering college major remained together 
and composed an object-data-oriented sub- 


N x 
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Ficure 5. Main Clusters I and II for Kuder DD Occupational Interest Surve: 


IS) core scales 


for men with abbreviated group summary codes and descriptive s. 


cluster with an IRE summary code. The 
occupations that remained together in the 
other subcluster had an RI summary code 
and, although not as homogeneous as the 
group of engineers, seemed congruent with 
one another. Four of the five occupations 
required activities, competencies and interests 
typical of Realistic people. The one exception 
was banker (ECI). 


Main Cluster 2—Science, Social and Business 
Oriented 


Scientific, social, and business occupations 
comprised this large group, and the ISA sum- 
mary code for the main cluster reflected these 
components. Those jobs which are primarily 
natural, physical, or biological scientists and 
generally quantitatively oriented remained 
together in an IA R/S subcluster; while those 
that are people and business oriented grouped 
together in an SET cluster. The summary 
codes described well the nature of the differ- 
ence between the occupations in the two sub- 
clusters. 


The occupations of the first subcluster al 


had codes that were permutations of Í, A,R 
and S. With the exception of architect, : 
began with I. (The L code was used A 
gross summary for the premedicine, iy 
macy, and dentistry major.) The nature 4 
the occupations and their Holland zue 
code of IA R/S reflected a good deal of a 
gruency. The subcluster further subdivi 
into two other groups, one seemed to Me 
more quantitively oriented and the other ™ 
life science oriented. oa 
The second subcluster, coded 1 
comprised of social and business 9 ua 
After further division, the four socials 
ented occupations remained together white 4 
three business occupations clustered toge si 
The former subcluster had a Holland 7 the 
mary code of S A/T, and the code 1 g 
latter was ECS. These codes were dul oups 
scriptive of the occupations within the 15 70 
Accountants, usually coded CES, 11 1 
tually all certified public accountants, 
the CIS code applied to them. 


A Te 
. Test oF HOLLAND’S OCCUPATIONAL CLASSIFICATION 


art 


FIGURE 6. 
for women 
derived fror 


male criterion groups.) 


Kuder DD OIS Clusters for Women 


a * analysis for the OIS core scales 
ape is shown in Figure 6. The two 
e e from the first major division 
841 AS cluster of 15 occupations and 
* i cluster of 6 occupations. The scales 
science 8 ed into a heterogeneous group of 
ne tea and business occupations and 
“femini Homogeneous group of traditional 
mne” occupations. 


Mai 
vat Cluster 1—Science, Social, and 
mess Oriented 


1 OIS core scales, the pattern of 
4 Very d cae Main Cluster 1 for women was 
Seat ar to Main Cluster 2 for men, and 
nea codes seemed consistent with 
1 5 rn. Differences that did occur in the 
patterns between men and women 


Business ond Sociol Service 


Social Case- 
worker 


sia 


Tain Clusters I and II for Kuder DD Occupational Interest Survey (OIS) core scales 
bbreviated group summary codes and descriptive labels. (Asterisk denotes scales 


were confounded with the differences in the 
job titles included in the two forms of the 
OIS making a direct comparison difficult. 

Division of Main Cluster 1 yielded a small 
cluster of business and social occupations, 
coded ESC, and a larger group of scientific 
and artistic occupations, coded IAR. The 
latter cluster divided predictably into one 
group of artistic jobs and two groups of 
scientific jobs. The Holland codes of the 
occupations within the subclusters were all 
closely related. One unexpected result was 
that lawyer (EAS) was grouped with chem- 
ist (IAR), mathematician (IRA), and physi- 
cian (ISA). When the cluster of business and 
social occupations was divided, social case- 
worker (SIA) separated from accountant 
(CES), business and marketing major (E ), 
and personnel manager (ESC). 


a aaa _ 


Dean H. Narzicer AND Samusi T. HELMS 


3 
4 
3 


352 


Main Cluster 2—" Feminine” Occupations 


Two points concerning the occupations 
within Main Cluster 2 are important: (a) 
None of these appear in the men’s profile, and 
all are traditional “feminine” occupations. 
(b) All of the scales were derived from 
female criterion groups (Kuder, 1971, p. 56). 
With the exception of art and art education 
major, all OIS women’s scales normed on 
female groups were grouped together in Main 
Cluster 2. The remainder of the scales were 
developed by comparing women’s responses 
to male criterion groups, and these were all 
grouped in Main Cluster 1. 

On this basis, Main Cluster 2 seemed to 
represent a homogeneous group of women’s 
occupational scales that was reflected in the 
SA I/E summary code. An analogous result 
occurred in Main Cluster 1 of the OIS core 
scales for men that contained primarily “mas- 
culine“ occupations. The remaining sub- 
clustering for the women’s scales seems 
_ somewhat unclear. In particular, two incon- 
_ sistencies occurred where secretary (CSA) 
clustered with two SAI occupations, and 
home economics education major (SAE) split 


| off by itself at a high level. In any case, the 
| six occupations in the cluster seemed to be 
_ highly homogeneous relative to the other main 


cluster. 
DISCUSSION 
The results of this study provided evi- 
dence for the validity of Holland’s occupa- 


cupations in the empirical clusters from the 
SVIB, MVII, and the OIS for both men and 
women possessed similar Holland codes. Sec- 


4 ond, the clusters suggested that a few broad 


groups of occupations exist that are inter- 
nally consistent. Third, the hierarchical 
structure of the clusters followed the hex- 
agonal ordering of the occupational categories. 

Interpretation of the results of the analyses 
must be done cautiously. Individually and 
collectively, the instruments that were ana- 
lyzed included relatively few occupations, so 
all interrelationships that occur in the world 
of work were not necessarily accounted for in 
these analyses. In addition, the clusters were 
largely instrument specific regarding the ques- 


tionnaire items and scorin Kis. Despite 
these precautions, a great { similarity 
was apparent among the r or the four 
instruments. 

The SVIB contains scali larger pum. 
ber of occupations than the instruments 
analyzed in the study. Othe the R cate- 
gory, each of Holland’s t adequately 
represented by the SVIB er levels of 
the cluster hierarchy for t! B, classifica. 
tions into groups occurred pect to the 
first letter of the occupati de. In large) 
clusters where two or r f Hollands 
types appeared the types usually hex 
agonally contiguous and, lated. Divi 


sions at lower levels on the er hie 
sometimes were predictable the st 
or third letters of the Fi code. Many 
clusters consisted of occu with Hol- 
land codes of the same ters but ia 
various permutations. 1 ilt suggests 
that all three letters in t land occup® 
tional code are important ribing 000i- 
pational characteristics. T M occupations 
in the clusters were not pe! homogeneosi 
may have been the result classifications 
in Holland’s scheme, defi es in the mer 
sürement instrument, or tho inability of the 
cluster-analytic method to juately 
differences in the data. 

The cluster analysis of the MVII provided 
support for the notion that within any ce 


category of Holland’s classification, occups 
tions may be further sudcategorized in $ 
meaningful way by using second and 
letters of the codes. The MVI scales at 
primarily Realistic occupations and, 
eral, represent that category of occupations 
When the MVII scales clustered, 
that involved contact with people stay% 
gether, and those that involved the man ved 
tions and modification of materials stäl 
together..The former group of occupa * 
required more social competencies an 
terests, while the latter required more a 
tigative competencies and interests. The S 
ond letters of Holland codes reflected `. 
difference in terms of the social and in 


te 


he 


gative components in the group sul 
codes. The results of the MVII a 
u 


offered the clearest support of the 


iy, 1971, 


wn 


\ Test or Horraxo’s Occurationat CLASSIFICATION 


cd letters of the Holland 


iers of the OIS core scales 
nen and women: each had 
iat appeared to be sex spe- 
un cluster was similar for 
x-related cluster for men 
Realistic occupations. For 
lated cluster was mostly 
occupations. In both cases 
consistent with popular 
stereotypes, and reflected 
oles that are found in our 


r that was similar for both 
occupations that were sci- 
ce, and business oriented. 
and subclustering were 
or the men and women and 
with results that were ex- 
of Holland’s codes. 
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Pe ers 
CHECKING GROUPS OF LETTERS * 


IAN E. GORDON ? axo MOIRA AMOS 
University of Exeter, England 


In Experiment 1, 25 subjects searched through paired groups of letters to 
check for identity; nonidentical groups (10%) were to be marked. The re- 

times increased with the number of letters per group (from 2 to 6) 
and with horizontal separation between the pairs of groups (1, 2, 4, and 8 
centimeters). The optimum number of letters for efficient error checking 
appeared to be 4 or $. In Experiment 2, 10 subjects searched through some 
of the Experiment 1 lists. Horizontal eye movements were recorded. Results 
indicated that: (a) the time spent looking at groups increased with the number 
ol letters per group, (b) during error checking the second member of a pair 
of groups was looked at longer than the first, and (c) commonly, with small 
groups, more than one group was processed at a time. 


afín aed da: Boe 


Strings of digits and/or letters appear in- 
creasingly as labels. For instance, a typical 
example of the new British postal codes is 
EX3 OEU. The present report forms part of 
an investigation into the effects of various 
spatial arrangements of such codes on effi- 
_ ciency of search. In the present study interest 
‘is confined to all-letter groups and the situa- 
tion investigated is error checking. Data on 
error-checking groups of digits have been 
published by Klemmer (1959) who found that 
the maximum rate of checking digits occurs 
when they are grouped into 3s or 4s. 

The two experiments reported were de- 
_ signed (a) to find the optimum size of 

groups for letter checking, (b) to investigate 
_ the effects of increasing the spatial separation 
4 between horizontal paired groups, and (c) to 
examine subjects’ scanning patterns by record- 
ing their eye movements during séarch. 


* 


$ EXPERIMENT 1 
Method 


Subjects. The 25 subjects were undergraduates at 
the University of Exeter. Their mean age was 19 
years 7 months, 

Material. Groups of random lower case letters 
typed in Remington elite were reproduced with good 
black/white contrast by Xerox. Twenty search lists 
were constructed, each consisting of 50 paired groups 
of random letters forming two columns 21.5 centi- 


This research was supported by Grant B/SR 
7092 from the Science Research Council. 

2 Requests for reprints should be sent to I. E. 
Gordon, Department of Psychology, The University, 
Exeter, England. 


meters high mounted symmetrically abor verti- 
cal mid-line of a 20 * 33 centimeter ° card. 
Thus each card contained two columns digits. 
Letters were typed single-spaced. Pair letter 
groups were identical except for five pairs n every 
list that differed by a single letter. Ti vertical 
position of groups containing these wr: letters 
was randomized and varied from list The 
position of the wrong letters within gro. ıs also 
randomized. 

The number of letters per group vari: m two 
to six. Groups were printed using fo ifferent 
horizontal separations: 1, 2, 4, and 8 imeters. 
These figures refer to the distance ccn the 
rightmost letter of the left group and leftmost 


letter of the adjacent right group. With 
used the horizontal extent of a six-lette: 
1.3 centimeters. 

Procedure. The subject worked through a 
lists (5 Groupings X 4 Spacings) arranges 
order. His task was to work down a 
paired groups. A check mark was placed 
pair that did not match. Times per list 
sions were recorded. A 20-second rest 
between successive lists, and a 2-minute rest Was 
given after the tenth list. The subjects were tested 
individually and sat with the search list placed on 
a horizontal surface in front of them. They started 
to search as soon as a list was uncovered, and they 
indicated when they had reached the end of the 
list. The instructions placed emphasis on speed and 
accuracy. Knowledge of results was not provided. 


Results and Discussion 


The results obtained in Experiment 1 are 
summarized in Figure 1 and discussed in 
detail below. 

Group size. The overall mean search times 
across all the spacings for the two- to si 
letter groups were: 34, 38, 44, 56, and 73 
seconds, respectively. However, the aim of 
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t | was to discover optimum group- 
the search times/list were di- 
‘e total number of letters/list to 
nore appropriate measure—letter- 
ate. 
shows changes in the processing 
ed in Experiment 1. It is clear that 
range of conditions studied, the 
rocessing occurs when letters are 
ito fours or (with eight-centimeter 
) fives. The influence of group size on 
statistically significant (F = 80, df = 
< .001). This is in good agreement 
emmer’s demonstration that groupings 
e and four are optimal for rapid 
g of digits. 
rs and digits. After the completion of 
vent 1, and at the suggestion of an 
ous referee, one of the checklists used 
constructed using random digits in 
f letters. Apart from the change to 
the new list was identical with the 
oup, four-centimeter list previously 
‘bed. Ten new undergraduate subjects, 
age 21 years, searched through this list 
vo occasions. The procedural details in 
small study were identical with those 
eviously described. The resulting overall 
Mean search time was 55 seconds, which is in 
_ Strikingly good agreement with the time for 
the comparable letter condition (56 seconds). 
~ Spacing between groups. Reference to Fig- 
ure i shows that, generally, the greater the 
" Spacing between groups the slower is the let- 
ter-processing rate. This is a relatively small 
effect, at least over the range of separations 
employed, as the difference between overall 
average processing rates at one-centimeter 
Separation and eight-centimeter separation is 
only 1.2 letters/second. The spacing effect is 
= Statistically significant (F = 35, df = 3/72, 
$ < .001) and is probably due To the extra 
_ time required to prepare and execute saccadic 


movements over greater distances. 


Errors. All recorded errors were errors of 
_ omission. The average number of omissions 
across conditions was 13.4%. Errors and 
Search times/list were related (p = +.76, 5 


E < 01), thus larger groups took longer to 


Search and were searched less accurately. 
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Ficure 1. Search rate as a function o 
and group spacing in Experiment 1. 
represents the average of 25 subjects. 


This experiment was designed after 2 
sis of data from Experiment 1. Given the 
limited extent of foveal vision, it is obvious 
that when the spatial separation between twoi =. 


eye movements are required in the compari- 
son of the groups. In Experiment 2 the wid- _ 
est separation (8 centimeters) used in EX. 
periment 1 was again used so as to require 
the subject to make large, easily monito 
saccades. a 

Experiment 2 was designed to find out (a) 
how the average duration of visual fixations 
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? ; Ficure 2. Eye movement responses to changes in 
_ group size in Experiment 2. Each point represents 
the average of 10 subjects. 


varies with the size of a letter group, (b) 
which member of a pair of groups is fixated 
longer, the first or second, (c) whether any 
first/second differences in fixation time vary 
with group size, and (d) to what extent 
increases in overall search time with larger 
group sizes are due to the subject fixating 
longer or more frequently, 
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Method 


Subjects. Ten undergraduates from the 
of Exeter served as subjects. Their me 
21 years 4 months. 

Material. Lists containing paired grou 
to six letters, all with eight-centimeter 
were used. They were identical with lis 
Experiment 1. 

Procedure. The experimental details 
similar to those described in Experime 
subject was tested individually on f 
presented in random order, The subject ` 
supported but not constrained by a chi 
card bearing the search material was me 
table in front of the subject. The card 
the subject and inclined at 45 degrees. T! 
from the centre of the card to the sub 
was 40 centimeters. 

Instead of marking each error item (as 
ment 1), the subject pressed a button tha 
mark on his eye movement record. He v 
20-second’ rest between successive lists. ‘I 
was specifically instructed not to turn 
reexamine earlier pairs in a list. 

During search, horizontal eye moven 
monitored by electrooculography. A silv 
was placed at the external canthus of ea 
movements were analyzed by direct 
of the output wave form from a pen r 
by a high-gain direct current amplifier. 


a 
t 
t 

0 


ler fed 


Results and Discussion 


The eye movement data from Experiment 
2 are unambiguous and are reporied fairly 
briefly. The main horizontal eye movement 
patterns observed during error checking are 
summarized in Figure 2. 

Preliminary trials with the investigators as 
subjects showed that, with a horizontal sepa- 
ration of eight centimeters between groups, 
there was never any difficulty in deciding 
from the eye-movement wave form which side 
of the display (left or right group) was being 
looked at. Electrooculography does not record 
very small microsaccades, and vertical eye 
movements were not recorded. Therefore the 
term “fixation” used below and in Figure 2 
means only that the gaze was unambiguously 
directed to one or other side of the display 
and that no detectable horizontal eye move- 
ment was occurring. 

Fixation time. As the number of letters / 
group was increased, the average time be- 
tween large saccades increased. This effect is 
Statistically significant (F = 7.7, df = 4/36, 
Y 01), and suggests that there is a se- 


que mponent in the processing of items 

T dur checking. 

` oj fixations/list. Increasing the 
nu letters/group produced an increase 
in er of visual fixations made during 
a iown a list. This effect is best de- 
scr terms of median numbers of fixa- 
tio list, and these are illustrated in 
Fi lhe result is statistically significant 


(F „ df =4/36, p< .01). It is clear 
5o hai when the subject checked groups 
‘ng four letters or less he did not sepa- 
sate every group. Indeed, with groups 
two letters the median number of 
made during comparisons of 100 
per list was only 61. 
versus right groups. From Figure 2 
be seen that in all conditions right 
ire fixated longer than are left groups. 
_ Thi “fect is statistically significant (F = 73, 
| dj 9, p < .05), and the overall difference 
i ‘in fixation times to left and right 
is approximately 60 milliseconds. 
error checking the first group (usually 
member of a pair) must be scanned 
emembered and then a stored represen- 
a is compared with the scanned input 
fro the second group. The process of com- 
ba n might happen in two ways: (a) Items 
_ fron: first and second groups might be com- 
pared sequentially and if this occurred 
during a fixation pause it would explain 


why right fixation times tend to be lo 
(b) The subject might compare items 
taneously—thus whilst scanning groups € 
letters appears to be a sequential proc 
comparison after scanning might be a p I 
process, This second conclusion is support 
by the fact that the upper pair of curves i 
Figure 2 are roughly parallel—if comparisoi 
time were a direct function of the number o 
items compared the curves would be expect 
to diverge as group size increased. 1 
The present results, together with those of 
Klemmer (1959), lead to two practical rec 
ommendations: (a) If digits or letters are ti 
be grouped to facilitate rapid error checkin 
groupings of three to five items result in 
optimum performance; use of group sizes out- 
side this range depresses performance. (b) 
Within the limits of a normal printed page, 
increasing the spacing between adjac 
groups slows performance; this effect is 
to remain a small one, provided that gro 


ever, placing groups within one centimeter of 
each other facilitates search and checking. 
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emphasis technique. 


Textbooks often employ typographical cue- 
ing (capital letters, italics, and colored type) 
"for parts of the lesson assumed to have 
“greater importance or difficulty for the stu- 
nt. Some research has indicated that such 
R cal cueing results in an improve- 
ment in retention of the emphasized material 
hen compared with material printed in 
standard homogeneous typeface (Hershberger 
Terry, 1965; Klare, Mabry, & Gustafson, 
1955). In spite of these findings, the wide- 
Spread use of typographical cueing has not 
been apparent in educational materials, either 
ecause of cost factors or the reluctance of 
authors and publishers of textbooks to desig- 
te some of the material as unimportant and 
deserving of only superficial reading, 
In the absence of typographical cueing, 
students are often faced with the task of 
_ deciding which portions of text material are 
important enough to be isolated for intensive 
_ Study or memorization. A recent survey by 
the authors revealed that of 200 randomly 
elected used textbooks for sale in a local col- 
lege bookstore, some 92% contained signifi- 
cant applications of “emphasizing techniques” 
that had apparently been performed by the 
previous readers on the text material. Among 
t the more popular of these techniques were the 


1This study was facilitated by a University of 
South Florida Research Council Release Time 
Grant to the first author. 
Requests for reprints should be sent to Robert 
IL. Fowler, University of South Florida, 830 First 
Street South, St. Petersburg, Florida 33701, 


EFFECTIVENESS OF HIGHLIGHTING FOR 
RETENTION OF TEXT MATERIAL * 


ROBERT L. FOWLER,’ axo ANNE S. BARKER 
University oj South Florida 


The effectiveness of “highlighting” selected text material for improving reten- 
tion was examined in 4 groups of 19 college students each. Although there 
were no differences among the groups in total score on a retention test one 
week after reading with different highlighting techniques or no highlighting, 
a detailed analysis implied that highlighting improves retention of selected 
text material and that active highlighting is superior to passive reading of 
highlighted material. Highlighting seemed most effective when the reader had 
maximum faith that the highlighter could discriminate between important 
material and trivia. Traditional underlining also was found effective as an 


uses of underlining, asterisks, pai 
checkmarks, and brackets, usually in 
or brightness level that contrasted v 


ieses, 
olor 


the 
text print. The most recent isolation ‘hod 
in widespread use in the textbooks eù ined 
is known as highlighting, or accentin tigh- 
lighting changes the background color > line 
of black-on-white print through the e of 
a special marking pen of a bright ela- 
tively unsaturated hue such as yellow. blue, 
or red. The highlighting procedur: nor- 
mally performed with each student seving as 
his own judge as to which and how much 


material should be isolated for study empha- 
sis. However, a number of cases have been 
observed in which a low-achieving student has 
attempted to improve his test performance 
by “studying” from a textbook previously 
highlighted by a high-achieving student. 
Two experiments were carried out to ex- 
amine the effectiveness of various highlighting 
techniques. The purpose of Experiment 1 was 
to determine (a) whether or not highlighting 
is effective as an aid to the retention of in- 
struction material, (6) whether or not active 
participation in the highlighting task en- 
hances the presumed beneficial effect, (c) the 
degree of agreement among students reading 
educational material for the first time regard- 
ing what portions of text should be empha- 
sized, and (d) the relationship between the 
proportion of material chosen for emphasis 
and retention. 7 
Experiment 2 compared the facilitating 
effects of two specific “emphasis” techniques 
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EFFECTIVENESS or HIGHLIGHTING For RETENTION 


as a ctention. Several previous studies 
hav: the effectiveness of underlining 
in r Cashen & Leicht, 1970; Idstein 
& J 1972), but highlighting has ap- 
pare been compared with underlining 
in re since the highlighter pen is manu- 
fact ecifically for use as an emphasizer 
and x approximately twice the price 
of a all point pen, this question is of 
son est, 
EXPERIMENT 1 
Method 

Sul 

1 jects were 78 undergraduate male and 
fem ‘udents enrolled in general psychology 
cou wticipation was voluntary, but students 
wer sed extra points toward the course grade 
bas ctention test performance. Since the text 
mat onsisted of course-related outside readings 


rele the students’ interests, intrinsic motiva- 
ssumed to be relatively high as well. 


Ma s 

2 ct readings were reprints from the Scien- 
tific rican (Heron, 1957) and Science (Milgram, 
197°, ling with the pathology of boredom and 
the — ‘al perils of city life. These proved to be 
inh interesting, yet basically unfamiliar to 
the .lents. The length of the articles together 


Wa: ut 8,000 words and was appropriate for a 
cour initial study period. 
The highlighter pen used was a relatively un- 
Satu i yellow (Major Accent #2500) which 
ed produced a colored line of approxi- 
-eighth-inch width superimposed on each 
cd line of text. This instrument was chosen 
because it is the most widely used highlighter in the 
immediate area and because the yellow (approxi- 
mately 570 nm) is near the point of maximum 
ote sensitivity for photopic vision (Forgus, 

6). 

The retention test consisted of 54, 4-alternative 
multiple-choice items, The items were CO) 
in order to produce the appropriate balance be- 
tween highlighted and nonhighlighted material for 
the group for which this was the relevant inde- 
Pendent variable. Eighteen test items were based 
on highlighted statements, and the remaining 36 on 
nonhighlighted statements. It was presumed that 
this test provides an opportunity for highlighting to 
Produce an effect on total score; that is, 33% of 
the items were drawn from only 20% of the 
material. Yet, ample opportunity existed for the 
student-highlighted groups (described below) to 
improve their test performance by selecting for 
highlighting part or all of the material from which 
the remaining 67% of the items was drawn. Based 
on the administrations reported in Experiments 1 


a S 


and 2, the test yielded a Kuder-Richardon 
ity of 78. 


Design 


Each subject was randomly assigned to one of 
groups. One group actively highlighted while 
ing (Group AHL). Each subject in this 
allowed to highlight as much of 
deemed appropriate, In a second 
ject was “yoked” to a member of Group 
the sense that he studied from the copy 
material previously highlighted by 
although these subjects were not 
done the highlighting. This was then, 
highlighting group (Group PHL), tud 
assigned to Group AHL failed to highlight any < 
the text material, he was permitted to i 


Q 


subject was excluded from the analysis on 
and another Group AHL subject was 
before yoking because of his bizarre and 
sive highlighting technique. 

A third group studied copies of the 


material was 1:4 and was based on a sui 
textbooks used by honor students locally, 

fourth group was a control group that studied the 
same material without any highlighting (Group 
NHL). Other than for the appropriate amount and 
type of highlighting, all four groups were r 
identically. 


Procedure 


All study, review, and testing sessions were co 
ducted in individual sound-insulated cubicles, 
though the assignment of subjects to n 
conditions was as random as possible, obviously, 
subjects in Group PHL were run after their yo 
mates. 4 
The subject was seated alone at a table in one of 
the cubicles. On the table was placed the approp: 
reading material with a blank cover sheet, 
experimenter then read standard instructions wh 
informed the subject that he would be given one 
hour to study the articles at this time and would 
return in exactly one week for a 10-minute review 
followed by a test. Groups PHL, EHL, and NHL 
were admonished not to mark on their copies of the 
articles, while Group AHL was shown how to use 
the highlighter pen for “emphasizing material you — 
may think is particularly important.” 5 

At the end of the one-hour study period, 
experimenter retrieved the reprints and asked the 
subject to refrain from discussing the material o 
procedure with anyone and to return in one w 
The title and reference location of the articles 
been removed from all copies to reduce the po: 
bility that any subject could obtain and read 
articles during the intervening week. One w 
later, the subject was seated in the same cubicle 
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before and given the same copy of the articles for 
review. No highlighting or other marking was per- 
d during this period. After 10 minutes of re- 
W, the subject was given the 54-item objective 
ination with instructions to answer all items 

in a maximum time of 50 minutes. 


k Results 


A completely randomized analysis of vari- 


ance failed to reveal reliable differences in 
tion test scores among any of the four 
experimental groups on this measure (F <1, 
dj = 3/72). However, a comparison of Groups 
and NHL on just those 18 items drawn 
from material prehighlighted for Group EHL 
showed the former (X = 13.3) to be signifi- 
cantly superior to the latter (X = 11.8, 7636) 
= 2.04, p < .05). Thus, Group EHL enjoyed 
an advantage on these selected items, but it 
3 8 sais k 
E not sufficient to significantly increase 
their total score over that of the control 
Group NHL. An examination of the varia- 
bility of the scores within groups called into 
question the power of the experimental de- 
sign employed to detect differences which 
might exist among treatment conditions. 
Therefore it was decided to perform a more 
detailed analysis on an item-by-item basis. 
Since the subjects in Group AHL differed 
widely with respect to the proportion of ma- 
terial they chose to highlight (range = 4.2% 
= 32.1%, X=16.2%) and consequently, 
varied in the amount of test item material 


they happened to emphasize for test review, it 


was necessary to examine the individual 


Ficure 1. Proportion of material highlighted in common by a given number 
of students. 


Group AHL booklets to determine the il 
effects of active versus passive highli ig. 
For example, some test items came m 


text material highlighted by almost all p 
AHL subjects, while other items were een 
from passages highlighted by no one; ‘he 
mean proportion of Group AHL sub ets 
highlighting the answer to a given tes 
was 41. The distribution of proport 
item material highlighted in common | 
given number of Group AHL subjects is given 
in Figure 1. 

This item-by-item analysis of each Group 
AHL booklet made it possible to compare the 
performance of Groups AHL and PHL sepa 
rately for highlighted and nonhighlighted 
item material. By employing a randomized- 
blocks factorial analysis of variance (Kirk, 
1968) with items as the blocking factor (con- 
sidering only those 52 items from material 
marked by at least one Group AHL subject), 
the probability of passing an item for both 
groups combined, given that the answer to it 
had been highlighted ( = 72), was signifi- 
cantly greater than if it had not been high- 
lighted (X = .62, F=17.17, df = 1/153, 5 
< 001), although the mean proportion of 
items passed did not differ significantly for 
the two groups (F = 2.33, df = 1/153, p > 
-10). Interestingly, when Tukey’s HSD test 
for pairwise comparisons (Kirk, 1968) was 
employed subsequently, an item was passed 
more often within the Group AHL when it 
had been highlighted (X =.76) than when 
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TABLE 1 
PROPORTION OF Irxus Passep ny Eacu Group unper Each Conprnox i 
— — . —— 
Item condition 8 
Group n . FAN 5 = i ex 775 2 ka RIA N p 
Highlighted | Items“ | Nothigh- one djacent | Items 
| s involved lighted involved i involved 
| | 
AHL 19) fea 417 62 600 — — 
PHL 19 | -69 417 61 609 — — 
EHL 19 | 74 342 ‘68 684 73 190 
NHL 19 | (66) (342)t -69 1,026 (.76)* (190% 
Note. AHL ctively highlighting; PHL = passive highlighting; EHL = experimenter highlighted; NHL = no highlighting, 
a No, items olved summed over all subjects in group. 
ems highlighted for Group EHL, 


it had not (& = .62, p< 01), but the same 
Was not true within Group PHL. Addition- 
ally, the proportion passing an item, given 
that it had been actively highlighted, was sig- 
nificantly greater than the proportion passing 
all items in Group NHL (& = .69, 2 = 2.59, 
_ P<.01), when Fisher’s test of significance 
of a difference between proportions was em- 
| ployed (Guilford, 1965). The same compari- 
_ Son involving Groups PHL and NHL showed 
the proportions passing to be not signifi- 
_ cantly different. The means for each group 
under each condition are given in Table 1. 
Inn an ancillary analysis, which was per- 
formed to test the effects of highlighting on 
 fetention of adjacent material (spread-of- 
emphasis effect), the 10 items taken from ma- 
terial adjacent to material prehighlighted for 
Group EHL were examined separately. It 
Was determined that for Group EHL, ad- 
jacent items were passed significantly more 
often than were items which were neither 
= adjacent nor highlighted ( = 3.44, df = 18, 
© ?<.001). However, these adjacent items 
Could have been, as a group, less difficult than 
the remaining items, since Group NHL also 
Passed them significantly more often (f = 
2.37, df= 18, p< .05). The question of 
__ Whether or not the superior retention of em- 
_ Phasized material occurs at the expense of 
the remaining text material cannot be ad- 
_ dressed directly from the results of this study. 
However, the proportion passing nonhigh- 
| lighted items was significantly less in both 
Soup AHL (z= 2.66, p < .01) and Group 
PHL (2 = 3.30, % .01) when each was 
compared to the total item performance in 


ere 


control Group NHL using Fisher's test of 
significance of a difference between propor- 
tions (Guilford, 1965). The rank order oi 
means in Table 1 also suggests that isolation 
may lead to better retention of the empha- 
sized portions as well as concomitantly poorer 
retention of the remaining portions. This is. 
sue is further complicated by the observed F 
superior retention of items from material 
adjacent to highlighted portions. 
The relationship between the proportion of 
material highlighted by members of Group 
AHL and their total score on the test was 
not statistically significant (r = —.287, dj = 
17, .20 > p > .10), nor did the degree of this 
relationship differ for Groups AHL and PHL. 
However, the correlation between the propor- 
tion of Group AHL subjects highlighting a i 
given item and the probability of their pass- A 
ing the item was reliably greater than zero 
(r= .295, dj 52, p< .05), although the 
same relationship for Group PHL was not 
(r = 239, df = 52, p> .05). As a further 4 
control for the possibility that the answers to 3 
the easier items tend to be highlighted, the 
relationship between the probability of a 
Group AHL subject highlighting the answer 
to an item and the proportion of Group NHL B 
subjects passing the same item was found to 
be insignificantly greater than zero (7 = .082, — 
df = 52, p > 90). : 


EXPERIMENT 2 
Method 
Subjects 


The subjects were 24 volunteers from a graduate 
course in principles of learning. 


TABLE 2 


COUNTERBALANCED ÜRDERS OF THE EXPERIMENTAL 

Conprrrons FOR THE Fier THREE 
Marut Ireus 

7 — = a 


Item number 


10 | 43 
highlight control 
control highlight 
underline highlight 
underline control 
control underline 

underline 


highlight 


Materials and Design 


__ The same journal articles and retention test used 
in Experiment 1 were again employed. However, 
the emphasis markings on the booklets were applied 
differently in order to employ a within-subjects 
desien comparing the effectiveness of red under- 
lining, yellow highlighting, and no-emphasis con- 
trol conditions. To accomplish this, the item analy- 
Sis of the reading retention test administered in Ex- 
_ periment 1 was examined to identify multiple-choice 
items that had positive discrimination indexes and 
moderate difficulty indexes (.35 to .85). From these 
‘selections it was possible to match items by threes 
according to similarity of difficulty level. A total of 
12 such triplets were available with each member of 
_ 4 triplet being assigned to one of the three experi- 
mental conditions for a particular subject in the 
following manner: (a) The text material comprising 
the answer to one of the items in a triplet was 
= emphasized by highlighting; (b) that answering 
another was underlined; and (c) the answer to the 
_ third was left unemphasized. Since there are six 
possible permutations of the three experimental con- 
ditions across the three items of a triplet, each 
Į utation was randomly assigned to four sub- 
$ ?’ booklets in order to completely counterbalance 
~ this condition. Table 2 shows the arrangement of 

the experimental treatments for each of the six 
_ groups (orders) of four subjects for the first triplet 
of similar-difficulty items. The same procedure was 
followed for each of the remaining 11 triplets. On 
each booklet, additional emphasis was applied until 
a total of 10% of the material was underlined and 
10% was highlighted. This was done to create a 
more realistic situation where not all emphasized 
Material contained answers to test items. 


Procedure 


During the regularly scheduled Tuesday night 
meeting of the course, students were asked to re- 
main after class if they wished to volunteer to 
participate in a “learning” experiment. Booklets 
containing an equal number of the six different 
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orders of technique application w: ibuted 
randomly to the 24 students who cd. A 
cover sheet on each booklet contain 


ctions 
concerning the procedure to be folloy e sub- 
jects were again allowed 60 minutes ıd the 
material originally and were told to s they 
would for an exam but not to make n on the 
booklet, 

They were also informed that, on t owing 
Tuesday night, they would be given u copy 
of the articles to review for 10 minut to a 
multiple-choice test based on the mate: In sub- 
jects studied the material and took tì n the 
same room at the same time. 

Results 

Although the retention test co d“ 54 
items, only the 36 items comp the 
matched triplets were examined as dent 
measures of performance. Thus, sub- 
ject’s score could vary from O to inder 
each experimental condition, Th data 
were analyzed as a randomized-bloc) valy- 
sis of variance with each subject bein lock 
(Kirk, 1968) and showed highly re! > dif- 
ferences in performance among the hasis 


techniques (F = 5.79, df = 2/46, 1). 
A posteriori comparisons between < 


of means were performed with Tuke\ 

test (Kirk, 1968) and revealed th: wean 
performance for underlining (8.61) to ve sig- 
nificantly greater ( < .05) than for de non- 


emphasized control condition (7.13). The per- 
formance under the highlighted condition 
(8.04) did not differ significantly from that 
under either of the other two conditions. The 
improvement in performance due to under- 
lining was approximately 21% which again 
corroborates the previous findings regarding 
the magnitude of this effect. 


Discussion 


The question of whether or not highlight- 
ing, as an emphasizing technique, increases 
the retention of text material must be an- 
swered with a qualified “yes.” In Experi- 
ment 1, the test items based on material 
highlighted were passed significantly more 
often than were the remaining items in every 
comparison performed. Yet the total test score 
means did not differ significantly among any 
of the four treatment groups. In the same 
study, the improvement in performance by 
highlighting over the appropriate nonhigh- 


EFFECTIVENESS or HichucutTme ron Rerention 


lich: condition ranged from about 
12° compared to about 18% in a 
simi employing underlining by 
Cas! cicht (1970); thus, allowing for 
the ; that increased retention of em- 
phas terial may adversely affect the 
reten the remaining material, a result 
cons vith many studies dealing with 
the storff effect in verbal learning 
revi Wallace (1965), a liberal esti- 
mate overall increase in test scores due 
to en ing would be on the order of 6%- 
7%, is considerably less than both the 
obser tandard deviation of total test 
Score the error of measurement due to 
test iability, This result could account 
for | ilure of some investigators to find 
posi! sults in total test score measures 
when test contains items from both em- 
phasi nd nonemphasized text (Idstein & 
Jenkir , 1972), 

Pe the most interesting finding was 
the sı performance of Group AHL rela- 
tive t oup PHL on those items based on 
mater. actively highlighted by the former 
and ively read by the latter. Since these 
grou; i not differ in performance on non- 
high! d items, the implication is that 
active ,<rticipation in the emphasizing proc- 
ess is necessary for optimum results. It may 


also Le ‘hat students are more likely to mark 
materio! that is easier to retain, or that con- 
tains information with which they agree. 
Moreover, the superior retention of empha- 
sized statements by Group AHL may simply 
reflect the fact that active highlighting of a 
text statement guarantees that it has been 
read, whereas passive highlighting does not. 
Thus, exposure rather than isolation may be 
the controlling factor. Tir 
The choice of material for active highlight- 
ing by the students did reflect a relatively 
high degree of discrimination between state- 
ments suitable for objective examination and 
those serving merely as connecting discourse, 
that is, the average of 16% total material 
marked contained over 40% of the retention- 
test-item answers. Nevertheless, the large 
within-group variance on this factor plus the 
fact that the greater portion of retention test 
Material was not highlighted may have con- 
tributed to the relatively poorer performance 


on the highlighted items in the passive co 
dition, 


maintain that the encoding (by underlining) 
of text material is effective if done for the 
reader but not if done by the reader since he 
must guess what to encode while being unin- 
formed about the contents of a subsequent 
(retention) test. This interpretation is con- 
sistent with the positive results obtained in 
experiments where material is preemphasized 
(Cashen & Leicht, 1970; Crouse & Idstein, 
1972) and with the weak or negative effects _ 
resulting when readers do their own . 
sizing (Idstein & Jenkins, 1972; Stordahl & 
Christensen, 1956). The results of Experi- 
ment 1 support this view in that the four 
groups did not differ in terms of total test 
score performance, whereas Group EHL did 
outperform control Group NHL on those 
items prehighlighted for the former. The de- 
gree of enhanced retention of isolated ma- d 
terial by passive readers may ultimately de- 
pend on the amount of faith they have in 
the judgment of the person who did the 2 
highlighting. Unlike the von Restorff effect 
obtained in serial lists where retention of all 
elements in the list is required, the theory of 
emphasis employed in the present study as- 
sumes the differential importance of the vari- 


ous text segments and that some portions X 


should be retained to a greater extent, even 
at the expense of the remaining portions, It 
follows that a student studying from a pre- 
highlighted text must believe that the more i 
important statements have been emphasized 
on his copy or he may ignore, rather than 
concentrate on, the highlighted portions. In 
fact, a bias could develop in the early stages 


of studying an assignment when the student a 


notices a discrepancy between his judgment of 
what is important and what he observes on the 
prehighlighted text. 


The data of Experiment 2 suggest that the 7 


more expensive and special-purpose high- 
lighter pen is no more effective as an empha- 
sizer than are more traditional techniques 
such as a ballpoint pen containing ink of a 
different color than the text material print, 


het i irl hi 


observed relative ineffectiveness of the 
highlig pen is consistent with the weak 
fiect found under the passive conditions in 
Experiment 1. Perhaps a different outcome 
would be obtained if the subjects were al- 
lowed to select their own emphasis technique 
and then actively apply it while reading the 


a Thus, the findings confirm that highlighting 
can produce some improved retention and 
that traditional underlining of portions of 
‘text material may be as good as use of the 
more expensive highlighter pens. The evi- 
dence suggests further that readers of in- 
structional material containing little or no 
typographical cueing may benefit from high- 
lighting done previously by others (especially 
if done by students seen as high achievers). 

However, students retain more if they do the 
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A TOR ANALYSIS OF AIR PASSENGER REACTIONS TO 
\CKING AND AIRPORT SECURITY MEASURES AS 
LATED TO PERSONAL CHARACTERISTICS AND 

ALTERNATIVES TO FLYING * 


MICHAEL R. COOPER? 
GTE Laboratories, Inc., Waltham, Massachusetts 


CHARLES E. BOLTWOOD axo ROBERT J. WHERRY, Sx 
Ohio State University 


\ 28-item questionnaire administered to 127 airline passengers at Columbus 
International Airport, Ohio, was factor analyzed and rotated to orthogonal 
hierarchical structure. Results indicated that passengers did respond differ- 
cntially to items concerning their reactions to hypothetical skyjacking situa- 
tions and airport security measures when items were couched in terms of 
willingness to fly as contrasted to probability of actually taking a given 
fight. Subgroups were differentiated by one or more of the following vari- 
ables: age, sex, enjoyment of flying, distance to destination, and the extent 
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nt studies (Boltwood, Cooper, 
\Vashburn, 1972; Cooper, Fein, 
& Boltwood, 1971) have shown 
that difievent passenger groups reacted dif- 
ferentia!iy to the elaborate security measures 
designed io combat skyjacking. The present 
Study attempts to investigate the extent to 
Which factors accounting for these differen- 
tial reactions could be explained by means 
ol personal variables or passenger preferences 
for various modes of transportation. 


— 


This study was conducted while the first author 
search associate in the Organizational Re- 

e, Department of Psychology, Ohio 
State University. The views expressed in this article 
are th 
Teflect the views of GTE Laboratories, Inc., or 
Onio State University. 

*The authors wish to thank Victoria Fein Bern- 
Stein for her invaluable assistance during the pre- 
liminary phases of the study, Thomas M. Ostrom 
for his comments on the construction of the ques- 
tionnaire, and the late Milton D. Hakel, S. Rains 
Wallace, and Michael T. Wood for their construc- 
tive criticisms of earlier drafts of this report. Also, 
die cooperation of the Federal Aviation Administra- 
tion and Daniel Ginty of Columbus International 
Airport is gratefully acknowledged. 

Requests for reprints should be sent to Michael 

Cooper, Behavioral Science Research, GTE Lab- 
Sratories, Inc., 40 Sylvan Road, Waltham, Massa- 
chusetts 02154. 
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of their dislike for buses. Other than buses, preferred modes of travel appeared 
relatively unrelated to favorableness or unfavorableness of Passenger reactions. 


METHOD 


A 28-item questionnaire was constructed to mea- 
sure the following variables: 


A. Willingness to fly after seeing or hearing about 
(1) metal detection devices, (2) armed se- 
curity guards, (3) frisking, (4) gencral se- 
curity level, (5) a recent skyjacking, (6) a 
passenger death due to skyjacking, (7) past 
threat of skyjacking. 

B. Probability of still taking a trip after secing 

or hearing about 
(8) metal detection devices, (9) armed se- 
curity guards, (10) frisking, (11) general 
security level, (12) a recent skyjacking, (13) 
a passenger death due to skyjacking, (14) 
past threat of skyjacking. 

C. Personal Characteristics 
(15) sex, (16) age, (17) enjoyment of fly- 
ing, (18) distance to destination. 

D. Preferred Mode of Travel for Short Trips 
(19) Bus, (20) Car, (21) Boat, (22) Train, 
(23) Plane. 

E. Preferred Mode of Travel for Long Trips 
(24) Bus, (25) Car, (26) Boat, (27) Train, 
(28) Plane, 


The questionnaires were randomly administered 
at the Columbus International Airport, Ohio, over 
a one-week period to 127 passengers within the 
general waiting area, Only two persons refused to 
be interviewed. The sample comprised a variety of 
flights, both short and long, for passengers of both 
sexes and a wide age span. 


365 


{ 
| 


TABLE 1 
Wauwoxrss, Paosasiusry, ANO PERSONAL 
Cuasactezmmec Factors 
Metal detectors Ed 
mes gasi ds = 
wenrhy H 
recent we 
peek N 
1 
of Fiyise 
detectors 2 
weards “J 
s 
meurlty J 
roest " 
S&ypackhing, wevere 1 
ban 
threat os 
Characteristics 
-35 
Js 
-03 


1 
8 


1 
gt 


Note. Variables with no loading greater than . II have been 
omitted. 


The variables were intercorrelated and factor 
analyzed (principal factor plus minimum residual 
to ckan up communalities). The cight factors 
extracted (highest residual equal to .09 and lowest 
eigenvalue equal to .30) were subjected to a vari- 
max rotation followed by a hierarchical rotation 
(Wherry, 1959). The hierarchical rotation resulted 
in introduction of three subgeneral factors. 
reader should bear in mind that the findings 
of this study apply only to airline passengers and 
not to the general public or even to travelers in 


RESULTS AND Discussion 


_ Willingness, Probability, and Personal 
Characteristic Factors 


Significant factor loadings on the question- 
naire items related to willingness to fly and 
probability of flying were restricted to the 
first and largest subgeneral factor with four 
of the primary factors clustered under it. The 
_ loadings for these factors appear in Table 1. 

Subgeneral Factor A—General Favorable 
Reaction to Situations, This factor is com- 
posed primarily of older males who disliked 
buses for short trips and who had little reac- 
tion of any kind to the past threat of sky- 
_ jacking. They were either so favorably im- 
pressed with the various security measures 
designed for their protection or so little 
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frightened by reported sky: (either 
recent or severe) that they ported a 
high willingness to fly and a h bability 
of taking their flights. The su s under 


this Subgeneral A originally wed this 
same pattern to some degree; er, that 
overlap has been removed by ! urchical 
rotation. 


Factor 1—Relatively High | seness 10 
Fly. This subgroup of mal veling to 


nearby destinations, who had ted more 
negatively to the past threat jacking 
were apparently more willing y when 
metal detectors, armed guards ng, and 
a general high-security level we: idence. 
Their nearby destinations may nvolved 


smaller planes less apt to be sh i toa 


foreign country, 

Factor 2—Relatively High ! ility of 
Taking a Flight, This subgroui persons 
about to embark on relatively flights, 
who had responded neither n ly nor 
positively to the past threat í yjacking 
were responding more highly wit pect to 
actual probability of taking thei ts than 
in terms of willingness to fly. proba- 
bility of flying was increased by signs of 
protection, with the exception O -actal de, 
tectors, and especially by the gener! level of 


security. They had not responde) much to 
the past threat of skyjacking and were not 
affected by hypothetical reports oi recent OF 
severe skyjackings. i 

Factor 3—Relatively Favorable Reaction 
Despite Skyjacking. This subgroup of men 
embarking on relatively short trips indicated 
relatively high willingness and probability of 
flying with respect to both the past threat of 
skyjacking and the two hypothetical sky- 
jackings presented. Their willingness to fly 
was increased more by seeing passengers 
frisked than by other signs of security pre- 
cautions. 

Factor 4—Relatively Favorable Response 
Despite Past Skyjackings. This subgroup of 
relatively older passengers who did not like 
to fly but nevertheless preferred planes do 
buses for short trips responded particularly 
favorably despite past skyjackings which 
taken place. Their probability of flying Was 
increased when metal detection devices wert 
used. 


Se ee a a © 


‘JACKING, Scr Reactions, AND Presowar Vana EN 


Thus tors indicate older men are TABLE 2 
less ap! ntimidated by either past or Cuoicns or Traver Moon rou Saosr axe 
presen: betical) skyjackings or by Lowe Tuir Facross 
signs oí cd security activity. The lour ===; i 
subfact ate that age, sex, enjoyment 
of fyir nce to destination, and dislike 
for b v cause certain subgroups to 
respon: otially to the various threats 
or pre with respect to their willing- 
ness 01 mation to actually fly. 

Mode isportation Factors 

The ing two subgeneral factors, each 
associa ‘ith two primary factors, are 
mostly wd to varying preferences for 
modes avel as related to distance of 
destin One of the subgenerals is mini- 
mally | to willingness to fly in view of 
past ere skyjackings with one of its 
prima clated to flying enjoyment. Load- 
ings a en in Table 2. 

Su? Factor B—Preference jor 
Plane + Boats or Cars for Long Trips. 

This of passengers, who still showed 


slight cher willingness to fly in the face of 
Past o -evere skyjacking reports, preferred 
Plane. cr either boats or cars for long trips 
(over g miles). The two subordinate fac- 
tors rely show increased antagonism 
toward cither boats or cars but not toward 
both. 

Factor S—Preference for Planes over Cars. 
This sut xroup indicates further preference for 
Planes over cars. Their long trips may have 
been intracontinental where boats would not 
have been a feasible alternative. 

Factor reference for Planes over Boats. 
This subgroup possibly represents persons 
who are planning trips across oceans to for- 
eign countries where boats would have been 
the only alternative. 

Sudbgeneral Factor C—Boats over Trains 
for Long Trips; Planes or Cars over Trains 
for Short Trips. This subgroup is primarily 
marked by a dislike for trains, especially 
for long trips, The two group factors repre- 
Sent smaller subgroups of individuals. 

Factor 7—Planes over Cars for Short Trips. 
This subgroup, when boats are not available, 
expresses a strong preference for planes over 
cars for short trips only. Trains may not have 

available for the short trips in question. 


— i ~ | tH 


when frisking, use of metal detectors, 


. especially when armed guards were in 
evidence. (c) Persons who were little affected, 


_ embarking on relatively short trips showed 
more willingness and were actually more 
likely to fly despite past actual skyjackings 
or hypothetical skyjackings. They seemed to 
be particularly assured by the evidence of 
= frisking. (e) Older passengers who did not 
like to fly but were strongly opposed to trav- 
-eling by bus seemed to be least disturbed by 
past skyjackings and most reassured by the 
use of metal detectors. (f) Planes were pre- 
ſerred over cars and/or boats for long trips 
by persons whose willingness to fly was not 
_ shaken by previous, even severe skyjacking 

incidents. (g) Some members of the sample 
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preferred boats over trains 
(A) Certain members of the sa: 
planes over cars for short trips 
members, who were especially í 
and preferred planes over trains for Tong 
trips, nevertheless preferred cars over trai 
for short trips. 
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SHORT NOTES 


CATION OF WHITE-COLLAR-BLUE-COLLAR DIFFERENCES 
` SOURCES OF SATISFACTION AND DISSATISFACTION 


THOMAS C. HARRIS axo EDWIN A, LOCKE?* 
University of Maryland 


nall samples of white-collar and blue-collar employees were asked to describe 
urces of satisfaction and dissatisfaction on the job. These critical incidents 

re Classified using the Schneider-Locke event-agent classification system. 
t was found, consistent with previous findings, that white-collar employees 
vere more likely to derive satisfaction and dissatisfaction from “motivator” 
vents (especially achievement/failure), and blue-collar employees from “hy- 


gienes” (especially money). 


Prey studies have indicated that white- 
collar yees more often get pleasure from 
intrins motivator factors and blue-collar 
emplo om extrinsic or hygiene factors 
(Herz \fausner, Peterson, & Capwell, 1957; 
Locke . The present study attempted to 
replic ese results using the Herzberg critical 
incide ethod and the Schneider & Locke 
(1971 t- agent classification system. 

METHOD 

The subjects were 31 white-collar and 14 blue- 
collar employees from three organizations. Four- 
teen white-collar subjects were professionals, 
while the remainder were clerical. Fourteen 
white-collar employees were male. All of the 


blue-collar subjects were male; 13 were book- 
binders, while 1 was a carpenter, The mean 
tenure of both groups was 4-5 years. The mean 
age of the blue-collar group was higher (43 years 
Versus 30 years), Internal analyses indicated, 
however, that neither the age nor the sex differ- 
ences between the samples could account for the 
differences reported below. 

Each subject was asked anonymously to de- 
Scribe one satisfying and one dissatisfying inci- 
dent and to classify the main event and agent 
that characterized each incident. The Schneider 
and Locke event categories were: task activity, 
amount of work, smoothness of work, achieve- 
ment/failure, promotion, responsibility, recogni- 
tion, money, interpersonal relations, and working 
ae 


* The authors wish to thank William Jeffries and 
William Roach who aided in the collection of the 
data for this study. 

Requests for reprints should be sent to Edwin A. 
Locke, College of Business and Public Administra- 
tion, University of Maryland, College Park, Mary- 
land 20742, 


conditions (for complete descriptions, see Locke, 
1973). The first seven categories were classified 
as “motivators” and the last three as “hygiencs” 


(Herzberg, Mausner, & Snyderman, 1959). Since 


Grigaliunas and Herzberg (1971) indicated that 
subjects cannot always be trusted to categorize 
accurately, the incidents were recoded indepen- 
dently by the two authors into event categories. 
(Agent recategorization was not done, due to 
lack of sufficient detail in the stories.) There was 
80% agreement on the individual categories, but 
92% agreement as to whether the incident was a 
motivator or a hygiene event. Disagreements 
were resolved by discussion. The investigators’ 
coding agreed with the subjects’ coding only 
48% of the time based on individual categories, 
but 87% of the time based on the motivator/ 
hygiene classification. 


RESULTS 


The 2 X 2 frequency analyses of motivator/ 
hygiene by satisfaction/dissatisfaction for both 
white-collar and blue-collar employees and for 
both subjects’ classifications and investigators’ 
classifications yielded no significant chi-squares. 
This indicates that, for both employee groups 
and both methods of classification, the same 
events were seen as leading to both satisfaction 
and dissatisfaction. Thus for the remaining event 
analyses, the satisfying and dissatisfying incidents 
were combined. 

Table 1 shows the frequency of motivator and 
hygiene events for the white-collar and blue- 
collar samples. Both the results based on the 
subjects’ own classifications and those based on 
the investigators’ classifications are shown. De- 
spite the subject-experimenter disagreements 
concerning the coding of individual incidents, it 
is evident that the overall frequencies are similar 
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TABLE 1 


FACTION AND DISSATISFACTION ror WHrTE-COLLAR 
AND BLUE-COLLAR EMPLOYEES 


| 
I 


Employee 
Causes of satisfaction 
and dissatisfaction 
= White-collar | Blue-collar 
Motivator event 52 15 
52t 17% 
Hygiene event 10% 13 
10% 11” 
x 9.31 * 
S. 78 . 
s Authors’ coding. 
> Subjects’ coding. 
*p <05. 
„ <.01. 


for both methods of coding. The chi-squares in 
both cases were significant at p S. Os or better, 
indicating that white-collar employees are more 
likely to report getting pleasure and displeasure 
from motivator events and less likely to get these 
experiences from hygiene events than are blue- 
collar employees. 

This overall difference was caused almost ex- 
clusively by the more frequent mention of 
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_ achievement/failure events among white-collar 
i employees and the more frequent mention of 
i monetary events among the blue-collar workers. 
These data are shown in Table 2. 
3 With respect to the data for agents, white- 
4 TABLE 2 
S FREQUENCY OF MENTION OF ACHIEVEMENT/FAILURE 
s AND MONEY As SOURCES OF SATISFACTION AND 
3 DISSATISFACTION 
2 
4 E 
3 Causes of satisfaction mPa 
and dissatisfaction ` 
White-collar | Blue-collar 
Achievement/failure 268 qs 
12 1% 
Money 2a 7 
3b 5b 
x 21.43% 
7.29% 
dee coding, 
i. 
= 8 


Morwatons versus Hycwxes as Sources or SATIS- 


Snort Notes 


collar employees had a signific dency to 
take credit for satisfying event to blame 
others for dissatisfying even = 10.15, 


~<.01). This result did not ate with 
the blue-collar employees. There o overall 
difference (combining satisfying an atisfying 
incidents) between the blue-col d white- 
collar samples with respect to att: ig events 
to the self versus to others. 
DISCUSSION 

These data were congruent wit 1s find- 
ings regarding white-collar-blue-co erences. 
The failure to replicate Herzber o-factor 
theory when the event-agent cla: ion sys- 
tem is used also agreed with pri findings 
(Locke, 1973; Schneider & Locke, Finally, 
in one respect the results replicate findings 
of Grigaliunas and Herzberg (197 iat sub- 
jects and trained coders may dis. substan- 
tially in their coding of incidents. he other 
hand, the disagreements in the 7 it study 
were predominantly within the moti: r or hy- 
giene categories rather than = ese cate- 
gories. The result (Table 1) was ~-hstantial 
overall agreement when the individ categories 
were grouped according to the moti, hygiene 
dichotomy. In contrast, there was substantial 
disagreement across motivator/hygien: categories 
in the Grigaliunas and Herzberg study. This may 
have been due, in part, to the logics! inconsist- 
ency in the Herzberg classification system which 


confuses agents with events (see Schneider & 
Locke, 1971, pp. 442-443). 
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CORRELATES OF JOB SATISFACTION IN 
NAVAL ENVIRONMENTS! 
BLAIR W. McDONALD axb E. K. ERIC GUNDERSON * 
Navy Medical Neuropsychiatric Research Unit, San Diego, Colijornia 
Demographic, military status, job-related and health-related variables were 
correlated with a job satisfaction measure for 5851 Navy enlisted men. 
Health-related variables were most highly correlated with satisfaction, followed 
by type of job and seniority. A multiple regression equation with six variables 
(symptom index, type of job, length of service, specific ship, number of men 
supervised, and worked closely with others) yielded a cross-validity coefficient 
of .49. The results indicated some important correlates of job satisfaction 
in the naval environment and suggested hypotheses for further investigation, 

T ‘erminants and consequences of job ranean area, The duration of deployment for the 
sat n in military settings have received six ships ranged from 6 to 8 months, The sub- 
rel little attention except in the area of jects were tested enroute to their overseas sta- 
care cntion. In fact, very few studies have tions under the Supervision of experienced re- 
bee lucted of job satisfaction in enlisted search teams Approximately 80%-90% of the 
per since the large-scale Army studies of enlisted personnel aboard the cruisers and the 
Wor ar II (Stouffer, Suchman, DeVinney, battleship participated in the study, while ran- 
Star Villiams, 1949). dom samples were drawn from the carrier crews. 

T rpose of the present study was to deter- Approximately one half of the Subjects had two 
mir me of the important correlates of job years or less of military service and were under 


n as reported by Navy enlisted men 
in o ional settings. Further, it was felt that 
an stigation of this type might provide a 


basi: ‘or more detailed investigations aimed at 
imp zents in the level of satisfaction within 
the Navy, 

METHOD 
Subjects 


Subjects were 5,851 enlisted Navy personnel 
aboa: six ships (three cruisers, two aircraft car- 
riers, and one battleship). Five of the ships were 
deployed to Southeast Asia, and the sixth ship 
participated in training exercises in the Mediter- 
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values for the three items were summed to ob. 


21 years of age. Slightly more than 60% of the 
subjects had completed high school and about 
30% were married. 


Procedure 


Personal history, military status, and job satis- : 
faction information were obtained from a single 
questionnaire, All questionnaires were checked 
for completeness and appropriateness of re- 
sponse, and about 3% were discarded because 
they were incomplete or invalid. 

The measure of job satisfaction used for the 
major study was composed of three items. The 
key item, that is, the one given greatest weicht 
i was: “How satisfied are you with 
your present job?” The response alternatives for 
this item were: “Very much,” “Slightly satis- — 
fied,” “Slightly dissatisfied,” and “Very dis- 
satisfied”; these were scored 4, 3, 2, 1, respec- 
tively. The second most important item was: “Do a 
your present duties employ your abilities in the 
best possible way?” The response alternatives 
were: “Not at all,” “Partly,” and “Very much,” 
scored 1-3. The third item—“Are you often 
bored? —had only “Yes” (scored 1) or “No” 
(scored 2) as response alternatives. The response 


tain the total job satisfaction score. The internal 
consistency reliability estimate, based on average 
item intercorrelations and corrected by the 


} ogra, 
military status, and job-related information all 
were derived from the questionnaire, including 
scores on a brief index, the Health 
Opinion Survey (Gunderson, Arthur, & Wilkins, 
1968). An index of family socioeconomic status 
was derived from the fathers’ education and 
occupation. The number of illnesses (dispensary 
visits) during the overseas deployment period 
was determined from ships’ medical records, 
Variables concerned with duty or job type and 
specific ship were created by means of a 
< linearization procedure, that is, ordering response 
categories in terms of mean criterion (job 
t satisfaction) scores. 


RESULTS 

Ide Navy job satisfaction scale correlated .61 
with the Smith Job Descriptive Index (JDI) 
Cale in a large sample of Navy men aboard ship 
n= 1,021). This relationship was of the same 
magnitude as that between the JDI and Porter’s 
Need Satisfaction scale (Imparato, 1972). In view 
of the fact that the JDI was not specifically 
designed for military populations and settings, 
the relationship with the Navy job satisfaction 
‘Measure provides evidence of substantial agree- 
ment with respect to the general concept being 
measured. The relationships between JDI and 
25 Navy satisfaction measures are described in 


detail elsewhere. 
Relationships between the 17 variables reflect- 
ing personal, military, job, and health status, and 
job satisfaction were tested for significance by 
means of Pearson product-moment correlations. 
OW the 17 correlations, 16 were significant 
( Ss). Only racial or ethnic group, that 
is, Caucasian versus non-Caucasian (primarily 


was negatively 
level of educa- 
tion and the socioeconomic status of parents 
5 (r > = and —.06, respectively, p < O5). 
Wich regard to the type of work erformed, 
higher satisfaction was reported 1 8 Who 
were working in clerical, ordnance, or cook / 


Snonr Notes 


steward jobs (r= 12, .11, and ectively, 
~<.01). Subjects in temporar, for ex 
ample, compartment cleaner) or ree as- 
signments reported the lowest sfaction 
(r=—.21 and —.16, respectively 1). 

In order to evaluate further t rs pre- 
dictive of job satisfaction, a mul gression 
analysis in a cross-validation de s used. 
The total sample was stratified asis of 
age and education and a sample subjects 
was drawn for the validation of the 
analysis. A regression equation z of six 
variables was developed against j sfaction 
as the criterion, The variables wh ibuted 
Significantly to the multiple correla = 49) 
were the Health Opinion Survey type of 
job, length of service, the specific s hat the 
subject was serving aboard, the ni of men 
supervised, and working closely w others. 
When this regression equation was 2 d to the 
cross-validation sample of 2,453 jects, a 
cross. validity correlation coefficient -° .49 was 


obtained. 


DISCUSSION | 
The cross-validity coefficient obtai 


strated that satisfaction was predict 
moderate degree from the variables 
The variable that correlated mos 
job satisfaction was the Health O 
scale (r =—.32), a brief sympto 
has been shown to discriminate psy p 
tients from healthy controls in the n service 
(Gunderson, Arthur, & Wilkins, 1968). This re- 
sult, coupled with the fact that actual illness 
(dispensary visits) correlated negatively with 
satisfaction (r =—.12, p< .01), strongly sug- 
gested that health status is significantly involved 
in job attitudes. The schedule of recent experi- 
ence, which has been shown to be related to 
health in a series of studies (Rahe, 1972), also 
correlated significantly with satisfaction (r= 
—.18, Pp < .01). The schedule of recent experi- 
ence measures recent changes in the individual’s 
personal, social, and occupational life that are 
reported to be psychologically stressful. It ap- 
pears plausible that the “wear and tear” of life 
stresses outside of the job situation affect job 
satisfaction to some extent just as such stresses 
affect health, 

The next most important set of variables re- 
lated to satisfaction reflected naval experience 
(age and length of service) and level of respon- 
sibility (pay grade), The number of men 
supervised was clearly related to pay grade and 
experience, Working closely with others was cor- 


Snort Nores 


it not entirely dependent on se- of organizational climate may be of 
‘ ber of men supervised. This vari- in 

able co: cd uniquely to the prediction of aboard ship, 

job sat. n in the regression equation, thus 
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RELATIONSHIPS BETWEEN PERFORMANCE AND SATISFACT)! 
f UNDER CONTINGENT AND NONCONTINGENT 
REWARD SYSTEMS 


GERALD A. KESSELMAN,: MICHAEL T. WOOD,? ann EILEEN L. HAGI 
Ohio State University 


Relationships between performance and satisfaction with specific job facet 
were examined for two samples of female telephone company workers. In thc 
contingent system, pay was based on piece rates and promotions were tied to 
performance; in the noncontingent system, these rewards were based on 
seniority. Performance levels were similar under the two systems. However, 
in the performance-contingent sample, performance was related to satisfaction 
with the work itself, pay, and promotions. In the seniority sample, perform. 
ance was related to satisfaction with interpersonal factors, pay, and work. 


A major concern of organizational psycholo- 
ists has been to determine the relationship be- 
tween job behavior and job attitudes, Compre- 
hensive reviews of early studies relating job 
Satisfaction and job performance (Vroom, 1964) 
f indicated that if satisfaction is related to per- 
formance, the degree of association is rather 
_ weak. Recent studies have considered conditions 
under which these two variables should be 
telated (Schwab & Cummings, 1970). 

The hypothesis that performance causes satis- 
faction suggests that reward contingencies medi- 
ate relationships between these two variables, In 
a laboratory experiment, Cherrington, Reitz, and 
Scott (1971) found that when rewards were 
contingent on performance, performance and 

satisfaction were positively related. When re- 
wards were delivered randomly to high per- 
formers and low performers, performance and 
satisfaction were unrelated, Negative correlations 
were obtained when low performers were re- 

warded and high performers were not. Other 
experiments have shown higher levels of per- 
formance by subjects who were told that their 
earnings were contingent on the effectiveness of 
their performance (Atkinson, 1958). 

In the present field study two hypotheses were 
tested: (a) Performance is greater under a re- 
ward system that explicitly rewards performance 
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with pay than that under a system u © admin- 
isters pay noncontingent on perfor c. (b) 


Performance is positively related to faction 
with pay, promotions, and work undc eward 
system that explicitly rewards perřòr with 
pay and promotional opportunitic: hy- 
pothesis assumes that performance is ed to 
satisfaction with job facets specifical! -orpo- 
rated in the reward contingency s Thus, 
increased performance in the work should 
associate with work satisfaction, since work ef- 
fectiveness determines rewards. Satisfaction with 
pay and promotions should relate to performance 


since those specific rewards are contiigent on 
performance. 


METHOD 
Subjects 


The subjects were 37 draftswomen and 39 toll 
operators in a midwestern telephone company. 
Their median age was 29.5 years and median 
service was 5.4 years.® Draftswomen performed 
final artistic work on engineering blueprints and 
figures for equipment, cables, etc. While the two 
groups performed similar jobs with respect to 
work hours, training, and evaluation methods, 
their reward systems differed. The toll operators 
constituted the “seniority” group, where pay and 
promotions depended on length of service with 
the company, as defined by the union-company 
contract. Draftswomen in the “performance 
group operated on the same contract combined 
with a piece-rate bonus system, where rewards 
were directly tied to job performance. The bonus 


»The two samples were similar in terms of age, 
length of service, and salary. The education of 
draftswomen was greater than that of operators; 
however, the education variable was not related to 
any of the measures of interest in this study. 
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SHort Nores 37 
systen ed on individual weekly produc- TABLE 1 
tivity he standards determined by aver- Ci TWEEN PERFORMANCE 
age w ductivity and task difficulty, The eRe <a 3 — ny SAMPLE on 
bonus ninistered every three months in — 
addit gular hourly pay. Satisfaction variable 
Measu Sample 
Sat with pay, the work itself, promo- Work e. Pay ; 
tions ers, and supervision were measured — 2 — — 
by th. Descriptive Index (Smith, Kendall, Total 434% | 46°) 18 A 
& Hu 9) which was included in a larger Performance | .$8** 27 |39 4 21 
quest on job attitudes. The performance Seniority 53] 44° | S19) 26 | 4900 
measu sisted of supervisors’ ratings of indi- 
vidua! ormance that are completed semi- 27 8405. 
annu d retained in company files, These 
rating © on a 5-point Likert-type scale an- 
chore poor job performance” and “superior However, in this situation, performance was 
perf ona more strongly related to satisfaction with inter- 
personal components of the job, supervisor, and 
RESULTS co-workers than was the case for the performance- _ 
Hy is 1 Was not supported. Levels of eas sample, Promotion satisfaction did 
perfo e under the performance (M = 2.97) not correlate with performance as strongly as in 
5 ‘ty (M 3.13) reward systems were the performance sample. Contrary to expecta- 
simila tion, pay and work satisfaction were highly 
2 G * 2 2 
Pri testing Hypothesis 2, it was necessary "elated to superiors’ performance ratings. : 
to det: ine whether the subjects were actually 
being «pled from two distinct reward systems. Discussion i 
Include, in the questionnaire were items assess- j F 
> : i The results of the present study did not sup- _ 
ing the yerceived contingency of pay and promo- port the hypothesis that 8 is oui 
tions job performance. In the performance under a reward system that explicitly rewards 
sampie, 94.6% responded “Yes,” while 2.7% performance with pay than that under a system 
n No,“ and 2.7% “2” to the meup which administers pay noncontingent on per- 
s otion based on job performance? formance. It must be reiterated, however, that 


Among seniority subjects, 7.7% responded “Yes,” 
59% and 33.3% “?.” Subjects also re- 
sponded to the following statement on a 5-point 


Scale ranging from “strongly disagree” to 
“strongly agree”: “In my job, the amount of pay 
I receive is primarily based on my job perform- 
ance.” Means were 4.11 and 2,03, respectively, 
for the performance and seniority subgroups 
(t= 9.73, df = 74, p < .01). Thus, pay and pro- 
motions were viewed as significantly more related 
to job performance in the performance sample 
than in the seniority sample. 

The results in Table 1 generally supported 
Hypothesis 2. Although all correlations in the 
Performance sample were positive, the only sig- 
nificant relations were between performance and 
Satisfaction with work, pay, and promotions. In 
the Seniority sample performance and satisfac- 
tion were also, generally, strongly interrelated. 
_ 


For the performance sample only, measures of 
Unit output were available. This productivity mea- 
Sure was highly related (r=.97) to supervisors’ 
Performance ratings. 


this study was correlational, and reward con- 
tingencies were not experimentally manipulated. 
Although the two groups were similar in some 
organizational and personal respects, any differ- 
ences between their behaviors could be attributed 
to differences on variables besides the reward 
systems, which were not measured. Moreover, 
the partial overlap of reward systems may ac- 
count, in part, for the lack of observed per- 
formance differences; that is, in the performance 
group, only rewards beyond seniority-determined 
rates were contingent on productivity, and the 
reinforcement delay was substantial. A 

Studies which have been cited in support of 
the hypothesis have been conducted in the lab- 
oratory where reinforcement schedules and time 
intervals between performance and subsequent 
reward were under the control of the experi- 
menter (Cherrington et al., 1971; Yukl, Werley, 
& Seymore, 1972). As operant theory suggests, 
an important parameter of the performance- — 
reward contingency is the time interval between 
performance and reward (Kimble, 1961, Ch. 6). 


nee-reward contingency on future performance 
will decrease. In an organization, pay and promo- 
tion may be contingent on performance, but 
usually this does not occur until some time after 


moderating effects of the time variable. 
Ihe performance-reward contingency mediated 
" performance-satisfaction relationships. These re- 
_ sults suggest further research on relationships be- 
_ tween performance and specific facets of job 
_ satisfaction under organizational conditions where 
different rewards depend on performance. For 
example, if performance were rewarded by some 
change in supervisory practice, performance 
micht relate to satisfaction with supervision. Re- 
search might also examine properties of reward 
systems that would explain performance-satisfac- 
tion relations when the reward contingency is 
seniority. In some situations where rewards are 
tied to service, workers may perceive perform- 
ance as a path to continued service and rewards. 
Additional variables, such as individual reward 
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ELATIONSHIP OF EARLY WORK CHALLENGE TO JOB 
PERFORMANCE, PROFESSIONAL CONTRIBUTIONS, 
AND COMPETENCE OF ENGINEERS* 
H. G. KAUFMAN ? 
Polytechnic Institute oj Brooklyn, New Vork 
is study investigated the relationship of the work challenge experienced by 
5 engineers at the beginning of their career to their subsequent job perform- 
ce, professional contributions, and competence. Engineers whose initial job 
xperience involved challenging technical work tended to contribute to knowl- 
dge relatively early and to maintain professional competence and good job 
performance during their careers. For those who possessed high technical 
bility, early challenge further reinforced professional contributions and 
competence later in their career. 
Ther accumulating evidence that being METHOD 
assigned challenging work during the first job 
experie as a favorable effect on performance The sample included 85 engineers, from three 
and at ‘cs in succeeding years. One of the few technologically based organizations, who received 
longitu career development studies was car- their baccalaureate degrees between 1953 and 
ried o1 ‘ong college graduates hired as man- 1955. All had approximately contemporaneous 
ageme: inees by the American Telephone and Work experience for a period of 14 years. The 
Telegra; Company (Berlew & Hall, 1966). The degree of technical challenge was measured at 
Study th. ized that at least two factors con- the end of two years of job experience by esti- 
tribute = the relationship between early work mates of time allocated to as well as importance 
challenge and subsequent performance, namely: rankings of challenging technical work activities, 
(a) the organization initially assigns the most Using a form developed by the Educational 
capable individuals to the most demanding jobs; Testing Service (Hemphill, 1963). These esti- 


and (b) a first job that is demanding leads to 
better performance and success in subsequent 
years, is likely that these two factors, both 
indeper y and in interaction, affect not only 
Subsequent job performance but also profes- 
Sional competence and contributions to knowledge 
in one’s field. The following hypotheses were 
Investigated in the present study: (a) challeng- 
ing work at the beginning of the career encour- 
ages good job performance as well as profes- 
sional competence and contributions to knowledge 
later in the career; and (b) this early work 
challenge interacts with initial individual ability 
to affect job performance, professional contribu- 
tions, and competence beyond the independent 
effects of ability and challenge. 
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mates were collected from both engineers and 
their managers and resulted in a total of four 
measures of early technical work challenge. A 
convergent-discriminant validity analysis demon- 
strated that all four estimates measured the 
same construct (Kaufman, 1973). In addition, 
test measures of engineering ability as well as 
interest in theory and ideas were obtained from 
the engineers shortly after their employment 
began (Kaufman, 1972). Those engineers who 
left their organizations did not differ significantly 
in ability and interest from those who remained. 

Criterion data were collected for three time 
periods designated as early (first 8 years of 
experience), middle (9-11 years), and recent 
(12-14 years). The middle period was selected 
as being just at or beyond the “half-life” of an 
engineering degree, which has been estimated at 
10 years or less (Roney, 1966). Management 
ratings of each engineer’s job performance were 
collected for each time period, primarily from 
personnel records. Engineers contribute to new 
knowledge via patents and publications (LeBold, 
Perrucci, & Howland, 1966), and consequently, 
the number of publications and patent disclosures 
was used to measure professional contributions 
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TABLE 1 


Reranoxsmps serween EAA Work 
CHALLENGE AND CRITERIA 
oOo 
4 Work Chal- 
Early work challenge lengeX Ability 


Mean % in- 


crease in & 
after account- 
ing for indi- 
vidual differ- 
encesand 
challenge 

Performance 

Yı 754 bs 
(m= 85) Y: 4.3˙ 

Y: 1.6 
Publications 

Yı 1:2 
(n=85) Y: 0 

= Yı 5.2° 

Patents 

Y; 0 
(n=85) Y, 0 

Y: 1.8* 
Competence 
(n=24) Y: 7.0* 
Diversity 
(n=24) Y: 6.4* 


Note, The data presented are the means of the results obtained 
using four different measures of challenge. Yı = early period; 
2 = middle period; Ys = recent period. 

* At least one relationship significant at the p < .05 level. 


during each career stage. In addition, an inven- 
tory of technical specialities, completed by the 
engineers in one of the organizations (n = 24), 
provided highly correlated measures of current 
technical diversity and competence in sophisti- 
cated professional knowledge (Kaufman, 1972). 
Competence, diversity, and patent disclosures 
were all significantly correlated with the number 
of publications during the recent period. With the 
exception of these relationships and the correla- 
tions between earlier publications and recent 
_ competence, the other criteria were not signifi- 
cantly correlated with each other during or across 

time periods. 

The statistical analysis involved the obtaining 
of product-moment correlations between the 
measures of work challenge and the criteria for 
each of the three time periods. Multiple regres- 
Sion techniques were used to test for the in- 
crease in criteria predictability contributed by 
early work challenge as well as by the interaction 
between this challenge and individual ability 

after accounting for the effects of ability and 


Snort Notes 


interest (Cohen, 1968). The lati dure wi 
carried out since ability and i predic 
the assignment of engineers to ging 
(Hemphill, 1963). 

RESULTS 

A summary of the results app: Table 1. 
Since the four utilization crite: measures 
of the same construct, the mear orrelation 
and percent increase in varianc » used to 
summarize the results. The t-moment 
correlations between early wor! nge mea 
sures and criteria in later periods all posi- 
tive, with many being significant, ending to 
support the first hypothesis. 

Early work challenge consisten ed to 
prediction of subsequent job pe nee dur- 
ing both early and later periods.“ eraction 
between initial ability and work ch e signi 
cantly contributed to the predicti i job per 
formance during the early and n period 
thereby providing some support í e se 
hypothesis. 

The prediction of publications ~ signifi 
with a clear tendency for the rei- onship b 
tween early work challenge and p ations 


be strongest during the initial PB 
diminish progressively during the ! 
However, the prediction of patents 
during the middle or “half-life” per: 
Providing further support for the sec 
pothesis was the finding that the interaction 
individual ability and work challenge signifi- 
cantly contributed to the prediction oi 
ber of publications and, to a lesser degree, the 
number of patent disclosures during the n 
recent period. In addition, both technical diver 
sity and professional competence during the mos 
recent period were significantly predicted by earl} 
work challenge treated both independently an 
in interaction with individual ability, thus pre 
viding additional support for both hypotheses 


Discussion 


The finding that early work challenge con- 
sistently predicted job performance, even in er 
stages of an engineer’s career, may be attributed 
to the stimulation provided by the early ch 
lenge toward good initial performance. This 
likely resulted in more challenging work a 
ments generating a continuous process of 
forcement. Such a speculation tends to be SUP 
ported by evidence from another longitudi 
study of engineers, indicating that relation 
between organizational factors and perform 
remain consistent over time (Farris, 1969). 


Unlike 
to know! 


quence < 
lag in di 
than pub 
neers in 
early ch 
professi 
contribu 
indepen: 
profess 
effects o 
Specific 
interest 
(Kaufrr 
Althe 
job enr 
mainly 
profess 
to be r 
of thei 
change 
in orde. 
individ 
1974), 
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Snort Nores 


ince, professional contributions 
only be an immediate conse- 
hallenging work with the time 
patents appearing slightly longer 
rticles, However, for those engi- 
ssessing high technical ability, 
g work may help to stimulate 
clopment later in the career. The 
í early work experience to the 
teria of job performance and 
cvelopment indicates that the 
challenge may be global and not 
or profession, whereas ability and 
be more specific in their effects 
2). 
he emphasis on such techniques as 
nt in recent years has focused 
professional employees, it is the 
in the organization who are likely 
‘fected by the intrinsic motivation 
x. Hence, organizations can initiate 
e job design of their professionals 
enhance the development of the 
both job and career (cf. Kaufman, 
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SOME MODERATOR INFLUENCES ON RELATIONSHIPS ] 
CONSIDERATION, INITIATING STRUCTURE, ANI 
ORGANIZATIONAL CRITERIA? 


EN 


IRWIN J. BADIN 2 


New York University 


Group size, tenure, position power, and task structure were Studied as px 
tial moderators of the relationships between Consideration (C) and Ini 
Structure (IS) and employee satisfaction and group effectiveness. The sı 
were 489 male employees of a large manufacturing company. C was posi 
related to satisfaction across all moderator conditions and to group eff: 


a a 


ness in high-position power and high- 


tively related to 
low-position power, 
tionship between IS 


task structure conditions. IS was 
group effectiveness for small groups, seasoned empl 
and high-task structure. There was no significant 
and either criterion for large groups, new empl 


high-position power, and low-task structure. The consistency of the C Tesu 
as compared to the more situational nature of the IS result, replicated ot! 


findings, 


The identification and description of the 
Consideration (C) and the Initiating Structure 
(IS) dimensions of leader behavior (Fleishman, 
1953) has served as an important framework 
for much of the contemporary research into 
leadership effectiveness. However, several au- 
thors have pointed out the need for better defini- 
tions of the conditions that may moderate the 
relationships between these leader behaviors and 
various measures of organizational effectiveness 
(Badin, 1971; Fleishman, 1973; Fleishman, 
Harris, & Burtt, 1955; Korman, 1966; Vroom, 
1973), 

The aim of the present research was to exam- 
ine the extent to which the relationships between 
C and IS and measures of employee satisfaction 
and group effectiveness were moderated by cer- 
tain situational characteristics. Four potential 
moderators, commonly hypothesized sources of 
influence in leadership theory, were chosen; work 
group size, subordinates’ tenure, supervisors’ 
position power, and extent of task structure, 


Work Group Size 


Generally, it has been found that employees 
tend to accept leader-centered direction in large 
work groups (Hemphill, 1950; Vroom & Mann, 


This study is based on a doctoral dissertation 
submitted to the Department of Psychology, New 
York University. The author gratefully acknowledges 
the helpful suggestions of Abraham K. Korman and 
Jeffery H. Greenhaus. The author is also indebted 


to John R. Hinrichs of International Business Ma- 


chines for his cooperation and assistance. 


Requests ſor reprints Should be sent to the author 
who is now at Montclair State College, Department 
of Psychology, Upper Montclair, New Jersey 07043. 


1960). As for C, a guiding ratio 
the study was that people-or: 
should be related to effectivene 
tion regardless of variations wi 
moderators. Accordingly, the fo! 
ses were formulated: (a) The 
tween IS and the criteria are 
large groups and negative within 
(b) The correlations between C 2 
are positive within large and smal! 


hroughout 

behavior 
1 satisfac- 
ny of the 
z hypothe- 
lations be- 
ive within 
iil groups. 
he criteria 


ups. 
Tenure 


Bass (1960) points out that structuring and 
organizing leader behaviors help the new em- 
ployee understand his environment. A continuing 
emphasis on structuring the role of a seasoned 
employee, however, leads to his dissatisfaction. 
Accordingly, the following hypotheses were 
formulated: (a) The correlations between IS 
and the criteria are positive for new employees 
and negative for seasoned employees. (b) The 
correlations between C and the criteria are posi- 
tive for new and seasoned employees. 


Position Power 


Fiedler (1967) has stated that it is more 
“favorable” to the leader if he has high-position 
power and that subordinates are more likely to 
accept task-oriented behavior from a manager 
who was perceived to have such high-position 
power than low-position power. Accordingly, the 
following hypotheses were formulated: (a) The 
correlations between IS and the criteria are Bey 
tive for employees who perceive that their ere 
agers have high-position power and is nee 
for those who perceive that their managers a 
low-position power. (b) The correlations betwee 
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TABLE 1 
LATIONS BETWEEN LEADER BEHAVIOR AND Errecnvieunss ACROSS DIFFERENT 


Conprtions or SITUATIONAL VARIAnLES (us ov Pasexrnesxs) 


— = = ee 
Correlation 
iterator 
* 
Cosat C-group 
a effectiveness 
Group Size 
n> 12) oan 33°* (254) 34 (19) 
Ay - É 1,05 23 
Small (x < 11) «13° (227) —.00˙ (23) -22°* (228) 27 (23) 
New (<6 months) 00 (185) —.19 (21) 33** (185) 3t (21) 
Maur ‘ioe 5 a 80 1.16 KLU 
Seasoned ( months 07 (206 - 43° ier 
do (206) 43° (21) .23°* (297) 31 (21) 
High 12 (201) —.18 (18) 19˙ (201) 52° (18 
ya 1e 1.10 39 1.33 ag 
—.05 (197 —.51* (17 * 
atk structure 7) (17) 23° (197) 08 (17) 
High —,.03 (148) —.56° (17) .22°* (148) SS* (17) 
Sai AB 217° 26 99 
02 (153) 20 (16) «25% (153) 23 (16) 


Sp <.01: 


C and the criteria are positive regardless of the 
der's perceived position power. 
Task Structure 

Fiedler (1967) has posited that it is more 
favorable to the leader if there is a high degree 
of task structure. An important question is 
Whether high-task structure can lead to effective- 
ness regardless of the leader’s behavioral orienta- 
tion. It seems likely that in conditions in which 
the task is inherently structured, any further 
attempts on the manager’s part to initiate struc- 
ure would be seen as superfluous and would 
interfere with employee effectiveness. For tasks 
hich are perceived to be ambiguous, structuring 
behavior may be seen as more beneficial. Accord- 
ingly, the following hypotheses were formulated: 
(a) The correlations between IS and the criteria 
are negative for employees who perceive their 
as being high in structure and positive for 
chose who perceive it as being low in structure. 
(6) The correlations between C and the criteria 
are positive regardless of the variation in per- 
ceived task structure. 


METHOD 


Subjects were 489 male employees of a large 
nufacturing company, who performed their 


Note. IS = initiating structure; Sat = individual satisfaction, C = consideration, 


duties in one of 42 work groups. All subjects N 
were in some way directly involved in the 
marketing efforts of the company. 


Research Instruments 


Leader behavior variables. C and IS scores 
were obtained from items that appear in the 
Supervisory Behavior Questionnaire (Fleishman, 
1957). Although this questionnaire has 48 items, 
only the 24 items with the highest and purest | 
loading patterns were chosen for the present 
research (12 items for each dimension). 3 

Moderator variables. Group size was obtained 
from company records, Tenure was obtained 
from a question in the survey booklet. The ex- 
tent to which the manager was perceived to have 
a position of power was based on the mean of 
five items which were completed by each sub- 
ordinate in his group. The content of the items 
were obtained from a “Position Power Question- 
naire” devised by Fiedler (1967). Task structure 
has been defined by four factors (Shaw, 1967): 
decision verifiability, goal clarity, goal-path — 
multiplicity, and solution specificity. The task 
structure score was based on the mean of four 
items, one for each factor. 

Criterion variables. Individual satisfaction was 
measured by a 6-item scale which had been used 
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in several previous job satisfaction surveys at the 
company. The group’s effectiveness was measured 
by a 12-item rating scale which was completed 
by the branch office manager to whom the 
supervisor of each work group reported, 

Questionnaires were filled in anonymously, 
coded according to supervisor, and mailed to an 
office at New York University, 


RESULTS AND DISCUSSION 


Before undertaking the hypothesized moderator 
analyses, correlations were computed for the dif- 
ferent leader behaviors over all conditions. This 
analysis indicated positive relationships for the 
C dimension for both the individual satisfaction 
(r=.26, p <.01, n= 484) and group effective- 
ness (r= 35, p <.05, n= 42) criteria. On the 
other hand, IS showed no significant relationship 
to individual satisfaction (r = .04, n = 484) and 
a negative relationship to group effectiveness 
(r= —.36, p< 05, n= 42). Hence, some rela- 
_ tionships did exist on the general level, relation- 
chips which it was hoped the moderator analyses 
would clarify further. 

Table 1 presents the data for these predicted 
effects, (A median split on the moderator was 
used in all cases except tenure which was 
dichotomized on an a priori basis). As pre- 

dicted, IS was negatively related to group ef- 
fectiveness for small groups, seasoned employees, 
low-position-power managers, and high-task- 
structure conditions. These relationships were all 

_ significant. However, contrary to the hypotheses, 
no significant positive relationships were found. 
Group size and task structure moderated the 
relationship between IS and group effectiveness. 
No significant relationships were found between 
IS and individual satisfaction in any of the 
conditions except small group size (r=.13, 

5 05). 

All of the correlations between C and indi- 
vidual satisfaction were positive and significant 
at the .01 level. For ratings of group effective- 
ness, all correlations were positive, although 
Significant in only two cases (high- position 
power, high- task structure). 

The positive relationships between C and the 
criteria over all conditions remained quite stable 

throughout all levels of the moderator variables, 

Although the overall negative correlation between 

IS and group effectiveness was found in certain 

levels of the moderators (those predicted to con- 

tain significant negative relationships), in others 

(those predicted to contain significant positive 
relationships), no significant relationships existed. 


> Seat 


SHort Notes 


The finding that the C result 


nore con- 
sistent across situations thar he IS ree 
sults was consonant with the i z trend of 
data reported in the liter Fleishman, 
1973). The findings of this research 
should help specify the relative rtance of 
specific moderators as well as tances in 
which situational variation may uence the 
relationship between leader be! ind effec- 
tiveness. This need for suc} ition has 
been stressed by Fleishman ( i Korman 
(1973). 
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NOTE ON THE PREDICTIVE VALIDITY OF INITIATING 
`TRUCTURE AND CONSIDERATION IN ROTC TRAINING 


NORMAN M. PRYOR * 
University of Tenneswe 


Sixty-eight advanced Reserve Officer Training Corps (ROTC) students par- 
ticipated in a Leaderless Group Discussion (LGD), filled out the Leadership 
Opinion Questionnaire (LOQ), and were described by peers with Form 12 of the 
Leadership Behavior Description Questionnaire (LBDQ). Four to five months 
later, measures of their performance during summer field training with the 
U.S. Army were obtained and correlated with their scores on Initiating Struc- 
ture (IS) and Consideration (C) as measured by the LGD, LOQ, and LBDQ. 
LGD-IS and LGD-C were significantly correlated with several of the perform- 
ance measures. IGD. IS was the best predictor (r= 45) of the Leadership 
Potential Index (LPI), the Army’s overall rating, while LBDQ-C was signifi- 


Merion 
Subjects 
The subjects were 68 ROTC students at the 
University of Tennessee, beginning their third 
year of training in military science. 


Procedure 
Subjects were administered the LOQ in class. 


Sowa vchology 
1974. As; 
M. M. PETTY? 
F and Industrial Relations Institute 
University of Alabama 
cantly correlated with the Field Problem Test, 
M of the evidence on the validity of the 
two known dimensions of leadership behav- 
ior, 2 ting Structure (IS) and Consideration 
(C) ncurrent validity. Korman (1966) and 
Fle (41972, 1973) have pointed out the 
nee more studies of the predictive validity 
of ; ires of these dimensions. 
1 leadership dimensions have been mea- 
sure several instruments, most notably vari- 


ous 
tior 


ns of the Leadership Behavior Descrip- 
estionnaire (LBDQ), by means of which 


subo: dinates typically describe their supervisor 
(Stell & Coons, 1957; Fleishman, 1953a, 
1972); and the self-report Leadership Opinion 
Questionnaire (LOQ), (Fleishman, 1953b, 1969). 
The Lvaderless Group Discussion (LGD) is an- 


other procedure that has been used to measure IS 
and C (Bass, 1954). In the case of the LBDQ and 
LOQ the leadership dimensions and scoring keys 
Were identified by psychometric techniques such 
as item analysis and factor analysis. The LGD 
Measures are inferred from an overall single be- 
havior rating provided by observers of the group 
members activity. Earlier, Bass had reported eight 
Predictive validities of .33-.65, with a median of 
46 for the LGD-IS score. 

The purpose of the present study is to examine 
the predictive validities of IS and C as measured 
by the LOQ, LBDQ, and LGD, for a sample of 
ROTC students, using several criteria of leader 
performance. 


1 Requests for reprints should be sent to M. M. 
Petty, P. O. Box 2737, University of Alabama, 
Alabama 35486. 

Nom at Ford Motor Company, Dearborn, 
Michigan. 


During the same class period each subject par- 
ticipated in a four-man LGD from which mea- 
sures of IS and C (Bass, 1954, p. 468) were 
obtained. Each subject was rated by two senior 
ROTC students who served as laboratory assist- 
ants to the instructor. Seven weeks later each 
subject was asked to describe three of his peers® — 
with the 100 item, Form 12 of the LBDQ. The 
IS and C scales each contained 10 items. Four 
to five months later the subjects attended sx 
weeks of summer field training at a U.S. Army 
fort. During the course of their training the 
subjects were evaluated on the following criteria: 
(a) three graphic ratings by the training officer 
including an assessment of the subject’s leader- 
ship abilities in four, one-day leadership roles, 
mal characteristics of the subject (eg, 
“knows how to handle men”), and the overall 
willingness of the training officer to allow the 
subject to represent him in any situation; ()) 
peers’ rating (PEER) of their willingness to 
follow the subject into combat; (c) Field Prob- 
lem Test (FIELD)—an average behavioral — 


$ The ratings were more than peer ratings, since 
all students had to assume “leadership positions” 
(platoon leader, squad leader, etc.) during the course 
of their training. Thus at various times, the raters — 
were subordinates of their “peers.” ` 
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3% Snort Notes 
TABLE 1 
3 INTERCORRELATIONS BASED ON 68 FIELD TRAINING PARTICIPANTS 4 3 
= ———————— = = 
4 Measure ISLOQ | C f IS-LGD Í C-LGD [ISHQ | CLEDQ | PEER* LPI 
4 I 3 — 
t 10018 ie * 
loo =.17 
_ LGD-IS 0 07 
E LGD-C — 02 02 sne 
LDS — 02 OF RY 22 
~ LBDO-C — 05 0 12 04 36°* 
PEER 4 11 42². 20 05 12 
_ FIELD -14 AS 10 — 03 10 3i* .38** 
LPI 0 10 Ase .28* 08 22 e 
go —1 ip Opi tay = Initiating Sı Fan miei Jess G 5 1 = Con- 
sideration: LEDO = Lentersbin Beko airen IS Questionnaires BEER en renee FIs D m Test; 
LPI = Leadership Potential Index, 
a Pid Preis Tet 
"Leadership, Potential Index, 
“$ <05. 
p <o 
_ Tating by a noncommissioned officer of the sub- tions appear to indicate that the instru- 
_ ject’s leadership ability in three or four simulated ments Were measuring different phe: (ie, 
military missions; and (d) cadet Leader Poten- self, peer, and observer perceptions dership 
tial Index (LPI)—a weighted composite of all! status). The low-homeoinstrumen rotrait 
Summer field training criteria. correlations for the LOQ and LB! > con- 
N sistent with the previous findings ted by 
. RESULTS Fleishman (1969, 1972). The high e on be- 
. A , tween LGD-IS and LGD-C is simila: the re- 
À The intercorrelations between IS and C mea- sults of Bass (1954) who believes th e LGD 
Sures and the performance measures may be 


found in Table 1. The graphic ratings provided 
by the training officer have been omitted from 
this table because of their high intercorrelations 
(83-96) and their high correlations with the 
LPI (.88-.95). The peers’ rating and the Field 
Problem Test are included in Table 1 even 
though they are contained in the LPI, since some 
degree of independence is demonstrated and they 
are conceptually separable. 
An examination of Table 1 indicates that 
-LGD-IS appears to be the best predictor of the 
‘Peers’ rating and the LPI, while LBDQ-C was 
the only significant predictor of the Field Prob- 
: Test. LGD-C is also significantly correlated 
With the LPI, but its addition to LGD-IS in a 
Multiple regression equation does not add any 
incremental predictive validity for the LPI. The 
LOQ failed to predict any of the performance 
criteria, 
The low heteroinstrument-homeotrait correla- 


There were four other measures of summer field 
‘training performance in addition to the three cited 
in text. The additional criteria included: rifle marks- 
manship, physical fitness, map and compass reading 
ability, and a comprehensive test covering the 
Cognitive aspects of summer field training. 


7 


3 
4 


is more a measure of IS than C. 


Discussion 


The results of the present study indicated that 
LGD-IS was the best predictor of the most im- 
portant leadership measure, the LPI. This latter 
Measure is important because it is used to make 
some personnel decisions that will affect the sub- 
ject’s future with the Army (e.g., branch assign- 
ments). One possible explanation for the superior 
predictive power of LGD-IS was that the condi- 
tions were very similar under which both the 
LGD measures and many of the summer camp 
Measures were obtained. That is, the subject's be- 
havior was observed and evaluated immediately 
by independent observers. The superior predikt 
power of the LGD in this study appears to len 
support for the continued use of the LGD in 
asssessment centers. 4 

The generality of results from this study is 
limited. Some predictive validity of IS and C is 
demonstrated, but this depends on the criteria and 
the instruments used. 
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Supervisors from three different organizations completed a version of t 
Embedded Figures Test and ranked 10 inner/other-directed adjectives accordi 
to their importance for success on their jobs. The hypothesis that field ind 
pendent supervisors rank inner-directed attributes as most important for succ 
was clearly supported for the attribute of “independence” in all three organ 
zations. For the remaining attributes differential effects depending on the typ 
of organization were observed, but the direction of the observed relationship 


was consistent with the hypothesis. 


According to Witkin, Dyk, Faterson, Good- 
enough, and Karp (1962) when a person inter- 
acts with his environment, his responses vary 
along a continuum from articulated (field inde- 
pendent) to global (field dependent). At the 
field independent end of the continuum, the 
characteristic social behavior is independent and 
Self-reliant. Linton (1955) has found empirical 
Support for this relationship between field inde- 
pendence and independent social behavior, Using 
Asch’s classical conformity studies, Linton ob- 
Served that field independent individuals were 
much more able to resist social pressure for 
conformity. Cattell (1971) has added further 
evidence supporting this relationship. He noted 
that measures of field independence are a mani- 
festation of the source trait “temperamental 
independence” in the perceptual performance 
realm. Temperamental independents or field in- 
dependents showed self-assured and independent 

iors. 

The descriptions of the behavioral referents 
of field independence given by Witkin and 
Cattell overlap considerably with Riesman's 
(1950) description of inner- directed types. 
Riesman indicated that individuals who were 
inner- directed were secure, relatively self-assured, 
and socially independent. Taken together, these 
findings suggest that there is a direct relation- 
ship between field independence as measured in 
the perceptual realm and inner-directedness as 
measured in terms of preferred social orientation. 

At the other end of the continuum of re- 
Sponses to the environment, the field dependent 
end of the continuum, the characteristic social 


* Requests for reprints should be sent to L. W. 
Gruenfeld, New York State School of Industrial and 
Labor Relations, Cornell University, Ithaca, New 
York 14850. 


behavior is dependent and other- l. This 
finding by Witkin et al. (1962) s that 
there is a direct relationship bet) ld de- 
pendence as measured in the per realm 
of preferred social orientation. Rie 1950) 
indicated that individuals who other- 
directed emphasized the need curity, 
approval, and social dependency. 

Based on the presumption that ind ils per- 
form their roles in a manner congr: th the 
behavioral pattern which is character i their 
cognitive style, the following specific |: pothesis 
was formulated to guide this researc: When 
asked to rank traits which they const impor- 
tant for success on their present jobs, field in- 
dependent supervisors consistently k inner- 
directed traits as more important, whereas field 


dependent supervisors consistently rank other- 
directed traits as more important. 


METHOD 
Sample 


The subjects came from three different orga- 
nizations. The first group, a production group 
consisted of 73 male lower- and middle-level man- 
agers in a food processing plant. The second group 
included 51 male managerial employees of a state 
department of taxation and finance, and the third 
group consisted of 35 male and female mana- 
gerial and professional employees of a county 
government unit. The county government group 
was relatively field dependent; the state civil 
Service and production groups were more repre- 
sentative although the latter was relatively more 
field independent than the other two groups. 


Research Instruments 


Three different but related instruments bie 
used to measure field independence. The in 
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led Figures Test, EFT (Jackson, 
nistered to the state civil service 
Hidden Figures Test, HFT-V 
ssick, & Myers, 1964), a group 
inistered to the production sample, 
len Figures Test, Cf-1 (French, 
Price, 1963), was administered in 
zs to the county government group. 
these instruments are essentially 
at they require the subject to find 
gure in an embedding background. 
l. (1964) and Weissenberg (1973) 
ted that all three forms are valid 

f psychological differentiation. 
rument used to measure the impor- 
‘nner/other-directed traits for success 
sent job was developed by Porter and 
1968). A list of 10 adjectives is pre- 
the subject and he is instructed to rank 
e “order of their importance for suc- 
us] present management position.” The 
vortant trait is ranked as “1,” while the 
vortant is ranked as “10.” The inner- 
traits listed are forceful, imaginative, 
lent, self-confident, and decisive. The 
rected traits are cooperative, adapte 

. agreeable, and tactful. 


RESULTS AND DISCUSSION 


le 1 shows the significant Pearson product- 
nt correlations between rankings of certain 
raits and scores on a measure of field 
independence, 

the case of one adjective, independent, 
there was clear support for the hypothesis in all 
three groups. The field independent individual 
ranks this inner-directed trait as being more im- 
portant for success than does the field dependent. 
For the imaginative trait only the production 
group results supported the hypothesis. No sig- 
hificant correlations were observed between rank- 
ings of the other three inner-directed adjectives: 
forceful, self-confident, and decisive, and scores 
on the various measures of field independence/ 
field dependence. 

For the other-directed adjectives, field depen- 
dent subjects in the production and county gov- 
ernment groups ranked tactful as being more 
important for success. Positive correlations be- 
tween rankings of the two other-directed traits, 
cautious and agreeable, and scores on the Hidden 
Figures Test in the county government group 
further indicated that field dependent individuals 
ranked these two traits as more important for 
success. In no group were there significant cor- 
relations between rankings for the remaining 


Snort Nores 


TABLE 1 


CORRELATIONS BETWEEN Tarr Raxxoxas axp Soo 
ON A Measure or Preto INN 


i Production | State civil — 
Adjective 2 ai ment 
= 73 
09) | @=m@ | foe 
Imaginative — 25° 
Independent — 25° 
Tactful 1 
Cautious — 
Agreeable — 
oh 8K. 


other directed adjectives, adaptable and coopera- 
tive, and scores on the measure of field in 
pendence/field dependence. 
Table 1 reveals (a) a strong predominance of 
correlations between rankings of inner- directed 
traits and scores on the measure of field inde- 
pendence/field dependence in the production 


pie in the county government group. In 
the state vil service group there is only one 


dence and ‘the score on the individual EFT. 
correlations observed are in the expected di 
tion, but further research needs to ne 
the fact that there were significant correlations 
for only some of the characteristics and that 
these do not occur in all three groups. 
Nevertheless, as Table 1 shows, there is 


gardless of the type of organization, rank f 
pendence higher than do field dependent 
visors as an attribute required for ma t 
success. These results provide evidence for Bi 
validity of the construct field independence as 
measured by various forms of the Embedde 
Figures Test. They further support the proposi- 
tion that Riesman's inner-directedness and Wit 
kin’s field independence are related, Furthermore, 
the field independent/field dependent cognitive 
style is related to characterictics believed to be 
associated with managerial success, 

There are several additional implications fe 2 
further research suggested by these findings. 
Porter and Lawler (1968) related differences in 
the perception of traits required for success 
the effectiveness of supervisory — 


Inner. d rected supervisors were ranked higher on 
job effectiveness than were other-directed super- 
ors. In addition, in an unpublished study the 
thors found that in a manufacturing organiza- 
n first- and second-line supervisors who were 
independent were ranked by their super- 
ors as more successful than field dependent 
ervisors (r= 48, p<.01). When combined 
h the results of the present study, these find- 
ings suggest that measures of field independence 
‘may be fruitfully used in subsequent studies of 
ership effectiveness, 
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MATISM AND THE CONGRUENCE BETWEEN SELF-REPORTED 
JOB PREFERENCE AND PERFORMANCE AMONG SCHOOL 
SUPERVISORS 
JAMES P. ESPOSITO: axo HERBERT C. RICHARDS 
University oj Virginia 
One hundred and seventy-four instructional supervisors were administered the 
Rokeach Dogmatism Scale and an instrument requiring the ranking of 10 
supervision tasks according to ideal and actual priorities. Respondents were 
grouped into three categories according to level of measured dogmatism. 
Findings indicated that closed-minded individuals manifest significantly lower 
correspondence of ideal and actual task rankings than did their medium- or 1 
low-dogmatic counterparts. q 
( i the most persistent findings among authoritarianism (Adorno, Frenkel-B 
stu supervisory personnel and their multi- Levinson, & Sanford, 1950), psychologists 
fac ole in public education is the existence identified its manifestations through a multitude 
of discrepancies between what supervisors of criterion variables. However, the 
act do during the work day and tasks they of closed-minded individuals on authority 
be most important or desirable to perform (e.g., Feather, 1967) provides the basis for the 
(e irman, 1970). Although a variety of present study which involves supervisory per 
ex) ions have been suggested for the sur- sonnel, More specifically, the purpose of this 
pr poor match of intention and perform- study is to investigate the prediction that out oi 
an dings of the Regional Curriculum Proj- deference to authority figures closed. mi 
ect 8) indicate that supervisors perceive a supervisors will perform those tasks prescribed 
m de of administrative or political hin- for them by their superiors regardless of personal 
dr (real or imaginary) to job performance. priorities; and furthermore, that closed-minde 
Un cbtedly there are some functional hin- individuals will manifest a lower corres 
dran to performing a supervisory job such between ideal and actual task self-rankings 
as (Jing out routine forms or the preparation of their open-minded colleagues. 
burevucratic reports, but such hindrances plague 
mos! jobs performed in an organizational setting. METHOD 
Subjects 


It is equally apparent that some individuals, 
despite the limitations imposed by the “system,” 
ely perform those tasks they rate as most 
interesting and important. The presence of indi- 
vidual variability implies that something in the 
personal-social makeup of the individual super- 
visor lies at the core of his inability to match 
intention with performance rather than in the 
external matrix of bureaucratic hindrances. Thus, 
both personal competence and individual tempera- 
ment are implicated as key antecedent variables. 

One of the most salient aspects of an indi- 
vidual’s temperament is the extent to which his 
core beliefs about the world are flexible, open to 
modification, and subject to change based on in- 
coming information. In fact, since the concept of 
belief system rigidity was first subjected to 
quantitative measurement under the rubric of 


1 Requests for reprints should be sent to James 
P. Esposito, Curry Memorial School of Education, 
University of Virginia, Charlottesville, Virginia 22903. 


The invited sample (those initially contacted 
as potential subjects) consisted of 274 individuals 
from 67 Florida counties who were identified as 
general supervisors, elementary and secondary 
consultants and specialists, or area instructional 
coordinators. Directors of instruction and as- 
sistant directors of instruction were included for 
those counties where no other general super- 
visors were listed in the Florida Educational Di- 
rectory. Although data were obtained for 203 
subjects (approximately 74% of the invited 
sample), 29 questionnaires were ere 
cause of incompleteness or because the n. 
dent chose not to participate. Thus, the effective 
sample contained 174 subjects, 64% of thos y 
initially identified for sampling. E 


Measures X 
z 
The Rokeach (1960) Dogmatism Scale (Form 
E) was adopted as the independent measure for 
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the present investigation. This instrument yields 
a three-digit score intended to indicate the de- 
gree to which the belief system of an individual 
is open to change on the basis of new experience. 
The Rokeach Dogmatism Scale has an impres- 
Sive validity network, and readers are referred to 
Rokeach (1960) for an extended (although some- 
what dated) treatment of this topic. 

The criterion measures for the present study 
consist of forced choice rankings of the 10 major 
tasks of supervision identified by Harris (1963): 
developing curriculum; organizing for instruc- 
tion; staffing; providing facilities; providing ma- 
terials; arranging for in-service education; ori- 
enting new staff members; relating special ser- 
vices; developing public relations; evaluating. 
Each subject was presented with a listing of 
these tasks and a description of each. The rank- 
ing procedure was explained through the fol- 
lowing written instructions: 


Please rank order from 1 through 10, the 
10 tasks in terms of the frequency in which 
you (1) actually perform that task (actual), 
and (2) would prefer to perform that task 
if given the Opportunity (ideal). 

Indicate the actual rank in the space pro- 
vided at the left of each task and the pre- 
ferred or ideal rank at the right of each 
task. The ideal rank should be determined 
by your preference as to the frequency in 
which you would prefer to perform each 
task. 

Place a 1 by the task with the highest 
frequency of performance, 2 by the task 
with the second highest frequency, 3 by the 

task with the third highest frequency . . . 
9 by the task with the ninth highest fre- 
quency and 10 by the task with the lowest 
frequency. 


Thus, in addition to dogmatism, 20 scores 
vere obtained from each participant, 10 corre- 
ponding to actual task rankings and 10 others 
‘orresponding to ideal task rankings, 


inalysis 


Tn order to classify subjects according to level 
f measured dogmatism, all respondents who 
cored in the upper third of the sample of the 
tokeach scale were designated as high dogmatics 
n= 57), while those who scored in the lower 
hird were designated as low dogmatics (n = 56). 
Subgroup sample sizes were not perfectly equal 
ue to ties on the dogmatism scale). Thus, in 
ddition to having a three-digit dogmatism score, 


Sort Notes 


each subject was classified as hi dium, or 
low on dogmatism. 

To facilitate the interpretati reporting 
of the data, all task ranks were >d direc- 
tionally so that a high score indi: gh pref- 
erence for (or performing of) k. This 
was done by subtracting each ra from a 
constant (11). The resulting scal d from 
1 to 10; a 1 indicated low pre or per- 
formance. 

The correspondence of actua! al task 
rankings was estimated for each hy com- 
puting a rank-order correlation (i arman's 
Rho) between the two sets of scor =rences 
between real and ideal rankings h task 
separately were also obtained by su g ideal 
from actual ranks, 

Mean Spearman rhos for high-, i-, and 
low-dogmatic groups were compu J dif- 
ferences among these means acro three 
groups were tested for significa hrough 
analysis of variance (independent gr sign). 
Differences between individual g means 
(pair-wise comparisons) were testei ignifi- 
cance by the Newman-Keuls method Winer, 
1962). To identify more specifica! he task 
rankings that are differentially a with 
dogmatism, actual rankings, ideal zs, and 
differences between actual and ankings 


were each correlated with dogmatism. 


RESULTS AND Discussion 


The results of the overall analysis of variance 
were significant (F=12.2, p<.001). Thus, 
level of dogmatism is related to the correspon- 
dence of actual and ideal task rankings. How- 
ever, inspection of the individual group means 
(84, .63, and .59 for high, medium, and low 
groups, respectively) reveals the effect to be a 
nonlinear one. Whereas high dogmatics mani- 
fested significantly lower Spearman rhos than 
either of their medium (93 = 6.5, p<.01) or 
low (92 = 5.6, p< .01) dogmatic counterparts, 
the difference between medium and low dog- 
matics was not significant (92 = 91, ns). 

Significant product-moment correlations of 
dogmatism with differences in actual-ideal rank- 
ings were found for Task 1 (developing curricu- 
lum; r =—.52) and Task 2 (organizing for in- 
struction; r = —.35). Other significant correlates 
of dogmatism include actual rankings of Task 1 
(r = —.61) and ideal rankings of Task 2 Ta 
45). Tt is apparent from these results that high 
dogmatism is associated with low-actual rankings 
of curriculum development and high-ideal rank- 
ings of organizing for instruction. 
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learly support the hypothesis of 
estigation and help augment the 

ity of the Rokeach Dogmatism 
limited by the self-report nature 

the finding that closed-minded 
anifest significantly lower ideal- 
ndences of task priorities than do 
aded colleagues is consistent with 
closed-minded individuals are 

to the external control of author- 
irthermore, the correlations of dog- 
task rankings support the notion 
dogmatic individuals see themselves 
og innovative tasks (e.g., develop 
ss, and they idealize task organiza- 
ganize for instruction) more than 
w-dogmatic counterparts. Taken to- 
iœ results lend credence to the theo- 
nulations of Rokeach (1960) and help 
or the puzzling variability among 
in the degree to which they match 
ind perceived performance. If the 
‘ence of ideal and actual task rankings 
as a sensible index of job satisfaction, 


‘lows that dogmatic supervisors are gen- 


content with their work, However, it 


emphasized that the present results are 


SHORT Norm 


exploratory and additional research is needed to 
validate the link between indices of job satisfac- 
tion and the correspondence of task rankin 
Hopefully, such studies provide additional 
sights into how personality influences job 
faction and performance. 
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An important development in the field of or- 
ganizational psychology within recent years has 
been the empirical testing of propositions related 
dio the classical school of management thought 
(see Filley & House, 1969). Such testing is criti- 
cal if the classical principles of management 
thought are to be applied in a precise and bene- 
ficial manner. 
One of the prominent classical principles js 
unity of command. This principle postulates that 
a subordinate should receive orders only from 
one superior. Only a few research studies of this 
principle have been completed. A recent review 
of the research has suggested that the violation 
ol the principle does, in fact, result in dysfunc- 
tional consequences (Rizzo, House, & Lirtzman, 
1970; see also Kahn, Wolfe, Quinn, & Snoek, 
_ 1964). These consequences include an increase in 
role conflict, employee dissatisfaction, and in- 
_ effective performance. 
However, some theorists have argued that the 
strict use of such classical principles appears to 
= Maximize efficiency and reduce role conflict only 
when the technology demands a bureaucratic 
_ Organizational structure (e.g, see Lawrence & 
Lorsch, 1967). Such a situation would be ap- 
proximated by a technology of medium com- 
_ plexity, for example, mass production assembly. 
the technology becomes more complex, it 
The Civil Service Commission Provided support 
and assistance for this research in a government 
agency. 
Ihe authors wish to thank Rodney Morton for 
Processing the data. 
Requests for reprints should be sent to Martin 
Gannon, College of Business and Public Admin- 
istration, University of Maryland. 
r 8 
= *Both authors are also at the Bureau of Person- 
Management Evaluation, Civil Service Com- 
mission, Washington, D.C. 


UNITY OF COMMAND AND JOB ATTITUDES OF MANA‘ 4 
IN A BUREAUCRATIC ORGANIZATION * 


MARTIN J. GANNON ® axb FRANK T. PAINE è 
Department of Business Administration, University of Maryland 


Several principles, commonly termed the classical principles of managem« 
although enunciated in order to help managers increase organizational 
ciency and effectiveness, have been tested only on a minimal basis. The pr 
study concerns the classical principle of unity of command, that is, a sub 
nate should report only to one superior. The subjects were 304 governr 
managers working in a bureaucratic agency, 181 of whom reported only to 
superior while 123 reported to two or more superiors. Results indicated t! 
at least within a bureaucratic milieu, the violation of this principle was rela 
to the predicted negative attitudinal orientation of managers. 


appears that individuals are m: fied if 
more flexibility is provided throug olation 
of the principle of unity of com) ice the 
requirements of the work necessi a Sys- 
tem. This particular situation js pproxi- 
mated in technologically advanced s indus- 
tries such as chemicals, 

Some descriptive research appe ubstan- 
tiate this point of view, For insta: odward 
(1965) found that supervisors an igers in 
process industries generally enjoy ir work 
when the principle of unity of nd was 
openly violated, since they needec ylity in 
order to carry out their assignments nversely, 
individuals responded favorably to rict use 
of the principle of unity of command «hen they 
worked in an organization that was - ructured 
in terms of medium technological complexity, 
for example, the automotive assembly Jine. 


The present study was conducted in a large 
government agency, the General Services Admin- 
istration, which is a large nationwide organiza- 
tion engaged in supply, maintenance, and pro- 
tective activities. In terms of Thompson's (1967) 
well-known typology, this agency can be char- 
acterized as one with a mediating technology, 
since its basic purpose is to link clients or cus- 
tomers who are interdependent. Under such con- 
ditions the imposition of bureaucratic rules of 
behavior appears to be most advantageous. Con- 
sequently, we expect the strict use of the princi- 
ple of unity of command to be associated with 
less role conflict and increased employee effec- 
tiveness within this bureaucratic setting. In ef- 
fect, these are the major hypotheses under 
investigation in the present study. 


Metiop 


Owing to the complexity and geographical 2 ! 
persion of operations in General Services 
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ministr researchers decided to use a 
question could be answered meaning- 
fully a nously by the respondents. A 
random í 20% of the managers was then 
taken s were operationally defined as 
those w t the work of other supervisors 
and/or unagers. Questionnaires were sent 
to 440 rs. Only 330 managers returned 
the qu’ res, However, only 304 question- 
naires ible. Hence, the usable response 
rate w Of the 304 respondents, 181 re- 
ported one superior while 123 reported 
to tw re superiors, While this fact was 
determ ‘ely by means of the questionnaire, 
the an of the organizational chart indicated 
that e nager should be reporting only to 
one Hence, violations of the principle 
of uni command seemed to exist, at least as 
measu y the responses to this issue by the 
manag volved in the present study. 

Th tionnaire was designed to measure 
severa racteristics of managerial and organi- 
zation ectiveness which have been identified 
as mic ¿o criteria (see House & Rizzo, 1972; 
Mahe & Weitzel, 1969; Paine, 1970). The 
questi ire included items touching on the 
dimen. of organizational effectiveness such as 
emplov > utilization, planning, selection based on 
ability ad role conflict and ambiguity. Many 
of th ms were taken from the validated or- 
ganiz: al practices’ scales developed by House 


and R and by Mahoney and Weitzel. How- 
ever, ome of the items were selected because of 
the e characteristics and problems facing 
the orzanization under study- 

The questionnaire contained 50 attitudinal 
items. Subjects could respond to each item in 
terms of a S-point scale of agreement (1) to 
disagreement (5). These items were then factor 
analyzed by means of the principal-component 
method, orthogonal rotation (BMD 03M). The 
eigenvalue specified in this rotation was 1.0; the 
commonality estimate for the diagonal was 1. 
Eleven factors were derived. Two criteria were 
then emnloyed in order to interpret the derived 
factors. First, only items loading greater than 
zs were considered. Second, complex items— 
those with relatively high loadings on two or 
More factors—were not used for interpretation. 
The ¢ test for independent samples was then 
used to test the significance of the difference 
between the mean factor (standard) scores for 
each factor for those managers reporting only to 
one superior and for those managers reporting 
to more than one superior. 


Resvutrs * 


with the concept of selection based 


are available for people 
(b) People who do a 
(,72). (c Promotions are based on ability 
talent (.70). 

Initiative-flexibility appeared to be reflective 
of Factor 2, The items with the heaviest loade 


instructions readily (40). ; 

Factor 3 seemed to be indicative of role con- 
flict and pressure. The items loaded heavily on 
this factor were: (a) People have questions 
about who is really their supervisor (.66), (b) 


as: (a) People have a pretty clear idea of the 
quality expected in their work (.76). (b) People 


know pretty clearly what is expected of them 


(.60). 

Employee utilization seemed to be the basis for 
Factor 5, as two items were loaded heavily on 
it: (a) People feel their jobs are too narrow and 
specialized (—.74). (6) People do too much 
unnecessary work (—.46). Factor 6 was indica- 
tive of coordination. The items loaded heavily on 
it were: (a) People are not receiving the help 
they need to learn the job right (—.69). (b) 
People can’t seem to agree on just what they are 
trying to accomplish (—.58). (c) Work that is 
started is often never completed (—. 84). 

Factor 7 was loaded heavily by one item: Peo- 
ple are not satisfied with performance appraisals 
(84). Factor 8 was illustrative of adequacy of 
delegation. The items with heavy loadings in- 
cluded: (a) Higher level people do too much 
lower level work that could be given to others 
(—.71). (b) People must get approval for deci- 
sions that they should be able to make them- 
selves (—.50). Factor 9 included two items with 
heavy loadings that are concerned with employee 
effort: (a) People put in more effort than they 
are required to do (.77). (b) People do their 
jobs without too much follow-up and checking 

More detailed statistical tables and a comparison 
of the factors derived in this study to those of the 
Mahoney and Weitzel (1969) study and to those 
of the House and Rizzo (1970) study may be ob- 
tained from the first author. 


(64), Factor 10 focused on planning activity. 
The items loaded heavily on this factor were: 
(a) Jobs are planned before they are started 
= (64). (b) People are asked for Suggestions on 
bow to improve the work (.60). Factor 11 
~ seemed to involve employee development. The 
items with heavy loadings were: (a) There are 
not enough people who can do good work (—.62). 
_ (6) People do a quality job (.52). 

Results of the £ test indicated that four of the 
II factors derived in the analysis were statisti- 
= cally significant, and all of these relationships 
were in the predicted directions. Thus, managers 
with two or more superiors were significantly 
different from managers with only one superior 
in that they tended to feel: (a) selections are 
not based on ability (f = 2.27, $ S. 01). (b) 
Role conflict and pressure are high (t= 2.54, p 
<.01). (c) There is need for coordination t= 
3.03, p<.01). (d) More delegation is needed, 
that is, there should be more responsibility and 
autonomy associated with each job (t=2.70, 
7 <01). 


DISCUSSION 


The present study has added support to the 
proposition that the strict use of the principle of 
unity of command is beneficial if the technology 
p demands a bureaucratic structure. In the govern- 
ment agency under study, the violation of the 
Principle of unity of command was related to 
dysfunctional consequences as measured by the 
attitudinal orientations of managers. 

3 In summary, it appears that a mediating tech- 
nology which links clients or customers together 
in an interdependent manner demands the strict 
implementation of the Principle of unity of com- 
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Snonr Notes > 
mand. In such a situation it appears that 
ordinates want to report only to one sup 
since the work requirements are such as 


demand a well-defined bureaucratic s ture 
While other technologies may necessitate d 
ferent strategies, it seems that a mediating 
nology can achieve organizational efficiency a 
effectiveness if subordinates operate in a situa- 
tion where they can clearly identify their 
superior. 
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REPETITION OF HIGHLY SIMILAR MESSAGES 
AND ATTITUDE CHANGE * 


EE McCULLOUGH ? THOMAS M. OSTROM 


Ha ites, Philadelphia, Pennsylvania Ohio State University 
lessage repetition was predicted to increase positive attitudes in a situation 
here highly similar communications are used. Five similar advertisements 
i.c., those using the same basic arguments but differing in the phrasing and 
rder of points raised) served as stimulus messages. These ads were sequen- 
ially presented with attitude being measured by a cognitive response analysis 
f thoughts recorded by subjects attending to each message. Results supported 
he prediction of a positive relationship between the number of presentations 


ind attitude. 


Ady s have long advocated repeated ex- vious commitments, thereby allowing greater 
posure ame ad and attempted to develop agreement with the advocated position. Second, 
measur investigate the efficacy of this ap- people attend more closely to the message, For exe 
proacl igman, 1968). Several laboratory ample, Brock, Albert, and Becker (1970) demon- 
studies dberg (1954), Wilson and Miller strated that subjects show higher selective atten- 
(1968) ison and Watkins (1971), however, tion to a new communication advocating a known . 
have s! that repeated presentation of a per- stand than to a previously encountered message, 
suasive ige does not produce more immedi- even when holding such factors as message utility 
ate atti change than does a single presenta- and supportiveness constant. Thus, it was hypoth- 
tion. important similarity among these esized that in a situation in which highly per- 
failures ` obtain a repetition effect is that they suasive communications were used as stimuli, 
all repea'<- identical communications, and conse- à positive relationship is found between the 
quently, e subjects did not receive new infor- number of message repetitions and increasingly — 
mation «> arguments as a function of repetition. favorable subject attitudes. 

Sears and Freedman (1965) found evidence that 

People ae more willing to change their attitudes a METHOD 

when they think a message contains new infor- Subjects 

mation than when they expect a message to The subjects were 57 introductory psychology 
repeat viously encountered information. Tt student volunteers from Ohio State University, 
may be, then, that the repetition effect can be who viewed the ads while in groups of from one 


obtained when similar but nonidentical messages 
are used. 

There are two reasons for believing that the repe- 
tition effect might emerge for similar messages. 
First, Sears and Freedman (1965) hypothesized 
that the expectation of new information provides 
à satisfactory justification for relinquishing pre- 
— — 
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to five persons. Data from seven subjects were 
discarded before analysis because of their failure 
to correctly follow instructions. 


Communications 


The persuasive communications were 10 maga- 


zine ads published between 1960 and 1965. Half 


of these ads argued for the purchase of the 
product, Yardley After Shave, while the other 


five solicited contributions to the United Service 
Organizations (USO). Within each unit the par- 
ticular ads chosen used the same general themes 
to support an identical position. The five Yardley 
ads each mentioned that (a) the product was not 
a typical after-shave; (b) it contained agents to 
prevent infections; (c) the fragrance was pleas- 
ant; and (d) Yardley replaced skin moisture. 
The USO messages stressed the following four 
themes: (a) Loneliness of young men away from 
home, (b) their tough working conditions, (c) 
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The various argument topics within a unit, 
though present in advertisements, were 
ap- 


= Subjects were Informed that they would see 
five ads, cach of which had been broken into 


Two groups of 25 subjects viewed cither the 
five Yardley or USO messages. For each adver- 
the subjects first very briefly saw an 
picture of the message. After this brief 


for 45 seconds, showed the graphic dominating 
the advertisement and the first segment of the 
‘written message, While this slide was being 
bonn, the subjects recorded their reactions in 
the response booklet. At the end of the 45-second 
Period the subjects were told that they could 
continue writing until they were finished record- 
ing their reactions, The second slide showing the 
same material as the first, plus the next segment 
of the message, was then projected for 45 sec- 
X onds. The beginning of the new message segment 
was indicated by a small triangle. The subjects 
again recorded their reactions, after which the 
last two parts of the ad were exposed using the 
same procedure. Following the first ad the ex- 


TABLE 1 


MEAN ATTITUDE AS A Function oF 
REPETITION AND Messace Toric 


Topic 


Yardley 
uso 
Overall 


Nole, USO = United Service Organizations, 


Swoxr Nores 


perimenter told the subjects that each ze 


was to be considered separately and í `e- 
quent ads they were not to refer to uS 
communications, The four remaining Y or 


USO messages were then presented. 


Five different sequences of the five a hin 
a unit were prepared according to Lat ure 
counterbalancing. Five subjects viewed e- 
quence, The four parts into which eac as 
divided were planned to basically corte to 
the four general themes, previously ni hat 
were present in every ad. 
Dependent Variable 

The dependent variable was the sub) net 
cognitive response score for each mes cf. 
Greenwald, 1968). To obtain this score ter 
determined the number of favorable, un ible 
and neutral responses recorded by a sul ind 
computed the net value by subtrac’ the 
number of negative from the number oi tive 


responses. A single response was de! as 


expressing a complete thought, fact, va! rood 
or bad feature, feeling, or belief about osi- 
tion advanced by the advertisement or « the 
ad itself. Cullen (1968) has found that gni- 
tive response measure significantly corre! with 
other “traditional” attitude indicators sa 
variety of issues. 

The scoring technique was found to i- reli- 
able. Four days after the initial scoring =c rater 
rescored 10 randomly selected response oklets 
from each of the two units, The test-reicst cor- 


relations were 97 and .99 for the USO and 
Yardley units, respectively. Ten other randomly 
selected response booklets were scored by a 
second rater, resulting in interrater correlations 
of .83 (USO) and .90 (Yardley). 


REsutts 


Net cognitive response scores were subjected 
to a three-way analysis of variance containing 
the within-subjects effect of repetition (number 
of similar ads presented) and the between- 
subject factors of ad type (either Yardley or 
USO) and counterbalanced order (within each 
unit ads were presented in five orders). Refer- 
ence to Table 1 indicates that, as predicted, the 
overall mean cognitive response score was posi- 
tively related to the number of previously 
viewed ads (linear trend F = 4.55, df = 1/160, 
p<.05). In addition USO ads were found to 
receive somewhat more favorable evaluations 
than did the Yardley messages (F =3.47; 
df = 1/40, p < .10). None of the other compo- 


ding the Ad Type X Repetition inter- 
Was Significant 

i be argued that the score obtained by 
ng negative from positive cognitive re- 
Might fail to differentiate between a 
nin which subjects actually became more 
and one in which they simply attenuated 
Megative responses as the experiment pro- 
d. To examine such a possibility, positive 
Md negative responses were tallied separately 
0 n averaged over both topics. Table 2 indi- 


not actually contain different 
crucial factor is simply the new 
information (to reduce commitment and facili. 
tate attention), 
the old arguments in a 


that the increasing favorability of net 
resulted not just from a restriction of 
č reactions but rather from a tendency 
ive responses to increase while negative 
decreased. In addition Table 2 shows 
“the total number of scored responses re- 
ned relatively constant over repetitions, thus 
ing that there was no decrement in 
on to later ads, 


Discussion 


ne present results show that when similar 
r identical messages were employed, 
on did have an immediate effect on post- 
ure attitudes. Although this finding needs 
e replicated with subjects drawn from other 
ons, several implications deserve men- 


TABLE 2 


or Positive AND NEGATIVE RESPONSES 
AS A Function oF REPETITION 


Repetition 


2 3 4 5 


2.76 | 3.24 | 3.22 | 3.58 | 3.70 
4.64 | 434 3.70 3.54 | 3.04 
7.40 7.58 | 6.92 | 7.12 | 7.64 
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DEVELOPMENT OF SCALES TO MEASURE ATTITUDES TOWAR 
MARIJUANA AND MARIJUANA USERS 


VICTORIA COLAIUTA o GEORGE BREED 1 


University of South Dakota 


Four hundred and fifty students at the University of South Dakota served as 
subjects in the two-phase development of a 22-item marijuana attitude scale. 


The final form of the scale, partitioned i 


nto three factorially-defined, mutually 


exclusive subscales, deals with (a) the psychological, physiological, and socio- 


marijuana as a psychosocial panacea ; 


(c) liberal conservative ambivalence toward the usage of marijuana. 


Much of the information about attitudes 
toward marijuana has derived from attitude 
scales incorporated into questionnaires (DeFleur 
& Garrett, 1970; Eells, 1968; Gallup, 1969; 
Popoff, 1970). However, little evidence exists 
concerning the reliability and validity of these 
scales. Consequently, generalizations based on 
such data are limited, 

The purpose of this study was to develop a set 
of reliable subscales for measuring several dimen- 
sions of attitudes toward marijuana, 


METHOD 
Phase 1 


Subjects. Three hundred and seventeen under- 
graduate and graduate students served as volun- 
teer subjects, 

Procedure, 


heretofore contained items relevant to the psycho- 


that substance (Eells, 1968; Gallup, 1969; Pop- 
off, 1970). Both types of statements were in- 
corporated in the present study, but for the 
coverage of attitudes in these areas additional 
items were included, Some dealt with the roles 
of the federal, state, and local governments as 
monitors, enforcers, and prosecutors under pres- 
ent law. Other statements had to do with atti- 
tudes about marijuana as a first step toward the 
use of “hard” drugs, as a carcinogen, and as a 
causative agent in birth defects. To increase 
coverage of a third content area, psychosocial 
aspects of marijuana use, items about marijuana 
as a social problem, and about the possible 
effects of marijuana use on the social develop- 
ment and adjustment of individuals, were added. 


Requests for reprints should be sent to George 
Breed, Department of Psychology, University of 
South Dakota, Vermillion, South Dakota 57069, 
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Forty-one items were developed in man- 
ner. To reduce the possibility of res; bias, 
18 of the statements were worded so t igree- 
ment indicated a positive attitude tow mari- 
juana, and 23 statements were phrase. that 
agreement indicated a negative attitude ‘wenty 
graduate psychology students served as ges in 
evaluating the 5-point Likert-type state: ats as 
favorable or unfavorable toward mariju- Next, 
sets of 21 and 20 of the statements ran- 
domly assigned to each of two differ: forms 
of the scale (Forms “F” and “S”, resp. ely). 

Form F was administered to 155 dents, 
Form S to 162 students, The correlation natrix 
of each form of the scale was factor «ialyzed 
by the basic-structure-successive factor ..cthod 
with varimax rotation (Horst, 1963}, using 


squared multiple correlation communal y esti- 
mates (Harman, 1969), An adaptation of the 
goodness of fit test for the factor model (Mor- 
rison, 1967) was computed for eight essi ve 
factors extracted in each analysis. 

Nonsignificant chi-square/degree of freedom 
values, an eigenvalue cutoff of .95, and the rela- 
tive drops in the Percentage of variance ac- 
counted for by successive factors were criteria 
that indicated the number of factors actually 
warranted for each form was less than the origi- 
nal eight extracted, Therefore, a second factor 
analysis was carried out in which four factors 
were extracted for Form F and three were ex- 
tracted for Form S. 

Each factor defined a subscale, with the abso- 
lute maximum factor loading of an item across 
factors determining the subscale in which it 
would be exclusively included. Item-total corre- 
lations with the effect of the item removed from 
the total score? (r)) were computed for 
each statement within a subscale (McNemar, 
— Ss 

*Total score refers to a total subscale score, not 
the total score for all items in all subscales. 


SC) 


„ 


Sort Notes 


1969). Sever nents with rx) values 
* equal to or les 0 were omitted, while the 
34 statements ed were combined into a 


Single scale fi e€ 2 of the study. Phase 2 
was performe: ‘erive a single form of the 
scale involving us subscales, 
Phase 2 
Subjects. I bjects were 137 male and 
female under; € and graduate student vol- 
unteers. No sı had taken part in Phase 1. 
Procedure clions given the subjects and 
the procedur ed to in analyzing data were 
identical to nployed in the initial phase 


of the study 


R 8 AND Discussion 


Four facto 
final factor 
ever, any ite 


> extracted and rotated in the 

is of the attitude scale; how- 
ü an x. value of equal to 
or less than is discarded. This resulted in 
the elimina: Factor 3. Twenty-two items 
comprised ti al three subscales. 


Subscales © final attitude scale dealt with 
(a) the psy: ical, physiological, and socio- 
logical effec; marijuana on users (Effects) 


(eight items) 
panacea (Pan 
Conservative 

Marijuana (A 


7) Marijuana as a psychosocial 
a) (six items); and (c) liberal- 
bivalence toward the usage of 

alence) (eight items). Exam- 


ples of the j included within each subscale 
With factor loscings are presented in Table 1.3 
Items within ; bscale Effects have marijuana 


Wers as their attitude object. Agreement with 
the items indi aies that the effects of marijuana 
On users are regarded as both personally and 
Socially detrimental and that those individuals 
Mio use marijuana should be held legally re- 
Ponsible for their actions. 

The statements comprising subscale Panacea 
neasured attitudes toward marijuana per se. In 
macea, marijuana is established as beneficial 
oth to the individual and to the society. Be- 
ause marijuana produces desirable effects and 
onsequences, usage is encouraged. 

7 the third subscale, Ambivalence, a central 

tude of personal conservatism is coupled with 

x feeling that others should be allowed to use 
‘tijuana without the threat of legal penalty if 
so choose, In fact, each individual should 
free to do “whatever he wishes” about mari- 
—— 


s 7 
oe detailed information on scale character- 
Lan be obtained from the authors. 


v7 eas 6 OP 
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TABLE 1 


EXAMPLES OF SUBSCALE STATEMENTS AND 
THEIR Factor Loapings 


Statement Factor 
loading 
Factor I (subscale Effects) 
Marijuana has been found to cause birth 
defects. 72 
Anyone who uses marijuana is immature, 66 
Use of marijuana results in self-destruction, 60 
Factor II (subscale Panacea) 
Everyone should be encouraged to try mari- 
juana. 83 
Marijuana brings out the best qualities in 
men. +63 


Marijuana is the single thing in the world 
today that can help promote love and 
brotherhood among nations, 62 

Factor IV (subscale Ambivalence) 

A person should be allowed to use mari- 

juana without the threat of legal penalty 


if he so chooses, —.69 
Marijuana should be avoided at any cost. | —.69 
The slight benefits of marijuana hardly 

justify its use. —.65 


juana. Ambivalence may indeed closely represent 
the attitudes of a number of people toward mari- 
Juana—specifically, that they are not sure how 
they feel about marijuana, given conflicting in- 
formation about the positive and negative effects 
of the substance. Thus, they are cautious re- 
garding personal use, but are willing to allow 
others to do as they please. 

The factors Effects, Panacea, and Ambivalence 
account for 34%, 11%, and 3% of the variance 
in the total scale, respectively. Estimates of 
internal consistency (coefficient alpha) for the 
respective subscales were .90, -86, and .92, 

Potential users of the scale should keep in 
mind that the scale is based on the responses of 
South Dakota college students, Until other popu- 
lations are sampled, the scale’s general applica- 
bility should be viewed with caution. In con- 
structing the present scale, efforts were made to 
include items from all relevant content areas 
(psychopharmacology, law, enforcement, physical 
and social effects of marijuana), Future develop- 
ment of the scale should include a further 
broadening of item content. 

While the present analysis bears on the con- 
tent validity of the scale, the degree of predic- 
tive validity and the ability to discriminate 
among various classifications of marijuana users 
and nonusers need to be determined, 


400 Snort Notes 


REFERENCES Horst, P. Factor analysis of data matrics. Ne 

DeFixen, L. B., 4 Garnet, G. P. Dimensions of Volk: Holt, Rinehart and Winston, 1965. 

marijuana use, Journal of Counseling Psychology, McNemar, Q. Psychological statistics. (4th 

1970, 17, 468-476. New York: Wiley, 1969. 
Erus, K, Marijuana and LSD: A survey of one 


Morrison, D. F. Multivariate statistical met 
college campus. Journal of Counseling Psychology, i Hi 
1968, 15, 459-467. New York: McGraw-Hill, 1967. 


Gallup. Poll published in the New York Times, Oc- Pororr, D. Feedback on drugs. Psychology Ti 
tober 23, 1969. 1970, 3, 51-52. 

Haeman, H. H. Modern factor analysis. (Rev. 2nd y 
ed.) Chicago: University of Chicago Press, 1969. (Received March 22, 1973) 


ANNOUNCEMENT 


As of June 1, 1974 the editorial office of the Journal of Applied Psychology 
is moving to a new address. Please send all manuscripts and correspondence regard- 
ing manuscripts to Dr. Edwin A. Fleishman, Editor, American Institutes for 
Research, 3301 New Mexico Avenue, N.W., Washington, D.C. 20016. 


Journal of Appli 
1974, Vol. 59, N 


LANDSCAPE OFFICE ATTITUDES: 


RE} LIONS OF PERCEIVED DEGREE OF CONTROL OVER 
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mponent structure of attitudes toward a new office development con- 


en Office landscaping, were examined. A principal-components factor 

of attitude data from 600 respondents identified the following six 
der oblique factors: (a) greater job satisfaction, (b) more effective 
inication, (c) greater productivity, (d) lack of auditory privacy, (e) 
personal privacy, and (f) lack of confidentiality. An underlying bi- 
second-order component termed “degree of perceived control over one’s 
tions with the environment” was found to underlie the attitudes. 


> 
A major in the design of office accom- 
modation ha the use of open office land- 
Scape forma acoustically screened off work 
Spaces repla: onventional walled offices. Its 
controversia! e is typified by conflicting re- 
search repori its positive and negative conse- 
quences (e.s ter communication and cohe- 
l sion versus er noise and loss of privacy; 
Cereghino, Riland & Kurtz, 1968). 
A This study z=>resents an attempt to provide a 


Preliminary thes 
and relates tj; 
Pothesized th: 


ctical structure which underlies 
conflicting feedback. It is hy- 
4 single bipolar theoretical fac- 
tor, namely, he degree of control a person 
perceives h f to have over his two-way 
transactions with his environment, underlies the 
often cited positive and negative office landscape 
evaluation feedback, 


A METHOD 
Participants 


The participants were 600 employees repre- 
sentative of the total 1,130 (except for an under- 
Tepresentatation of support staff) of a Canadian 

ederal Government Department housed in a 
Modern office tower centrally located in a large 
8 Metropolitan area. Office planning and system 

70 expertise had been employed in the de- 

“opment of the landscape office design and 
a 


esate for reprints should be sent to Michael 

ogy Carrey, who is now at, Faculty of Psychol- 

Peay of Ottawa, Ottawa 8, Ontario, 
a. 


; c Now at 


anada, the University of Ottawa, Ontario, 


implementation, and, overall, 120 square feet of 
floor area had been allocated per employee. 


Measuring Instrument 


A 53-item Likert format, Open Office Opinion 
Inventory, was developed on the basis of (a) 
earlier measuring instruments (Allison, 1970: 
Riland & Kurtz, 1968), and (b) departmental 
input from representatives of each branch. Con- 
tent included items designed to assess attitudes 
toward the visual layout, the noise level, the 
communication facility, perceived solidarity, job 
satisfaction, clarity of work space definition, and 
so on. 

The reliability of inventory was checked by 
examining both internal consistency and test- 
retest data. In the case of internal consistency, 
the response data of the 600 subjects on seven 
randomly chosen repeated items (slightly differ- 
ent format) yielded a contingency coefficient of 
C=.72 with a chi-square test (p < .001). The 
response data of 28 subjects who took the 
complete inventory twice yielded a test-retest 
contingency coefficient of C=.67 for all 53 
items over a 24-day time interval. The re- 
sponse data thus appeared to possess acceptable 
consistency and stability over time. 

The validity of the response data was checked 
by first looking at the extent to which acquies- 
cence and social desirability response sets were 
operative. Contingency coefficient estimates of 
the relationship between item endorsement and 
item bias and social desirability cue value were 
C=.13 and C = .24, respectively. The construct 
validity of the 53-Likert items was investigated 
by intercorrelating (point-biserial r) the 600 
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responses generating a 53 X 53 matrix. A prin- 
cipal-components analysis (little Jiffy Mark IV, 
Kaiser, 1970) of this matrix yielded empirical 
evidence that items intended to tap a specific 
characteristic did in fact cluster with other simi- 
lar items, thus giving evidence that they had 
functioned in accord with the design intent.“ The 
overall measure of sampling adequacy was .90, 
while the index of factorial simplicity was .78. 


RESULTS AND DISCUSSION 


The exploratory principal-components analysis 
of Kaiser was run on the 53 X 53 matrix, and 
orthoblique rotation was performed. Six main 
components were obtained and rotated (see 
Footnote 3). 

Component 1 appears an overall preference 
factor combined with greater job satisfaction, 
while Component 2 seems the familiar more ef- 
fective communication, solidarity and contract 
feature. Component 3 seems to indicate the re- 
current greater productivity variable in this feed- 
back data, while Component 4 appears the often 
repeated lack of auditory privacy feature. The 
recurring lack of personal privacy consequence 
of landscape office appears to have come out 
as Component 5, while lack of confidentiality 
communications content seems indicated by 
Component 6. 

The component intercorrelation matrix showed 
all first-order components to be significantly 
related to each other (see Footnote 3). A second 
principal-components analysis (Little Jiffy Mark 
IV) was run on the 6X 6 first-order matrix. 
Only one eigenvalue was greater than unity so 
the analysis identified a single second-order com- 
ponent that accounted for 67% of the variance. 

It is suggested that this single second-order 
component reflects an underlying perceived de- 
gree of control over the individual’s two-way 
transactions with others in the office landscape 
environment. Conceptually it appears the item 
definers for the six first-order components pro- 
vide evidence to support the idea that these six 
are attitudinal reflections of one underlying 
construct, namely, degree of perceived control 
over transactions with the environment, 

Looking at the negative pole, it is suggested 
that when respondents in landscape office per- 
ceived themselves to have inadequate control 
over input to and from the environment, they 
reported (a) lack of auditory privacy (second- 
order factor loading, —.77), (b) lack of personal 
privacy (factor loading, —.71), and (c) lack 


®Tabular data is available from the authors on 
Tequest. 


Suort Notes 


of confidentiality of communica content 
(factor loading, —.58) as consequen 

In other words, too much noise, my dis- 
tractions, lack of ability to put su d effort 
on a problem, lack of territory defi overly 
great accessibility to others, and 8 appear 
empirical reflections of a perceive: lequate 
degree of control over their own others’ 
transactions with the environment. ‘I \ppears 
thus evidence of an “information os 1 phe- 
nomenon” that seems the conseque f con- 
tinual exposure to unpredictable lation 
(noise) from a landscape enviro which 
may continually alert and irritate t) ividual 
in his work space. In the event tha viduals 
cannot take appropriate compensat teps to 
deal with the greater constraints ; n their 
behavior, it appears they report the ar loss 
of visual, auditory and personal priv 

On the other hand let us look at positive 
pole and examine the attitudinal fec k from 
office landscape incumbents who ms ve suf- 
fered no such loss of control over =raction 
with their environment. It is hypoth=< ed that 
the item definers for Components and 3 


(second-order factor loadings of „ 47, 93, 


respectively), such as, permits grea solidar- 
ity, offers greater contact, permits er job 
satisfaction, am more productive, sverlie an 
adequate level of perceived control over two-way 
transactions with the environment. 1 
How can the same office landscape environ- 
ment induce both the hypothesized late and 
inadequate perceived control over environmental 


transactions in it? Barker (1968) su 


behavior in the environment. By way of example, 
the environmental “press” or inputs on a motor- 
ist traveling at 95 miles per hour is not the 
same as that of another traveling at 60 miles per 
hour even when both are in the same traffic 
system on the same expressway. This research 
appears a rich area for further exploration both 
empirical and theoretical. 

In retrospect then, it has been suggested the 
positive and negative attitudinal feedback data 
concerning an office landscape experience of 600 
respondents is reflective of an underlying more 
basic perceived adequacy of control over trans- 
actions with the environment. One second-order 
component underlies Components 1 to 6, SUC 
that greater job satisfaction, more effective com 
munications and greater productivity reflect an 
adequate ability to control interaction with one $ 
environment, However, lack of visual and aud- 


tory privacy, 
munications, lac 


of freedom int: 


ability to contr 


ment, resulting 


load phenome 
adequate or in 
ronment is the 
job performan: 
objective meas 
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environment 


SxHort Notes 


f confidentiality of com- 
territory definition and lack 
tions reflect an inadequate 
‘eraction with one’s environ- 
possible “information over- 

A consequence of such 
ite control over one’s envi- 
eption of greater or poorer 
hat may well relate to an 
cnt of job performance. 
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DEVELOPMENT OF A BIOGRAPHICAL INVENTORY BLANK 
TO PREDICT FAKING ON PERSONALITY TESTS. 


JEFFREY COHEN ? axb JOEL LEFKOWITZ 
Baruch College, City University of New York 


Fourteen Biographical Inventory Blank (BIB) items significantly predicted 
the propensity to fake personality inventories in a socially desirable manner, 


as measured by the K Scale of the Min: 
(MMPI). Item analyses were performed 


and the derived scoring weights were cross-validated on the responses of 42 


other job applicants (r= .66, p < 01). 


The use of psychological tests for personnel 
Selection is often criticized on ethical as well as 
empirical grounds. Personality tests are addition- 
ally criticized because they are subject to faking. 
Indeed, one writer indicates that faking consti- 
tutes the major source of systematic variance 
of most personality inventories (McGee, 1962). 

A biographical inventory blank (BIB) which 
Successfully predicts faking could serve four 
purposes: (a) a moderator of personality test 
validity (hence interpretation) for individuals, 
(b) a suppressor of invalid personality test vari- 
ance, (c) a relatively unobtrusive independent 
personality (predictor) measure, and (d) a 
means of identifying those personality charac- 
teristics which reliably differentiate falsifiers 
from nonfalsifiers in any given situational context, 


METHOD 


In this study “faking” is defin 
case of the social-desirability-response set, op- 
erationally measured by the K Scale of the 
Minnesota Multiphasic Personality Inventory 
(MMPI). The K Scale was arbitrarily chosen 
over other possible psychometric measures be- 
cause of its widespread usage and acceptance in 
clinical practice, as opposed to instruments 
usually deemed suitable for only research pur- 
Poses. Moreover, a “one shot” measure was 
purposely chosen, rather than operationalizing 
faking as a change in response following an 
induced set to fake. This was done because (a) 
the research was undertaken with job applicants 


1 This study is based on the first author's master’s 
thesis, written with the Supervision of the second 
author, Also at BFS Psychological Associates, Inc, 

Now at Personnel Research and Development 
Corporation. 

* Requests for reprints should be sent to Joel 
Lefkowitz, Psychology Department, Baruch College, 
City University of New York, 17 Lexington Avenue, 
New York, New York 10010, 


ed as a special 
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nesota Multiphasic Personal Inventory 
on the responses of 76 job applicants, 


in the real-life (i.e., nonresearch) process 
applying for specific positions and (6) the in- 
vestigators were interested in approximating 4 
measure of actual faking rather than “fakability.” 
A shortened MMPI was used, consisting of all 
30 K Scale items and the 45 items immediate 
preceding and following those in the com 
MMPI, for a total of 75 items. 
The equivalence of the K Scale in the short- 
ened form to that in the complete MMPI 
estimated by administration of both measures to 
20 undergraduate and graduate students. The 
order of presentation was reversed for half ol 
the group. The time interval between adminis! a. 
tion was three to 10 weeks. The correlation be- 
tween the two measures was .90. f 
The BIB consisted of 80 items derived fron 
Division 14 of the American Psychological As- 
sociation catalog of life history items (Glennon, 
Albright, & Owens, 1966). 
Both measures were administered to 118 appli 
cants for a variety of clerical, supervisory, and 
sales positions that averaged $8,000 annual 
ary. The measures were included as part of @ 
larger battery administered by a consulting firm 
to which the applicants had been sent by ; 
prospective employers. 
Item analysis for the BIB was performed on 
a randomly selected group approximately twi 
as large as the hold-out group to be r 
cross-validation (n = 76 and 42, respectively). 
The criterion group was dichotomized at 
median K-scale score (18). Chi-square anal 
the vertical percent method, and Strong’s tab 
of net weights converted to unit direction 
weights were used to develop the BIB (Guio 
1965). 


RESULTS AND DISCUSSION 


Fourteen BIB items were statistically sign A 
cant (eight items, p <.05; six items, p< 1 
Applying the derived weights for these items t0: 


the responses 
correlation be:t 


The BIB pr 
is of one who 
“occasionally 
“middle of 
bothered by ix 
problem. He te 
its and takes 
not repaid on 
his parents per 
his own, anc 
ill; he mas 
pendent read 
experience v 
and he went 
while in high 
man, and hi 
living quarte- 

These den 
at face value 
reflect the < 
fostered a p: 
tory respons 
However, the 


of 66 (p < 01 


Suort Nores 


hold-out group yielded a 
he BIB and K-scale scores 


f the predicted dissimulator 
r 26 years of age, married, 
jokes, considers himself 
d? politically, and is not 
‘errupted in the middle of a 

) ignore co-workers’ bad hab- 
action when overcharged or 
n. During his school years, 
most desires if obtained on 
| a physician when he was 
difficult problems by inde- 
most outstanding negative 
of achievement in sports; 
ne or fewer nights per week 
|, His father was a business- 
is satisfied with their present 


} 


hic characteristics can be taken 


is, as personal attributes which 


mental conditions that have 
sity to fake personality inven- 


a socially desirable manner, 


ems themselves may, in part, 


e 
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reflect primarily that same social - desirabillty- 
response set measured by the K Scale. (Ten of 
the 14 significant items request opinions or 
unverifiable facts which could be biased in a 
stereo-typically-socially-desirable direction. Nine 
of those 10 positively weighted alternatives are 
responses which imply independency, achievement 
orientation, and conventionality.) In order to 
determine whether such is the case, future re- 
search ought to include: (a) other potentially 
fakable BIB items (which, however, might not 
be predictive), (b) items which seek to ascer- 
tain the same information but in a more unob- 
trusive manner, and (c) items which are subject 
to external verification, 
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THE INFLUENCE OF TECHNICAL KNOWLEDGE 
ON POSITION ERROR IN RANKING 


EDWIN E. WAGNER * axo THOMAS O. HOOVER 


— 


University of Akron 


to 
the dimension being ranked. 


Symonds (1936), who studied the influence of 
jon on ranking error within the con- 
of an interest questionnaire, concluded that 
overranking tends to i 
a ranking series and then progressively decreases, 
an experimental design which 
more adequately controlled for context effect, it 
has been shown that there is a tendency to 
overevaluate the first and last positions when 
ranking a set of five entities. This terminal error 
not only under theoretical conditions 
when subjects may have been relatively unin- 
volved with the stimuli to be ranked (Wagner & 
1974) but also under more meaning- 
circumstances which might have been ex- 
to evoke strong personal feeling (Wagner 
Hi » 1974). Since ranking procedures are 
t for applied areas of psychology, fur- 
research on this phenomenon seemed war- 
ted, particularly under conditions approximat- 
real-life situations. The present study at- 
to investigate the effect of technical 
knowledge (i e., correct information relating to 
the entities and/or dimension to be ranked) on 
position error in a situation requiring the direct 
evaluation of live human beings. 


ji 


METHOD 


Five Air Force Reserved Officers’ Training 
Corps (ROTC) cadets, members of the Saber 
Squadron, a military drill team at the University 


clearly discernible to the TV viewer. All military 
honors, rank, decorations and other means of 
identification were removed from their uniforms 
prior to the drill. The drill sequence was then 


1 Request for reprints should be sent to Edwin E. 
Wagner, Department of Psychology, University of 
Akron, Akron, Ohio 44325, 


The biasing effect of position error in ranking was demonstrated in a situation 
approximating real-life conditions. It was also shown that position error is less 
when subjects undertaking the evaluation are knowledgeable 


presented via closed circuit TV to a | group 
of 120 general psychology underg es who 
had no previous exposure to servi ted ex- 
periences and were therefore cons to be 
relatively unskilled in judging milita ill. 

The experimental group was com of 120 
cadets in the Air Force or Army R( vho had 
been trained in drill exercises and cou herefore 
be considered expert observers. All = ts were 
instructed to rank the individual me: of the 
drill team from one to five in tern judged 
ability to properly execute the mane: Evalu- 
ation sheets were then handed ou! ich ob- 
server. Ostensibly, these forms w alike, 
but actually, the five names to be d were 
completely counterbalanced so that combi- 
nation occurred just once for each g that is, 
5!=120. In this manner it was sible to 
neutralize any context effect and i iste the 
influence of position per se on rank! error. 

RESULTS AND Discussion 
Both the skilled and unskilled judes signifi- 


cantly differentiated among the five individual 
members of the drill team (see Table 1). Al- 
though median selection values varied, both 
groups ranked Tom first and Bob last indicating 
some general agreement between groups despite 
differences in technical knowledge. Of greater 
interest, however, was the fact that significant 


TABLE 1 


MEDIAN AND FRIEDMAN Two-Way ANALYSES 
OF VARIANCE COMPARING THE RANKINGS OF 
Five CADETS BY UNSKILLED AND 
SKILLED JUDGES 


Cadets 
Group a ee 
Bob | Dan 


2.01 | 3.50 | 4 
1.69 | 3.33 | 4 


Ron 


2.61 
3,33 


Tom 


120 | 3.70 
120 | 3.83 


Jim 


3.04 
2.87 


Unskilled 
Skilled 


* <.001, 
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occurred for the unskilled 
he skilled judges, suggesting 
is more likely to crop up 
k information relating to the 
lable 2). The practical impli- 
ing seems clear: lack of ex- 
id to random ranking error 
s which induces subjects to 
position over another, thus 
ng certain individuals for 
o the purpose of the evalua- 
iment the bias seemed to be 


TABLE 2 


RIEDMAN Two-Way ANALYSES 
OMPARING THE RANKINGS OF 
‘OSITION BY UNSKILLED 
> SKILLED JUDGES 


Serial position 
E ae 
2 eee 

5 | 3.40 | 2.79 | 2.95 | 2.67 | 4 | 9.88" 
2,63 | 3.10 | 2.89 | 3.08 1.59 


concentrated on the first 
received a median 
error pattem confirms declining 
linear model advocated nor th 

terminal error model proposed i 
Hoover; yet, it is apparent that position error 
has been demonstrated in a real-life situ tion, 
that is, the evaluation of live human beings o 

a meaningful task, Considering the extensive 
of ranking procedures, particularly in ind 4 
and in the military, it seems that further efforts 
should be bent toward defining the parameters 
of position error and developing remedial and/or 
compensatory procedures, K 
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BRIGHTNESS CONTRAST EFFECTS ON RECALL OF PROJECT! 
HIGHWAY SIGN-TYPE STIMULUS MATERIAL 


ROBERT W. SWEZEY * 


Applied Science Associates, Reston, Virginia 


Stimulus-surround brightness contrast of projected highway sign-type stimuli 
was varied by manipulating photographic exposure time. Stimulus materials 
comprising six contrast levels of black on white (e.g., 97%, 92%, 90%, 85%, 
67%, and 34%) were developed and reproduced on 35-millimeter slides. These 
materials were presented at two different stimulus-exposure-time conditions 
on each of two separate recall tasks. It was concluded that recall of pro- 
jected materials is not seriously degraded as contrast decreases from 97% to 
67%. Below the 67% contrast level, a significant degradation in recall was 


observed. 


Effects of brightness contrast variations in 
character legibility have been widely investigated 
(Cornog & Rose, 1967; Tinker, 1963). Most 
Studies have measured variables such as, visibil- 
ity, apprehension, reading speed, and blink rate. 
These variables have historically been employed 
as criterion measures in legibility research 
(Tinker, 1963). Only a relatively small number 
of studies have employed recall and/or compre- 
hension as legibility criteria. Research that in- 
cludes these types of variables as criteria is gen- 
erally lacking in the early literature. Recently, 
however a number of studies by Poulton ( 1959, 
1965, 1969a, 1969b, 1972) and his associates 
have employed such variables as criteria. These 
studies have been concerned with variables such 
as printing types and formats (Poulton, 1959), 
letter differentiation (1965), type size (1969a, 
1972), style and spacing (1972), etc. Poulton 
(1969b) studied brightness contrast effects on 
ability to skim lists of food ingredients. 

The purpose of the present study was to inves- 
tigate stimulus- surround brightness contrast on 
recall of projected highway sign-type stimuli, 
Stimulus-surround brightness contrast was varied 
across six levels on two Separate tasks, and the 
effects of these variations on recall were assessed, 


This study is based on a dissertation completed 
in partial fulfillment of the requirements for the 
doctoral degree at the University of Maryland, 1972. 
A more detailed description of the study is pre- 
sented in that dissertation (Swezey, 1972). 

The helpful comments of Irwin L. Goldstein, 
Nancy S. Anderson, C. J. Bartlett, and of John D. 
Folley, Jr. are gratefully acknowledged. 

Requests for reprints should be addressed to 
R. W. Swezey, Applied Sciences Associates, Reston 
[nternational Center, Reston, Virginia 22090. 


METHOD 

Subjects 

The subjects were 120 students led in 
driver education courses at a sub: high 
school. Of these, 111 were in the junic ss and 
nine were in the senior class. Sixty-s bjects 
were female and 54 were male. All su s had 
visual acuity of 20/30 or better cor: d, ac- 


cording to the Snellen chart. 


Phases 

The study was conducted in two es, In 
Phase 1, five simulated highway sign stimuli were 
designed and reproduced on 35-millime\-r slides 
at each of six contrast levels (e.g., approximately 


97%, 92%, 90%, 85%, 67%, and 34° 
cases the surround was white, and t! 
were one of six levels of gray. Recall 
sured according to a scoring system whic 
points for correctly recording directions, loca- 
tions, and other information items on the slide. 
An independent group design was employed, in 
which each group of subjects viewed each pro- 
jected stimulus for either six seconds or 12 
seconds. The stimulus exposure was followed by 
a 36-second off time during which subjects noted 
on an answer form all information items that 
they could recall from the previous stimulus 
exposure. Phase 1 thus consisted of six groups 
of five subjects for each of the two exposure 
time conditions, Each group saw five stimulus 
slides all of which were at the same contrast 
level. h 

In Phase 2, one stimulus slide consisting of 
information on point assessments for driving 
violations in the State of Maryland was 1 
duced at each of the six contrast levels. Recal 
was measured by a 15-point multiple-choice test 
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SHORT Notes 


ally for this purpose. Test— 
this measure was estimated 


constructed 


retest reliat 

at 88. The cts were instructed to read 
through the ; ed stimulus one time only for 
one conditic to read for as long as was 
necessary fo n to understand the contents 
for the seco ndition, In Phase 2, six inde- 
pendent gro five subjects for each of the 
two exposur lions viewed a single stimulus 
slide at one le six contrast levels. Reading 
time in eac osure condition was measured 
via a stopw: he subjects’ reading compre- 
hension levi also measured using the Edu- 
cational T Service Cooperative English 
Test on R Comprehension. This score 
served as iriate in the covariance analy- 
sis experin lesign, 

The coni: vels were produced by manipu- 
lating phot ic exposure time. All photo- 
graphs were osed at a lens setting of f/32 
and develo ‘or one minute. Six exposure 
times were ed, 4 seconds, 7 seconds, 8.5 
Seconds, 1; ids, 17.5 seconds, and 45 sec- 
onds. Thi: s effectively manipulated the 
brightness > stimulus without affecting the 
brightness surround. Thus, six levels of 


brightness contrast were ob- 
ies of photographic prints were 


manufacture The prints were then copied onto 
$S-millimete- ide film. The resulting stimuli 
Were then ched with Munsell percent re- 
flectances, eng a photovolt photometer. The 


Scale used f natching was the Munsell neutral 
value 32- natte scale. Once the brightness 
level of each stimulus was matched with a known 

unsell y the percentage of contrasts be- 


tween stimulus and surround brightness were 
calculated using the Luckiesh (1937) formula. 
Letters ranged in size from approximately 3/16 
inch to approximately 5/16 inch tall when pro- 
Jected. Distance from the eye to the midpoint 
of the stimulus display on the screen ranged 
from 76 inches to 83 inches, depending on seating 
Position in the room. 


RESULTS 


For Phase 1 data (simulated highway sign 
ag a 6X 2 covariance analysis was calcu- 
ated, Figure 1 shows these data. The indepen- 
= Variables were six contrast levels and two 

'mulus exposure times (six seconds and 12 
pos) The dependent variable was recall of 

e simulated highway signs (points correctly 


Bed Out of a possible total of 80 points per 
es Slide). The covariate was Educational 
es Service Total Reading Comprehension 
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Ficure 1. Highway sign phase results: Mean recall 
scores adjusted by covariate. 


Results indicated a significant exposure time 
effect (F = 7.70, df = 1/47, p <.01) and a sig- 
nificant contrast effect (F= 7.50, dj = 5/47, 
p < .001) with the Exposure Time X Contrast 
interaction being nonsignificant. The covariance 
analysis was conducted on mean recall scores 
across contrast and exposure time conditions. 
Scheffé’s post hoc multiple-comparison tests were 
calculated on all two-way comparisons. These 
statistics indicated that the 34% contrast condi- 
tion produced the least recall and differed sig- 
nificantly from the 67% condition (p < .10) and 
from the 85% condition ( < .10) for the six- 
second stimulus exposure. For the 12-second 
stimulus exposure, the 34% contrast condition 
differed significantly from the 67% condition 
( .01) and from the 90%, 92%, and 97% 
conditions (p<.10). No other two-way com- 
parisons were found significant. Since the 
Scheffé test is an extremely conservative test, it 
was felt that the .10 level could be accepted as 
an appropriate indication of statistical signifi- 
cance (Edwards, 1960). 

Omega-squared statistics indicated that 34.11% 
of the total variance could be accounted for by 
the contrast main effect and 7.3% of the total 
variance by the exposure time main effect. Trend 
analyses yielded significant linear and quadratic 
effects (F= 22.95, p<.001; F=15.12, p< 
.001), respectively. 

In Phase 2 (the Maryland Point System 
phase), a 6 X 2 covariance analysis indicated a 
significant stimulus exposure effect (F = 9.88, 
df = 1/74, p <.01), a significant contrast effect 


seu A 


2 


Fane 2 9 Prees 


Content Lovet (1) 


Power 2. Point system phase results: Mean recall 
y scores adjusted by covariate. 


(F = 4.68, dj = 5/47, p < .01), and a significant 
Stimulus Exposure X Contrast interaction (F = 
842, dj = 5/47, p< .001). The independent 

variables were two stimulus exposure conditions 


levels; the dependent variable was the score on 
s the Maryland Point System test; and the co- 
7 N was = Educational Testing Service 
k Reading on score. Fi 

4 eee 3 igure 2 
Scheffé test results indicated no significant two- 
_ way comparisons for the one-time through stimu- 
lus exposure condition. For the unlimited time 
stimulus exposure condition, the 34% contrast 
condition again produced the least recall and dif- 
_ fered significantly from the 90% condition (p 
<.05) and from the 85%, 92%, and 97% condi- 
tions (p < .10). No other two-way comparisons 
were significant., Omega-squared statistics indi- 
cated that 14.91% of the total variance could 
~ be accounted for by the contrast effect. Simi- 
larly 7.2% of the total variance was accounted 
for by the stimulus exposure effect, and 2.77% 
by the Stimulus Exposure X Contrast interaction. 
Trend analyses for the point system phase indi- 
cated a significant linear trend (F= 24.71, p 
<.001), The quadratic trend was nonsignificant. 


DISCUSSION 


Several summarizing concepts can be gained 
from the general contrast literature. First, it is 
fair to state that in the majority of achromatic 


imna ee ee ea a a a id —" 


Snort Nores 


contrast direction studies, black st super- 
imposed upon white backgrounds erally 4 
found to be more legibile than the e ( 
Tinker, 1963). Second, a general yn has 
been that as brightness contrast betw stimu- 
lus and its surround decreases, the c nding 
legibility also decreases in a monot: ishion, 
Third, dark stimuli on light surroun to be 
preferred to light on dark regardless e and 


chroma. Fourth, in the case of bla nt on 


colored paper, minor variations in | re of 
minimal importance as long as the c< ratio 
remains high. 

Results of the present study ind that 
recall of projected alphanumeric ma is not 
seriously degraded by stimuli which oy less 
than maximum stimulus-surround brigh s con- 
trast. For this aspect of legibility, | ‘trast 
materials produced no better recall t 1 con- 
siderably lower contrast materials. Th sults — 
occurred over two types of recall The 
range of contrast conditions under w! recall 
scores are not affected appears to ex from 
approximately 97% to approximately N 

REFERENCES 
Cornoc, D. V., & Rose, E. C. Legibility Ipha- 
numeric characters and other symbols: refer- | 
ence handbook. (National Bureau of dards 

Miscellaneous Publication No. 262-2) W: agton, 

D.C.: U.S. Government Printing Office, 7. 
Eowaros, A. L. Experimental design in psy: logical 

research. (Rev. ed.) New York: Holt, Fin: hart L 

Winston, 1960. 


Lucknsn, M. The perfect reading page. tec 
Engineering, 1937, 56, 779-781. r 
Poutton, E. C. Effects of printing types anc formats 


on the comprehension of scientific journals. 
Nature, 1959, 184, 1824-1825. 
Poutton, E. C. Letter differentiation and rate of 


comprehension in reading. Journal of Applied 
Psychology, 1965, 49, 358-362. 

Poutron, E. C. Skimming lists of food ingredients 
in different sizes. Journal of Applied Psychology 
1969, 53, 55-58. (a) 3 

Pourtox, E. C. Skimming lists of food ingredients 
printed in different brightness contrasts. Journal 
oj Applied Psychology, 1969, 53, 493-500. (b) | 

Poutton, E. C. Size, style, and vertical spacing in 
the legibility of small typefaces. Journal of AP- 
plied Psychology, 1972, 56, 156-161. 

Swezey, R. W. An investigation of stimulus- 
surround brightness contrast: Effects on recall of 
projected material. Unpublished doctoral disserta- | 
tion, University of Maryland, 1972. 

TNKk ER, M. A. Legibility of print. Ames: 
State University Press, 1963. 


(Received May 10, 1973) 


Towa 


n r a> 


JC NAL OF APPLIED PSYCHOLOGY _ 


Copyright © 1974 by the American Psychological Association, Inc. 


E ee 
Augus! 974 Vol. 59, No. 4 
<td 


ARTIC 
Ti tionship Between Managerial Values and Managerial Success in the United 
5, Japan, India, and Australia George W. England and Raymond Lee 411 
Inf of Dogmatism, Risk-taking Propensity, and Intelligence on Decision-making 
‘egies for a Sample of Industrial Managers 
Ronald N. Taylor and Marvin D. Dunnette 420 
Quat of Decisions: Individuals versus Real and Synthetic Groups 
Bertram Schoner, G, C. Hoyt, and Gerald L, Rose 424 
B ming by Individuals, Coacting and Interacting Groups Warren R. Street 433 
Cu ‘vation and Expert Judgments: A Comparison of Independent Auditors with 
er Judges Robert H. Ashton 437 
Re ‘he Rater in Performance Appraisal Richard J. Klimoski and Manuel London 445 
R Expectations as an Aid in Reducing Voluntary Resignations 
Daniel R. Igen and William Seely 482 
P ty and Utility Assumptions Underlying Use of the Strong Vocational Interest 
ik Frank L. Schmidt 456 
Cc er Dissatisfaction as a Function of Dissatisfaction with Interpersonal Relation- 
ups Gerhard W. H. Scher] 465 
Ses. Belts: Relations Between Beliefs, Attitude, and Use 
Gunilla Fhanér and Monica Hane 472 
Effects ef Caffeine on Alertness in Simulated Automobile Driving 


dmund G. Regina, Gene M. Smith, Charles G. Keiper, and Robert E. McKelvey 483 
Time Delay and Similarity Effects in Facial Recognition ; 
Kenneth R. Laughery, Paul K. Fessler, David R. Lenorovits, 
and Darryl A. Yoblick 490 


SHORT NOTES 


Relationship of Role Conflict and Role Ambiguity to Job Involvement Measures 
W. Clay Hamner and Henry L. Tosi 497 


Stress and Performance During and After Officer Candidate School 
J. Richard Jennings, R. M. Rose, and Leo Kreuz $00 


A Performance Correlate of Personal Control Beliefs in an Organizational Context 
W. J. Heisler 504 


Comment on Locke's Satisfiers and Dissatisfiers among White-Collar and Blue-Collar 
Employees Robert G. Hadley 507 
Achievement Motivation among Managers of Small Businesses, Corporation Managers, 
and Business Students John C. South 509 


Effects of Applicant’s Sex and Difficulty of Job on Evaluations of Candidates for 
Benson Rosen and Thomas H. Jerdee 511 


— * 


Managerial Positions 


= 7 ! 1 
1 No $ í | (Continued on next page) 
Diary x \\ | 4 \ 
A ` i 


Date * 


to Ga 1.02 er E 


N). 


J 


1 


4 


Oe SO” ll. ll a a 


Spin-Offs from Behavioral Expectation Scale Procedures Milton R. Blo 513 


A Utility Analysis of a Weighted Application Blank Designed to Predict Turnover : 
Clerical Employees Raymond Lee and Jerome M. Boo 516 


Prediction of Scuba Training Performance Robert J. Biersner and David H. Rym 519 
Predicting Alumni Financial Donation Warren S. Blumenfeld and Patricia L. Sartci 522 
Effects of Spectral Differences in Illumination on Fatigue 


James B. Maas, Jill K. Jayson, and Douglas A. Reibe 524 
Effect of Ambient Illumination on Noise Level of Groups 
Mark Sanders, Jerry Gustanski, and Mike Law: 27 
Visual Dependency in the Erect and Supine Positions 
Jacob H. Lichtenstein and Rayjord T. Sauc: 29 
Effects of Numerical Data Upon Reading Time and Recall of Scientific Writing 
William W 32 
Type of Transmittal Letter and Questionnaire Color as Two Variables Influencis: 
Response Rates in a Mail Survey Michael T. Mattes: 35 
LIST OF MANUSCRIPTS ACCEPTED 5 28 
SPECIAL ANNOUNCEMENT 15 
ERRATUM 534 


INSTRUCTIONS TO AUTHORS 


Articles must possess: (a) conciseness: i i 
Fact : (a ess; (b) unambiguous and simple vocabulary; (c on- 
e . style in tables, terminology, and references; and (d) on- 
8 peat related to the evidence presented. Authors must progress from a te- 
purpose, through analysis of evidence, to conclusions and implications 


Criteria for evaluation include: (a) signi ibuti 
3 significance of contribution, (b) technical adequacy, (c) 
appropriateness for the Journal of Applied Psychology, and (d) clarity of S 2 


See the Publication Manual of the American Ps 


f 7 h ychological Association (1967 Revision), $1.50. 
for the format of manuscripts. Articles not prepared in this manner cannot be reviewed. Note 


ections on typing (p. 48, all copy must be double-spaced), tables, figures, and relerences. The 
vi Sha = a be typed on the upper left-hand corner of each page, except for 
names of groups, 2 Ne for example, journal titles in reference lists, the word “subject,” 
r manuscript copy must include an abstract of 100-120 words typed on 
Optional blind review must be 
author's name and affiliation mus! 
of the manuscript. Footnotes co 
Separate pages. 


specifically requested when a manuscript is first submitted. The 
t appear only on a separate title page, included with each copy 
ntaining the identity of the author or his affiliation must be on 


Check re i ; 
dA (One pane of the Journal of Applied Psychology for approximate length of regular 
of special erinan eee equals three double-spaced manuscript pages.) A few longer articles 
such as those 0 e printed as monographs. Short Notes feature brief reports on studies 
g some methodological contribution or important replication. 


The origi i 
e original drawing of a figure or an 8 x 10 inch glossy print must be submitted with the 


original copy of th i j . i 
711055 55 10 1 ce lettereld ae Duplicate copies may be photographic or pencil drawn. Figures 


Supplementary materials formerly deposited with the National Auxiliary Publications Service 


need no longer be submitted wi i i i 
5 years after publication of the een, Manuscript. Retain supporting and raw data for at least 


Journal of Ap 


following asserti 


. 


É 


ons and their implications: 


Personal value systems influence a 


manager’s perception of situations and 
problems he faces. 
Personal value systems influence a 
manager’s decisions and solutions to 
Problems, 

Personal value systems influence the 
Way in which a manager looks at other 
Individuals and groups of individuals; 


thus they influence interpersonal rela- 
tionships. 


R. 


equests 
Eng for re 


l prints should be sent to George W. 
and, Industrial Relations Center, 537 Business 


i Anistration Building, University of Minnesota, 
Mineapolis, Minnesota 55455. 


411 


1974, Vol. 5 419 
THE \TIONSHIP BETWEEN MANAGERIAL VALUES AND 
AGERIAL SUCCESS IN THE UNITED STATES, 
JAPAN, INDIA, AND AUSTRALIA 
GEORGE W. ENGLAND? Anp RAYMOND LEE 
University of Minnesota 
clationships between managerial values and managerial success were 
stigated for a diverse sample of American managers (u = 878), Australian 
ers (n= 301), Indian managers (n = 500), and Japanese managers 
12). Cross-validated results showed that values patterns were signifi- 
y predictive of managerial success and could be used as a basis for selec- 
and placement decisions, Results also indicated that managers from the 
countries were rather similar in terms of the personal values that were 
d to success, More successful managers have pragmatic, dynamic, and 
evement-oriented values, while less successful managers have more static 
bassive values. Arguments for and against using value assessment results 
lection and placement decisions are discussed. 

In recen considerable effort has been 4. Personal value systems influence the 
expended empts to predict managerial perception of individual and organiza- 
effectivene using maximum performance tional success as well as their achieve- 
measures: ical behavior measures. Most ment. 
of this ei nas focused on tests of (a) 5. Personal value systems set the limits 
intellectua ities, (6) spatial and mechan- for the determination of what is and 
ical aptitu (c) perceptual accuracy, (d) what is not ethical behavior by a mana- 
motor abi) and (e) personality and in- ger. 4 
terest. Ace Dg to Ghiselli’s (1966) review, 6. Personal value systems influence the 
each meas shows average rs in the range extent to which a manager accepts or 
of .25 to \nother area of potential im- resists organizational pressures and 
Portance receiving less emphasis, has goals, 
been the notie of predicting managerial suc- 7. Some personal value systems may con- 
cess from mi sers’ personal values. The sig- tribute to managerial performance, some 
nificance of in vestigating the value systems may be irrelevant, and some may be 
Of managers seen when one considers the antithetical to achievement efforts. 


It seems reasonable, therefore, to expect 
that personal value systems will influence not 
only the behavior of managers but their suc- 
cess as managers. Despite the apparent logic 
of the value-performance relationship, little 
empirical research has related values and per- 
formance of managers. The present study 
investigates the relationship between values 
of managers and their success as managers in 
four countries. The major focus of the re- 
search is comparative in nature and centers 
on answering the following questions: (a) 
What is the nature of the relationship be- 
tween managerial values and managerial suc- 
cess? (b) How similar or different are the 
relationships between values and success 


managers in four different countries? 
<) What are the distinguishing character- 


Managers and relatively less successful mana- 
5 and what do these differences imply 
behaviorally? (d) What type of causal rela- 
tionship is suggested? Are managers differ- 
entially successful because of their values or 
are their value systems different because they 
have achieved different levels of success? 


Meton 


tralia, 
major directories of corporation executives and di- 
rectors. The national samples were developed to be 
toughly comparable and were selected on the basis 
of three stratifying variables: (a) size of organiza- 
in terms of number of employees, (b) level 
of the manager in the organization, and (c) organi- 
function of the manager. While detailed 
sampling plans and data collection procedures have 
been presented elsewhere: the United States (Eng- 
land, 1967), Japan (England & Koike 1970), In- 
dia (England, Agarwal, & Dhingra, 1973), and 
Australia (England, 1972), it should be noted that 


tries were as follows: the United States (31%), 
Japan (26%), India (72%), and Australia (30%). 
Each national sample of managers represents a 
relatively diverse group in terms of organizational 
variables (type of company, size of firm, department 
or function, organizational level, and line- staff posi- 
tion) and in terms of personal variables (years of 
managerial experience, age, and formal education). 


Value Measurement 
__A framework was developed by England (1967) 


and stable in 
nature; they are also more general and less tied to 


case with many 


Several major classes of overlapping values are 
recognized in the framework. All Possible values 
that might be held by an individual or by a spe- 
cific group constitute the total value space and are 
known as potential values. The potential values are 
made up of two classes of values: (a) nonrelevant 
or weak values for a specific group or individual 
(those values which would have little or no impact 
on behavior) and (b) conceived values (those 


rr 
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which are likely to be translated fro: inten- 
tional state into behavior). Conceive: nes are 
made up of (a) operative values (thos h have 
a relatively high probability of bein inslated 
from the intentional state into actual t or, (b) 
Intended values (those which are view: impor- 
tant but may have only a moderate p lity of 
being translated from the intentiona? into 
behavior because of situational factor d (e) 
adopted values (those which are less a of the 
personality structure of the individual affect 
behavior largely because of situational i ` 
The development of the Personal \ Ques- 
tionnaire was based on the rationale th mean- 
ings attached by an individual to a care cified 
set of concepts provides a useful descr of his 


personal value system, which may in be re- 


lated to his behavior in systematic This 
attempt to describe an individual's va! rough 
the use of a carefully specified set of < s was 
influenced by the work of Osgood, Suc Tan- 
nenbaum (1957) and represents an on of 
their methodology. 4 

In order to specify a set of concept nt to 
the personal value systems of manage ol of 
200 concepts was selected from liter: ealing 
with organizations and with individua! group 
behavior, In addition, ideological and phical 
Concepts were included to represent > belief 
systems. A panel of expert judges reduc is pool 
to a set of 96 concepts, which was furt luced 
to 66 concepts based on pilot studies. rovide 
a framework within which respondents con- 
veniently evaluate each concept, the concen‘s were 


categorized into five classes: goals of business or- 
ganizations, personal goals of individuals, cups of 
People and institutions, ideas associated wilh peo- 
ple, and ideas about general topics. 

In the Personal Values Questionnaire, four scales 
are used to represent two modes of valuation. Since 
the general value of an object or idea to an indi- 
vidual is thought to be largely a function of its 
degree of importance to him, the primary or power 
mode of valuation used is the importance scale which 
consists of three points—high, average, and low. 
Three secondary modes of valuation were also de- 
veloped from the literature. The pragmatic mode of 
valuation suggests that an individual has an 
evaluative framework that is primarily guided by 
Success-failure considerations: Will a certain course 
of action work or not? How successful or unsuc- 
cessful is it apt to be? The pragmatic mode of 
valuation runs throughout much of the literature 
dealing with managers and finds support in various 
analyses of the Allport-Vernon-Lindzey (1960), 
Study of Values which suggest a major dimension 
of values as being “pragmatic and utilitarian” or the 
finding of a major value factor being “idealism 
versus practicality.” 

The ethical-moral mode of valuation implies an 


evaluative framework consisting of ethical considera- . 


tions influencing behavior toward actions and deci- 
sions that are judged to be “right” and away from 


to be “wrong.” The existence of a 

hical orientation is at the very heart of 

x fous belief systems and is supported by 
lyss of the Study of Values which find value 

or factors such as “social and altruistic” 
* 


affect or feeling mode of valuation suggests 
tive framework which is guided by hedon- 
One behaves in ways that increase pleasure and 
pain. The affective component of values has 
tensive philosophical and psychological back- 
d and seemed important to include as an 
ion in the present studies. 
the Personal Valucs Questionnaire, the prag- 
mode of valuation is represented by a Suc- 
scale; the ethical-moral mode of valuation is 
lined through a Right scale; and the affect or 
g mode of valuation is measured through use 
Pleasant scale. 
combination of primary and secondary modes 
uation was used to operationally define the 
MF conceptual classes of values. Operative values 
€ an individual include those concepts which are 
by him as high in importance and which fit 
hal orientation (pragmatic, ethical-moral or 
e). Intended value: include those concepts 
high in importance Dut which do not fit his 
lal orientation, adopted values include those 
mcepts not rated high in importance but which fit 
$ Personal orientation. Weak values include those 
encepts which are neither rated as highly impor- 
t nor fit the personal orientation of the indi- 


n the aforementioned definitional categories 
Values, a value score was prepared for each of 
66 concepts on the basis of the responses of each 
by assigning weights as follows: 4 for the 
values, 3 for the intended values, 2 for the 
values, and 1 for weak values. 


ires of Success 


major problem in dealing with this topic 
the definition and determination of suc- 
measurement. What is managerial success and 
do we measure it? 

ng and measuring manager success in a 
which samples widely across geographical 


INTERNATIONAL MANAGERIAL VALUES AND MANAGERIAL Success 


TABLE 1 
SALARY Comparisons oF Most AND LEAST Successrut MANAGERS 


aP * x 


areas, industries, o and of 

is considerably oer ae ricted from rt which 

be possible if one were studying only managers 

from one organization. 2 

It was our judgment that the heterogeneous na- 
ture of our sample dictated that we define and 
measure success in terms of objective data that 
were relatively easy to collect. Success was thus 
defined as managerial pay relative to age. This 
Success criterion is a relative measure of personal 
Success within the total sample of managers in cach — 
country and is not necessarily identical to organiza- 
tional success or managerial effectiveness, It does, 
however, provide a cross-organizational measure 
which is sufficiently accurate for our purposes. 

Our procedure for developing a measure of suc 
cess was as follows: (a) Separate managers in euch 
country into eight age categories—20-29, 30-34, 
35-39, 40-44, 45-49, 50-54, 55-59, 60 and over. 
(b) Convert the distribution of income received by 
managers in each age category into a standard score 
with a mean of zero and a standard deviation of 1. 

Thus, a manager in any age category who re- 
ceived a high salary compared to others in his age 
category would receive a high relative success score 
(approaching 3,0), while a manager whose salary 
was low, relative to others in his age group, would 
receive a low relative success score (approaching 
—3.0). Each manager, then, had a personal success 
index between —3.0 and +3.0 depending upon his 
relative salary position within his age group. 

While the success index is useful in relating rela- 
tive success to value patterns, it does not indicate 
the extent of success differences in the sample in 
any meaningful way. Table 1 presents the average 
salary and the average salary per year of age for 
the top and bottom 10% of managers in terms of 
their personal success score. 

As shown in Table 1, the managers in the top 
10% of our personal success index earn five to six 
times as much as do the managers in the bottom 
10% of our personal success index. Likewise, when 
one divides salary by age for each individual, the 
most successful group earns five to six times as 
much per year of age as does the least successful 
group. It is clear, then, that our personal success 
measure represents meaningful variability of con- 


United States Australia India Japan 
7 Yearly Yearly Yearly 
Manager Yearly 97 5 Yearly income income 85 income 
income income ane 
year per year per y 
(dollars) ef age | (dollars) | oF age of age. | (dollars) | POf age 
St successful 41,739 902 25,838 570 1,044 43,103 848 
Successful 8,045 176 5,500 117 360 7,500 138 
Ho of high to low 5251 47 409 5.5 5852 


44 
5 


ene We are dealing with differences 
_ that really make a difference. 


Data Analysis 


To determine the relationship between personal 
success and personal value systems for managers, 
tach of the 66 concepts in the Personal Values 
Questionnaire was examined to determine its degree 
of relationship with the success index by using the 
method of double cross-validation (Katzell, 1951). 
The procedure may be illustrated as follows: 


1. Randomly separate the managerial sample into 
two equal size samples for each of the four 
countries. 


TABLE 2 


SIGNIFICANT CORRELATIONS FOR CONCEPTS MEASURED 
BY PERSONAL VALUES QUESTIONNAIRE WITH 
MANAGERIAL SUCCESS INDEX 


: | Countries 
n 
Lr | a ee | javan 
1 — — — 
productivity 0 * Sy 1 5 
— al aas leadership | 0 —— 0 4 
Naß) —— | =- 0 + 
Tres maximiza ++ ++ ++ 0 
Social welfare 0 — — Ss ee 
Organizational growth o ++ 0 0 
Customers 0 ++ +4 0 
Craftsmen 7 ++ 0 0 
Managers an 5 as + + 
My subordinates +4 ee) 4 t 
Labors 0 + ate 5 
My com; + ++ = 0 
0 — 
Technical employees 0 Tyt 0 ++ 
Me + $+ — — 
labor unions ++ | ++ + $ 
Ability ++ ++ ++ + 
Trust — a = 0 
Aagressivences ++ ++ ++ 
Loyalty a 3 1792 
Prejudice + + ++ 
Cooperation es era E g 
Tolerance = we 0 
Conformity -- = — 0 
mor 
Leisure 2 = g S 
oe y — — 0 = 
evement 
Autonomy re $y 1585 g 
Individuality 0 o 0 
nfluencs 
Security. 2 lee oe E 
Ower 0 
Creativity: 5 4 Sipe g 
Success ++ + * 0 
Authority 0 —— 1 
E oe — — 0 i Pi 
aap a ++ + + 0 
Competition 
$ ompromise $ 8 + £ ae 
onflict 
penalty aus x TE ER 
roperty EF 
Religion aa eee ue 705 
Risk ++ — o ++ 


Note. ++ indicates a significant iti = 
_ Vote. ++ in s g positive correlation; — — 
indicates a significant negative correlation; + N posi- 
tive correlation; — indicates a negative correlation; 0 indi- 
cates no correlation. a 
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2. Within each subsample, correlate the suo 
score for all individuals with their value scores 
on each of the 66 concepts. 

3. The final scoring keys for cach country are 
developed by selecting those concepts which 
have the same direction of correlation in both 
samples and show at least a minimum core 
lation of .08 in one sample and at least a 
minimum correlation of .02 in the other sam- 
ple. Using these two criteria, the product of 
the two probability values for the occurrence 
of each concept by chance alone will be Jess 
than .05. 


The previously mentioned procedure steps were 
used to develop a special value key for each coun- 
try, which was not a function of chance differences 
but was stably related to personal success. Th 
significant correlation results between the con 
measured by the Personal Values Questionnaire 
the managerial success index are reported in 
2. It indicates that of the 66 possible pairs of 
lations in each of the four countries, 28 in the 
Australian sample, 21 in the Indian sample, 12 in 
the Japanese sample, and 22 in the United State: 
sample were found to be statistically significant. 
This analysis also shows that there are nine cone 
which have the same direction of correlation 
have at least one significant relationship across 
four countries. Since the probability of this o 
ring by chance is below .001, it seemed approp 
to develop a common key based on these nine 


country and the common key for the four countries 
had been developed, the correlations between th 
composite value scores of each key and the suca 
index were obtained to indicate the accuracy of 
composite value scores of each key for predi 
a manager's status on the success index. 


RESULTS AND Discussion 


Table 3 presents the overall correlation 
between the success score and the composite 
score of each value key for the managers in 
different countries and the total sample. 
correlation coefficients in Table 3 were ob- 
tained my correlating the success index with 
their five value scores (as derived from the 
American, Australian, Indian, Japanese, 4 
common value keys, respectively) for 
American, Australian, Indian, Japanese, 
total sample. There are three primary resul 
to be noted. First, tailor-made value ke 
developed for each country manifest the be 
results. As shown in Table 3, the relationshij 
between personal values and personal suc 
are .32 for American managers, .47 for Au 
tralian managers, 35 for Indian manage 


J 


CORRELAT EN VALUE Scores 


TABLE 3 


AND THE Success INDEX FoR MANAGERS IN DIFFERENT Counters — 
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NATIONAL MANAGERIAL VALUES AND MANAGERIAL Success 


Managers 
C No. — - --— — 
concepts" American | Australian Indian | Japanese Total 
(n = 878) | (m= 301) | (n= 500) | (m= 312) | (w= 1991) 
Special key | tem ee A) A TA Du 
analyses un | } 
sample 22 | A agen | SFT 23° 4 | 27 
Special ke item | | 
analyses ian | | | 
sample 28 | 18 17 | .26* 002 | 23° 
Special ke, item | | 
analyses ui } | 
sample | 21 .19* | .29° „35* 03 23° 
Å ; | 
Special ke n item | | | | 
analyse: inese | | l | 
sample 12 | 05 | 17* | 19° | 26* | 13° 
| | 
Common gthe Í } | | | 
samples tries } 9 | 23% F<} be | 23° | 15 24° 
“Naps 791 ich special key are selected by the method of double ERr i 3 
ad 26 ‘panese managers. These rela- individuals who will turn out to be successful. 
lonships ul statistically significant and We developed expectancy tables showing the 
5 of sin magnitude to the validity co- chances out of 100 that a manager with a 
eficients generally reported for predicting given value score would be among the top 
Manager success by other types of predictors. half of managers in terms of the personal 


We view these relationships as solid evidence 
that value patterns and success are related. 
Second, Table 3 indicates that when the 
Special value key developed on one country 
Is applied io another country, the validity 
Coefficient shrinks, in some cases to nonsig- 
poet values. This finding implies the 
eee of applying a validated instrument 
zom one group to another. Third, as shown 
ag last row of Table 3, the common key 
in S „yields significant, yet moderate rela- 
3 iPS across the four countries. This 
‘i cates that there may be some common 
ements underlying the relationship between 


value patterns and success across the different 


Countries, 

a Personal value systems of managers 

idly 8 stable and do not change rap- 

8 bel 8 & Oliver, 1972), and since values 

ae to Success, we have explored the 

81 uyot using values as a selection or 
motion device in attempting to select 


success index. (These tables are not shown 


because of space limitations.) An American 


manager (for example) who has a value score 


in the top quintile would have a 72% chance 
of being included among the top half of 
managers in terms of success. The expectancy 
of inclusion decreases to 63%, 47%, 41%, 
and 34% for decreasing quintile levels of 
value scores. 

From Table 3 and the expectancy tables, 
it is evident that value patterns predict suc- 
cess and could be used in making selection 
and placement decisions. However, contro- 
versy due to logical, ethical, and legal issues 
may arise. Persuasive arguments can be made 
that organizational vitality and adaptation to 
changing social and technological conditions 
may come about in large part because of 
the value mix in an organization. Selecting 
managers with similar value profiles might 
result in a static organization. While the full 
consequences of an individual organization 
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having managers with similar value profiles 
is not known, it should be noted that the 
median number of concepts in the four value 
keys is 21.5 which represents only one third 
of the total 66 concepts. Use of these value 
keys in selection and placement would still 
allow managerial values to differ in the re- 
maining two thirds of the concepts. 

A second set of questions about the use 
of the Personal Values Questionnaire in 
selection and placement decisions are ethical 
and legal in nature and center on “fair use” 
of such an instrument. Table 3 shows that 
when the special value keys developed from 
American, Australian, and Indian samples are 
applied to the Japanese sample and when the 
Japanese value key is applied to the Amer- 
ican sample, the validity coefficients shrink 
to nonsignificant values. These results are not 
surprising, however, since subjects from dif- 
ferent cultural backgrounds are expected to 
have different value systems which in turn 
will result in differential perceptions of mana- 
gerial success and performance. Although 
the results indicate that personal value sys- 
tems do relate to managerial success, they do 
not show equal validity for managers from 
different cultures. Thus, without knowing 
whether the key developed from the Amer- 
ican sample is equally valid for other 
American minority groups, we are uncertain 
about the fairness of application of the 
Personal Values Questionnaire to minority 
groups of American managers. 

Other researchers (Brunson, 1970; Kashefi, 
1970; Sikula, 1971) have shown that effec- 
tive and ineffective employees within various 
organizations have differing values and value 
systems. Our study also indicates that mana- 
gerial values have important and similar in- 
fluences on success under different cultural 
conditions. Managers from the four different 
countries are rather similar in terms of the 
relationship between personal values and 
success. While the nature of these similarities 
will be considered in the following section, 
the reader is asked to note in Table 3 that 
a common value key which is valid across the 
four different countries can be developed. Tt 
seems likely, therefore, that one could de- 
velop a special value key which is valid for 
different subgroups within a particular popu- 


| ig 


lation. Since the tailor-made val 
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ys de- 


veloped for specific countries n est the 
best results, we believe different e keys 
and different expectancy tables uld be 
developed for different subgroups or dif- 
ferent organizations to promote ness in 
employment decisions. Value dat: o must 
be integrated with other sources types 
of data in order to obtain »ptimal 
utilization of human resources. 
Nature of Value Differences be: More 
and Less Successful Managers 

Examination of the content of | ncepts 
which are related to success pr addi- 
tional meaning to the general que about 
relationships between values and 83. 

Looking at the four couniri mpara- 
tively, one notes that managers i: ve four 
countries are rather similar in t of the 
relationship between personal val nd suc- 
cess. The general pattern in Tab! dicates 
that more successful managers values 
seated in high productivity. maxi- 
mization, managers, my subordin ‘es, labor 
unions, ability, aggressiveness ejudice, 
achievement, creativity, success, . com- 
petition, and liberalism. Less succe man- 
agers have values which emphbssi=e social 
welfare, obedience, trust, conformity, leisure, 
dignity, security, conservatism, equality, 
and religion. Successful managers tend to 
emphasize pragmatic, dynamic, achievement- 
oriented values, while less succ man- 
agers prefer more static and pas values, 
the latter forming a framework descriptive 


of organizational stasis rather than organiza- 
tional and environmental flux. More success 
ful managers favor an achievement orienta- 
tion and prefer an active role in interaction 
with other individuals useful in achieving the 
managers’ organizational goals. They value 4 
dynamic environment and are willing to take 
risks to achieve organizationally valued goals. 
Relatively less successful managers have 
values associated with a static, protected en- 
vironment in which they take relatively pas- 
sive roles and often enjoy extended seniority 
and security in their organizational positions. 
In terms of the significant differences that 
exist among the four countries across the 
concepts, the most obvious one is that the 
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relationships en the success indexes and 
the concept liy” are significantly posi- 
tive for the nese managers but are sig- 
nificantly ne for the managers from the 
other thre atries. Another difference 
shown in is that the concept “me” 
is negativel ed to the success index for 
the Japane ‘agers but is positively asso- 
ciated with s indexes for the American, 
Australian idian managers. The behav- 
ioral patter 1 customs of the Japanese 
people ha deeply influenced by Con- 
fucianism stresses a rigid relationship 
within a chically arranged collective 
Society. rs of each collectivity are 
expected aintain absolute loyalty and 
obedience uthority in the fulfillment of 
their obli is. In view of this cultural 
backgrou: is not surprising that more 
successful ese managers place relatively 
greater e upon “loyalty” and rela- 
tively les: mphasis upon “me” than do 
managers other three countries. 
Althous ‘ynificant correlations between 
values and cess have been found, it seems 
Important x whether these correlations 
represent 4 erficial and faulty association 
or an intrinsic relationship. A systematic 


ween the results of our study 
2 findings in the literature provides 
evidences of the intrinsic validity of Personal 
Values Questionnaire for predicting mana- 
Serial success. A cluster of concepts including 
high productivity, profit maximization, abil- 
lty, achievement, and success might first be 
Considered. The positive correlations between 
these concepts and success imply that an 
achievement and success motive in the direc- 
Hon of corporate goals is associated with in- 
creasing managerial success. Patton (1965) 
citing his own research and surveys by others 
noted the relationship between pay and prof- 
itability. It might, therefore, be expected that 
creasing success (as measured in the pres- 
ent study) would be associated with values 
Pop hasizing high productivity and a profit 
maximization. Bray, Grant, and Katkovsky 
1 similarly cite the substantial correla- 

on: of career orientation and achievement 
Motivation with managerial salary progress 
18 corrected by initial starting salary). 

eClelland (1953, 1961) also points out that 
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entrepreneurs are primarily driven by the 
achievement motive. They are interested in 
productivity, profits, and success because it 
serves as a measure of their competence. 
McClelland suggests that achievment motiva- 
tion precedes individual success and economic 
growth. Jurgensen (1966) includes produc- 
tive, intelligent, and enterprising in his list 
of characteristics most descriptive of success- 
ful key executives. The importance of ability 
is implicit in Williams’ (1956) statement 
that critical executive behaviors include utili- 
zation of knowledge and services of subordi- 
nates, understanding the capabilities and 
limitations of his associates, and a major 
behavioral category concerned with technical 
competence. Executives are also willing, ac- 
cording to Williams, to work long hours to 
achieve their assigned objectives (work or 
achievement primacy rather than a leisure 
orientation). 

A second cluster of values from the present 
study that are consistent with other findings 
include change, aggressiveness, creativity, 
competition, liberalism, social welfare, obedi- 
ence, trust, conformity, leisure, dignity, equal- 
ity, conservatism, and religion. More success- 
ful managers’ values suggest that change, 
aggressiveness, creativity, competition, and 
liberalism are behaviorally relevant, while so- 
cial welfare, obedience, trust, conformity, 
leisure, dignity, equality, conservatism, and 
religion are less behaviorally relevant. The 
implication of more successful managers 
favoring a dynamic, relatively competitive 
and initiative environment versus less success- 
ful managers’ orientation toward a more 
secure, stable, and convergent environment 
finds considerable support in the literature. 
For example, Campbell et al. (1970) include 
competitive, creative, dedicated, and energetic 
in their list of personal qualities said to be 
necessary for managerial effectiveness. Jur- 
gensen (1966) also cites aggressive, self- 


starting, determined, energetic, and creative 


as most descriptive of successful executives 
and cites amiable, conforming, 
agreeable, conservative, kindly, mannerly, 
neat, cheerful, formal, courteous, and modest 
as least descriptive of successful executives. 
The previously cited association of achieve- 
ment motivation with managerial success 


reserved, 
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abo is relevant, as are the considerable 
number of studies noting the direct relation- 
ship of dominance and the inverse relation 
o dependency to managerial success (Bray & 
Grant, 1966; Bray et al., 1967; Hinrichs, 
1967; Wollowich & MacNamara, 1969). 
Other sources also support the relation- 
Ships found between the cluster of variables 
previously noted and managerial success 
(Dicken & Black, 1965; Laurent, 1962, 1970; 
Mahoney, Jerdee, & Nash, 1961; Mitchell & 
Porter, 1967; Porter & Ghiselli, 1957). 

A final cluster of concepts encompasses 
several related to “groups of people” including 
managers, my subordinates, and labor unions. 
Managers’ perceived behavioral relevance of 
these concepts increases with their actual suc- 
cess. These concepts are similar to some of 
those investigated by Rowland (1960), and 
the results obtained are consistent with the 
achievement orientation of successful man- 
agers noted previously, the group-oriented 
characteristic of successful managers cited by 
Campbell et al. (1970), Williams’ (1956) 
dimension of “Relations with Associates,” 
Mahoney’s ( 1966) “Performance-Support— 
Utilization,” and Dicken and Black’s (1965) 
“Ability to Cooperate.” 

We think the systematic comparisons pre- 
viously described suffice to show that the cor- 
elation between values and success is not a 
uperficial association of nonessential charac- 
eristics. There is a real and intrinsic rela- 
ionship between the level of success achieved 
managers and their personal values. The 
lature of our study (concurrent validation 
trategy) does not allow any clear interpreta- 
ion of the causal linkage between values and 
uccess. We do not know whether values in- 
luence success or whether success influences 
r changes values among managers. However, 
f we accept the Rokeach (1968) assumption 
hat values have strong motivational compo- 
ents as well as cognitive, affective, and 
ehavioral Components, it would seem that 
Managers are differentially successful because 
f their values. The fact that we derived our 
uccess measure for each age group and then 
sgregated for all age Categories argues 
gainst there being any strong cumulative 
fect of long-term successful experience in 
he relationship between success and values 
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that we found. Our succe 
correlated with age and tot: 7 
that a manager has worked. The value keys 
we developed are minimally related to age in 
the four countries (rs ranging from —.05 
91 


neasure is 
umber of 


SUMMARY 


This study examines the nature of the rela- 
tionship between the personal values of man- 
agers and their success as managers for 
samples of American, Australian; Indian, and 
Japanese managers. While additional and 
extended study and analysis of managers in 
other countries (especiallý the communist 
countries) undoubtedly will clarify the func- 
tional relationship between values and suc- 
cess, our research evidence suggests the 
following general findings and conclusions: 


1. There is a reasonably strong relation- 
ship between the level of success achieved by 
managers and their personal values. The 
magnitude of the relationships are indicated 
by the significant correlation coefficients of 
32 for American managers, .47 for Australian 
managers, .35 for Indian managers, and .26 
for Japanese managers. We view these cor- 
relations as real and intrinsic relationships 
rather than superficial and faulty associations. 

2. It is evident that value patterns are pre- 
dictive of managerial success and could be 
used in selection and placement decisions. 
A number of legal and ethical issues must 
be considered, however, before such use is 
justified. 

3. Although there are country differences 
in the relationships between values and suc- 
cess, the findings across the four countries are 
quite similar. 

4: The general pattern emerging from the 
study indicates that more successful man- 
agers appear to favor pragmatic, dynamic, 
achievement-oriented values, while less suc- 
cessful managers prefer more static and pas- 
sive values, the latter forming a framework 
descriptive of organization stasis rather than 
organizational and environmental flux. More 
successful managers favor an achievement 
orientation and prefer an active role in inter- 
action with other individuals instrumental to 
achievement of the managers organizational 


goals. Less sful managers have values 


associated static and protected en- 

vironment hich they take relatively 

passive role 
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The influence of psychological attributes of 
decision makers on the strategies they adopt 
in decision-making tasks has received consid- 
erable attention in the research literature 
(eg, Bourne, Ekstrand, & Dominowski, 
1971; Lanzetta & Kanareff, 1962; MacCrim- 
mon & Taylor, in press), Dogmatism has 
been singled out for intensive investigation, 
perhaps because of its central rcle in theories 
of decision making (e.g., Rokeach, 1960), and 

some tentative conclusions have been drawn 
to explain its impact on decision-making 
Strategies. Dogmatism generally has been 
bound to be associated with an approach to 
_ decision making characterized by rapid deci- 
“sions following limited information search 
and yet confidently holding to the resulting 
decisions (e.g., Long & Ziller, 1965; Pruitt, 
1961). 

A decision maker's risk-taking propensity 
and his intelligence probably also figure 
prominantly in his decision-making strate- 

gies, but the influence of these psychological 
attributes on such strategies has not been 
_ Systematically investigated. Under conditions 
of relatively costless information seeking and 
increased probability of attaining correct de- 
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INFLUENCE OF DOGMATISM, RISK-TAKING PROPEN* 
© INTELLIGENCE ON DECISION-MAKING STRATEGI] 
A SAMPLE OF INDUSTRIAL MANAGERS 


The relative influence of dogmatism, risk-taking propensity, and intelligenc 
on strategies adopted in a decision-making exercise was investigated for 
sample of industrial managers. Findings of prior studies, that dogmatism y 
associated with a decision-making strategy characterized by rapid and con 
dently held decisions following limited information search, were supporte: 
High propensity for risk was associated with an information-seeking strate 
involving rapid decisions made on the basis of little information, but involve 
į deliberate information processing, Intelligence was positively correlated wit! 

efficiency in processing information, accurate choices, and cautiousness in 
l changing decisions in the face of adverse consequences. 


MARVIN D. DUNNET 


University of Minne 


cisions as additional relevant iní ion is 


acquired, high-risk-taking propen would 
be expected to be associated wit! ision- 
making strategies stressing limite: orma- 
tion seeking and rapid decision The 
influence of the intelligence of decis ikers 
on his decision-making approaches gen- 
erated surprisingly little research; it -ically 
should influence the evaluative aspcts of 
decision making. The purpose of the present 


study was to investigate the impact of these 
three psychological attributes on the informa- 
tion-seeking and decision-making strategies 
used by a sample of industrial managers for 
making a job-related decision. 


METHOD 


Measures of dogmatism, risk-taking propensity, 
and intelligence were administered to a sample of 79 
male, line managers; ranging from foremen to fac- 
tory managers and representing eight units of a 
heavy manufacturing company. The subjects then 
participated in an individually administered deci- 
sion-making exercise. The research was conducted 
while the subjects were participating in a manage- 
ment development program at the corporate head- 
quarters, and data were obtained for all participants 
in three sessions of the program. 


Instruments 


Psychological attributes investigated were dogma- 
tism, measured by the Dogmatism scale (Rokeach, 
1960); risk-taking propensity, measured by the 
Choice Dilemmas Procedure (Kogan & Wallach, 
1964); and intelligence, measured by the Wesman 
Personnel Classification Test (Wesman, 1965). The 


420 


ing behaviors of amount of informa- 
sought, time required to reach a decision, infor- 
essing rate, decision accuracy, decision 
„and decision flexibility were measured by 
1 Decision Simulation developed by 
(1970). 
Decision Simulation. Information-seek- 
and decision-making strategies were measured 
the Personne! Decision Simulation. The Personnel 
Simulation presents the subject with a 
ed managerial decision requiring him to play 
tole of a business manager while making a 
option decision. The subject selects, on the basis 
ormation that an actual manager would have 
lable to him, one of three hypothetical salesmen 
promotion to sales manager. The Personnel De- 
n Simulation departs from experimental tasks 
ly used to investigate decision-making per- 
e by emphasizing systematic observation and 
€ measurement of processes involved at the 
stages of decision making. The decision 
sses measured by the Personnel Decision Simu- 
n are (a) information amount, the number of 
ormation items sought; (b) decision time, the 
required to reach a decision; (c) processing 
the rate at which information units are proc- 
(d) decision accuracy, correctness of decision 
(e) decision confidence, self-rated confidence 
e correctness of decisions; and (f) decision flexi- 
n the number of adverse consequences of a 
on which would cause the decision to be 
(high score indicates flexibility). Scores on 
5 Decision Simulation are based on two 
ion problems. The two problems were equiva- 
forms of the Personnel Decision Simulation 
tered with a one-week interval between forms 
d total scores were the mean scores for the two 
s. For any subscore (e.g., “time”) the correla- 
n between scores for the two decision problems 
calculated and entered in the Spearman-Brown 
Means, standard deviations, and reliabilities 
‘the variables used in this study are given in 


rmula 


TABLE 1 


AND STANDARD DEVIATIONS FOR PERSONNEL 
CISION SIMULATION SCORES AND PSYCHOLOGICAL 
_Atremutes ror 79 Mare Line MANAGERS 


Variable x sD Oe 
el Decision 
BA, ition scores 
mount of information | 138.04 57.26 98 
Processing rate 3.70 1.28 96 
25 b time 38.49 14.11 96 
eign en. +72 39 68 
lon confidence 3.08 81 78 
“cision flexibility 7.56 2.74 93 
ical Heap butes 
-Taking pr % 400% ae? 
opensity . 00 E 
ogmatism 19.08 | 27.65 


rrecients for two forms of the Personnel Decision Simula- 
corrected ß by Spearman-Brown formula. 
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TABLE 2 


CORRELATIONS BETWEEN PERSONNEL DECISION — 
SIMULATION SCORES AND 
PSYCHOLOGICAL ATTRIBUTES 


Risk- 
Personnel decision Dogma- | taking 
simulation scores tism | propen- 
sity 
Amount of information | —.16 — 37° 
Processing rate 18 —.21" 
Decision time ~—34"* | —.29°° 
Decision accuracy 23% | — 0t 
Decision confidence 2 11 
Decision flexibility —.09 —.15 
„ <05. 
of <0 
Analysis 


The bivariate correlation coefficients between dog- — 
matism, risk-taking propensity, intelligence, and the 
six scores from the Personnel Decision Simulation 
were examined to determine both the nature of the 

psychological traits and the relationships between 
these traits and performance in the 3 
exercise. 


RESULTS AND DISCUSSION 


Intercorrelations Among Decision. Ii aker 
Attributes 


The intercorrelations were insignificant be- 
tween risk-taking propensity and both intelli- 
gence (.09) and dogmatism (.08). The only 
significant relationship was between intelli- | 
gence and dogmatism (.29), with dogmatic 
subjects tending to be more intelligent. These 
relationships are so low it appears that only * 
a small part of the influence of dogmatism on 
decision-making performance found by prior 
studies could have been attributed to either 
the intelligence or the risk-taking propensi É 

of dogmatic decision makers. 3 


Correlations Between Psychological Attributes 
and Personnel Decision Simulation Scores 


The correlation coefficients between psycho- 
logical attributes and Personnel Decision 
Simulation scores are shown in Table 2. 

Dogmatism. Correlations between the Rok- 
each Dogmatism scale and the Personnel De- 
cision Simulation scores that were in line with 
findings from previous studies were the sig- 
nificant negative correlation between dogma- 


“fidenc: * Ziller, 1965). As could 
be expected from the correlations between the 


based on less information, but tend to process 

h item of information more slowly. This 
finding is at variance with a conceptualization 
Of risk taking as disregard for the informa- 
tion base in a willingness to be exposed to 
an uncertain decision environment. It sug- 
Rests that, although risk. prone decision mak- 
ers reach rapid decisions by the expedient of 
‘Testricting their information search, they give 
attention to the information they 


x Meaning and dimensionality of 
taking as a construct (eg., Wallach & 
1961; Slovic, 1962) have found the 
ons among instruments proported to 
sure risk taking to be 
ults led Slovic 


conceptualize 
tive processes 
n be considered premature. One purpose 
of the Present study was to further clarify the 
aterpretation of the Choice Dilemmas Pro- 
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cedure as a measure of risk 
both in the light of its 
dogmatism and intelligence 
on decision-making performa: 
Intelligence, Intelligence sh 
positive correlation with a ten 
information quickly and to m 
cisions. It was also found to 
sociated with decision flexibi! 
the major contribution of int 
cision-making performance is 


ropensity, 
hips with 
influence 


significant 
o process 
irate de- 
ively as- 
parently 
e to de- 
‘ficiency 


with which information was p: and ac- 
curacy in choosing a high-qua! ision al- 
ternative, Intelligence appears ve little 
influence on decision confider is im- 


portant in evaluating consequ: 
cision to determine which ones « 
serious to warrant changing th: 
each of these decision processes, 
of intelligence seemed to involve 
information, integrating it for ma! 
and judging the import of adi 
quences. Intelligence apparently 
fluence on the more idiosyncratic I 
predecisional information search str 
postdecisional confidence. Further ‘ies US 
ing measures of general mental ab „ and 
extending the investigation to include <pecific 
mental abilities, are needed to understand 
more fully the role of intelligence in decision 
making. 

Investigation of the psychological corre- 
lates of information-processing and decision- 
making performance appears to be a fertile 
area for future research into the cognitive 
and motivational processes involved in mana- l 
gerial decision making. Further investigations 
are needed to specify more completely the 
influence of decision-maker characteristics on 
performance in decision problems. This study 
indicates that the Personnel Decision Simu- 
lation is a potentially useful tool for achieving 
those ends. 


iciently 
ion. In 
fluence 
ion of 
choice 
conse- 
tle in- 
ses of 
ies and 
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mal 
tele. Both types of real groups outperformed individuals. Real groups wit 
he previous exposure to the problems made better decisions than did grou; 
whee members bad previously made individual decisions on the same tas! 


between decision quality and riskiness of decision alternatives varied in t 
three decision tasks. Real groups with no prior exposure to the task respond 
to the suggested risk norm more appropriately than did other decisio: 


making 

This article reports the results of an experi- Block, 1958; Tuckman & Lorge In 
r the effects of individual and alterna- a study by Campbell (1968) th: ty of 
modes of face-to-face group decision group solutions was clearly inie: the 
ung on decision quality. Identical eco- scores of even the average indi 3 AS 
decisions were made by individuals, by Campbell points out, however, “sv: fnd- 
groups who had first considered and reached ing is dependent upon the type oi blem 
sions on the problems individually, and used and the manner in which it w: -cored 

Y groups who had no prior exposure to the p. 209].” 
oble: In addition, the latter groups, Another branch of literature relevani :o the 
sequent to the group decisions, made present study has shown similar results that 
decisions, Thus this investigation of individual risk taking and, in particu- 
from the bulk of previous problem- lar, the so-called “risky shift” phenomenon. 
Studies principally in that two types Hereto, risk-taking propensity has been 
were employed, and the problems shown to be highly situation and task specific 
economic decisions under uncertainty. (Slovic, 1962), and especially the enhance- 


Much of the early work comparing group ment of risk-taking propensity under group 


performance on problem- conditions (the so-called “risky shift” effect) 


has been shown to be highly specific to the 


over individuals (Lorge, Fox, task involved. For a critical review of the 


Lorge & Solomon, “risky shift” literature, see the special sup- 


studies, however, tended plement of the December 1971 Journal of 


to yield mixed results in comparing perform- Personality and Social Psychology, in which 


indicating that under some conditions it is pointed out that the risky shift phe- 


© group setting inhibits performance, while nomenon is really specific to some of the 


& Blake, 1963: 


performance may be enhanced dilemmas of choice items originally used by 
Campbell, & Jaastad, 1963; Hall, Stoner (1961). However, for evidence that 


* ? 
Taylor, Berry, & the phenomenon operates, at least in part, 


—— for reprints should be sent to Be for simulated investment portfolio decisions, 


t of Economics See Deets and Hoyt (1970). s 
University, Burnaby, Despite such mixed results, these studies 
might well have interesting implications for 
44 
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—— dr iking organizations if the 

| plemomnens uon could be shown to 
held for « lecision problems. Over a 
decade ag » (1960) made a strong 
appeal for the implications of just 
wach beha nomena for the theory of 
decision m » economics, The present 
study con. ne attempt to meet this 
objective. 

Unlike n vious studies of individual 
aad group making that used non- 
feomomic « tasks, the present study 
relied upon ical decision theory to de- 
fae decisio y. Each subject and group 
made threx ess decisions. In each deci- 
thon cach tive was associated with 
dearly spe ossible events, a probability 
distributio: ‘hese events, and a payoff 
associated each event. The attitude 
toward ris! imposed in that for each 
problem ti ion maker (or group) was 
to make th ion in the context of a role 
in a firm. escription of the firm indi- 
cated whet not he was to be risk averse 
OF risk nev 

T The dec criteria below follow from 
statistical n theory: 

I. If one crnative has at least as high 
& payoff a ıt of a second alternative for 
every possibi- cvent, and if there exists at 
least one eu lor which the first alternative 
has a highe: payoff, the first alternative 


tes ancl is superior to the second. 

2. If the decision maker (or group) should 
be risk neutral, alternatives possessing higher 
pected values are superior to alternatives 
Possessing lower expected value. In the con- 
ext of utility theory, risk neutrality implies 
à linear utility function over the range of 
Payofis of the alternatives. 

3. If the decision maker or group should 
be risk averse (a) higher expected value is 
Superior to lower expected value, given equal 

(“variance” is used in the statistical 

“ese as a measure of dispersion of a random 

Variable); (4) lower variance is superior to 

variance, given equal expected value; 

(c) a higher expected value-lower-variance 

alternative is superior to a lower expected 
value-higher-variance alternative. 


Criterion 3 merits some discussion. By this 
“riterion, expected value and variance are the 


a 


E 8 


f 
F 
i 
i 


| 
f 


dratic over the range of interest, 


K 


90 subjects considered the same decisions but in the 
reverse (G-I) order. As before, the 90 G-I subjects 
were assigned to their five-man decision-making 
groups in a random manner. (In both group con- 
ditions no leaders were appointed. Rather, leader- 
ship was permitted to emerge.) 

The decision problems are described as follows. 


Problem 1 


The first decision problem is shown in the present 
_ Context, as the “Holigraph Corporation” problem, 

in the form in which it was presented to subjects 
making individual decisions. In order not to create 


Holigraph Corporation 


As the President of the Holigraph Corpora- 
tion, you must decide in which of four regions 
(coded below to maintain Secrecy from competi- 
tors) you will build a plant to produce a new 
Communications device. Table 1 shows the re- 
Sulting rate of return on investment for each 
combination of plant location and degree of 
market acceptance. 

Holigraph’s market research indicates that the 
probability of very high consumer acceptance 
is 10 percent, the probability of moderately high 
acceptance is 30 percent, the probability of very 
low acceptance is 20 percent, and the probability 
of low acceptance is 40 percent. 

Holigraph has historically been conservative 
in its investment policy, and you have no reason 
to believe that the Board of Directors has 

this orientation. 

In which location would you build? Briefly 
state your reasons for the decision you reached 
on the reverse side of this page. 

The application of the three decision tules (de- 
Previously) to the expected values and vari- 
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ances of the locations yields 


(a) ZAJ, (b) ZOJ, (c) ZEJ, « UJ. 

The “conservative investme y” of the 
board should imply, for two i ts of equal 
expected value, that the smaller v alternatives 
is superior. (See Criterion 3 previ nentioned.) 

Although it is an unsettled qu: s to which 
alternative is riskier if one has r expected 
value and the other a lower varia can assert 
that: (a) a dominated alternat skier; and 
(b) of two alternatives possessi: expected 
value, the higher variance alter: is riskier, 
(See our earlier discussion.) App): se criteria 
to the Holigraph problem yields ntical risk 
ranking as by focusing on proba of lowest 
outcome; that is, there is no ambi- assessing 
risk. The resulting ranking by r a) ZUJ 
(b) ZEJ, (c) ZOJ, and (d) ZAJ. 1 be noted 
that the risk ranking is in exactly t site order 
to the quality ranking. 

Problem 2 

The second task is shown in the context, 
as the “Zirolex I” problem, in the received 
by subjects. E 

Zirolex I 
As the Vice-President of Market r the 

General Chemical Products Corporat very 

large producer of industrial chemicals, must 

decide whether or not to produc last 
batch of Zirolex (a chemical compo. used 
in the production of paint). 

Last year General Chemical made $214 


million after taxes. Although the profit margin 
of Zirolex has been declining in recent years, 
last year sales of Zirolex produced a contribu- 
tion to profit of $80,000. l. 

The Zirolex producing equipment was origi- 
nally purchased for $300,000 ten years ago, but 
is now fully depreciated. The equipment could 
be sold for scrap at any time, netting the com- 
pany $10,000 after miscellaneous costs associated 
with the removal of the equipment. 5 

You know that one of your competitors is 


TABLE 1 
RATE or RETURN on INVESTMENT AND DERIVED MEASURES ror HOLIGRAPH Locations 


k Degree of market acceptance 
Plant i Risk 
- s Quality Expected b 15! 
| locations % very | % moder- % low % very rankbe value» | Variance rank*° 
3 high ately high low 
— 
ZEJ 15 11 9 
ba 8 3 10 4 2 
ji ZAJ 20 16 14 13 1 15 4 4 
_ wy 29 17 4 2 4 10 75 1 
201 22 16 15 10 2 15 10+ 3 
5 5 e used for plant locations. A 
Note that ZAJ and ZOJ dominate ZEJ. 
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developing ompound which will make 
Zirolex of n it is brought out. Your 
production rketing experts studied the 
problem ca and have reached a consensus 
that there fifty chance that this com- 
petitor will ut the compound within two 
months. 

If you produce one more batch of 
Zirolex, | scheduling difficulties with 
the pack irtment, the batch would 
have to b l next week. You are able to 
produce or ) batch sizes. If a large batch 
(4 tons) c produced, and if the com- 
petitor’s n ound were marketed within 
the next t hs, most of the batch would 
have to bx í at drastically reduced prices. 
This woul in a loss of $20,000 on the 
batch, On her hand, if the competitor's 
new produ c not brought out within the 
next two you would expect to make a 
profit of on the batch. 

If a sn h (2 tons) were to be pro- 
duced, anc competitor’s new product were 
to be intr: within the next two months, 
the expect would be $5,000 on sales of 
the batch competitor’s new product were 
not marke hin two months, you expect 
a contribu profits of $25,000 on sales of 
the batch. urse, you also have the option 
of produci: Zirolex at all next week. 

What w do? Why? 

An optima! -gy for a large company making 
numerous sı iecisions would be to maximize 
expected valuc ind be risk neutral (i.e, variance is 
irrelevant) on each decision, since the firm has the 
law of large numbers working for it. Thus, the 
Optimal alternative is to produce the large batch, 


at an expected value of $15,000, and the worst 
alternative is to produce nothing. 

outlined criteria for ranking alter- 
n pect to riskiness are inadequate for 
Zirolex I. Consider the following extension of the 
tisk criteria. For nondominated alternatives, that 
ternative possessing the highest variance is riskiest. 
Accepting this definition again insures that the vari- 
ance and probability of loss notions of risk yield 
identical rankings of alternatives in Zirolex I prob- 
lem. The large batch is riskiest followed in order by 
Small batch and no batch. 


Problem 3 


A third task is shown in the following display, 


55 the “Zirolex II“ problem, in the form received 
Y subjects, 


Zirolex II 


Pat this morning, you have received a tele- 
a ne call from John Ransome, the President 
à reputable research firm. Ransome told you 
88 Sur competitor has recently marketed a 
quantity of the new compound through 


a wholesaler in order to have the compound 
tested. Ransome's research firm has obtained a 
pound of the substance. Ransome offered to 
test the compound, and tell you positively 
within 24 hours, whether or not product de- 
velopment has proceeded to the point where the 
compound will be marketed within two months. 
You know that your competitor is waiting 
only for technical development of the com- 
pound to be completed before marketing it. 
Therefore, you arc sure that the information 
you would receive from Ransome's research firm 
would allow you to know with certainty 
whether or not the new compound will be 
marketed within two months. 

Ignore any tax and ethical considerations in 
answering the following question: What is the 
most you would be willing to pay Ransome's 
research firm for the information on your 
competitor's compound? Why? 


The expected value of the optimal act (lage 
batch) under uncertainty is $15,000. If perfect in- 
formation indicated that the competition will market 
his product within two months, the optimal batch 
size is zero for zero payoff. Conversely, if it were 
known with certainty that the competitor's new 
product will not be marketed within two months, 
the optimal batch size is large, for a payoff of © 
$50,000. The analysis indicates a & probability that 
the competitor's product will be marketed within 
two months. 

Thus the expected value under certainty (ic., if 
the information is purchased) is .5 X $50,000 + 5 
X $0 = $25,000. 

This is $10,000 increase over the expected value 
which can be obtained without information. Hence, 

a rational decision maker should be willing to pay 
up to $10,000 for the information, 


Procedure 


At the beginning of the experimental session, sub- 
jects were instructed that they would be asked to 
make a series of decisions similar to those encoun- — 
tered in actual business situations. They were asked 
to act as they would if they actually occupied the 


role assigned to them (e.g., President in the indi- 


vidual condition and member of the Executive Com- 
mittee in the group condition). They were given no 
training in statistical decision theory during the ex. 


periment nor were they instructed to do their best 
in any sense except in attempting to behave as 


they actually would in the roles they were asked 
to assume. 


Half of the subjects in each treatment received 2 


Problem 1 (Holigraph) first; the other half received 
Problems 2 and 3 (Zirolex I and Zirolex II) first. 
(Problems 2 and 3 were necessarily paired by the 


nature of the issues to be decided.) The order of 


the three problems was preserved in the individual 


and group conditions for each subject. Within each 


task the order of presentation of alternatives was 


yh 


; 


‘the individual or group conditions. When all subjects 


ResuLTS 


The relative quality of decision-making 
performance for each condition is analyzed 
for each task in this section. The quality of 


_ pooling rules used to create synthetic group 
decisions from individual decisions: (a) plu- 
tality, (5) best man’s decision, and (c) Lorge 
and Solomon’s (1955) Model A. 

As Table 2 shows, the groups in the G-I 
treatment clearly showed superior perform- 
ance on the Holigraph problem, selecting the 
optimal location two thirds of the time, com- 
pared to less than one third for the I-G 
_ groups, and only 16.8% for the individuals 
in the I-G treatment. (If one chooses to con- 
4 sider only the expected value criterion, thus 
making locations ZOJ and ZAJ equally de- 


TABLE 2 


Quatity or Decisions BY GROUPS AND INDIVIDUALS 
ON PROBLEM 1 


Individual then group | Group then individual 
treatment (I-G) treatment (Gel). 


indi- 
6 990 4 = 19) 


indi- 
viduals 
(n = 90) 


% groups | % groups 
( = 18) 


— ee | 
Note. Ba z 3 +001 (omitting ZE] because of low- 
* Nonsense syllables were used for plant locations. 
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sirable, the same relative nance ob- 
tains.) Although the acade: erience of 
the subjects was such tha would be 
expected, ideally, that all subj vould pos- 
sess sufficient sophistication | ulate ex- 
pected value, not all did so not per- 
ceive the relevance of the jue). To 
minimize risk, on the other was not 
necessary to calculate the v: of each 
payoff distribution. A simpl: tion of 
the range of payoffs is a su! estimate 
of the variance in this matrix 

Individuals in the G-I treat onded to 
give the same responses as t! oup had 
given. Only three of the 90 ual sub- 
jects in the G-I treatment a ned the 
location chosen by the group. 

Table 3 shows that in the 7 I task 
the majority of groups and indi is, under 
all treatments, tended to prefer zedium- 


ve, that 
less, in- 
of GI 
hat of 
s, with 
G indi- 
rginally 


risk, medium-expected-payoff al 

is, to produce a small batch. Ne 
spection of Table 3 shows the pa 
group decisions to be superior 
either I-G groups or I-G indivi: 
I-G groups marginally superior t 
viduals. The G-I individuals were 
inferior to G-I groups, with very = shifts” 
from the group decision. Thus, the relative 
quality performances of the various decision 
modes observed in Problem 1 is essentially 
replicated in Problem 2. 

Contrary to the results of some previous 
studies (e.g., Green, Halbert, & Fitzroy, 
1967), the subjects in this study rather con- 
sistently undervalued the information which 
they could purchase in the Zirolex II task. 

In this instance, individuals in the G4 
treatment achieved the highest proportion of 
the optimal $10,000 decision (the few 
switches tended to be in the optimal direc- 
tion), followed by G-I groups, then by I-G 
groups, with the I-G individuals again mak- 
ing the poorest showing. These differences, 
while consistent with the results of Problems 
1 and 2, do not achieve statistical significance. 

In order to gain insight as to whether the 
relative superiority of real groups over indi- 
viduals was due to statistical pooling or tO 
the dynamics of face-to-face interaction, syn- 
thetic group decisions were derived. The com- 
position of synthetic decision groups wWas 


>) 
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TABLE 3 
uty or Decisions py Grours AND Inpivipuats ox Promexm 2 


- — — — — — 
| Individual then groups Group then individual 
| s treatment (I-G) treatment (G-I) 
ected Risk 
ue (variance) [ 
% individuals | 9% groups % groups |96 individuals — 
| (n = 95) (m = 19) (n = 18) (n = 90) 
) 0 0 0 56 
10,000 Medium 84.2 66.7 6 
,000 High 15.8 333 300 


to of the I-G groups so that 
the I-G groups’ natural and 
ns involve the same subjects. 
poo rules were employed to 
th: uthetic decisions. First, a 
vith ties resolved in favor of 
ive) rule was used to com- 
al decisions of I-G subjects. 
it le (two or more individual 
$s in ce group being optimal, no 
y for onoptimal decision, and with 
favor of the optimal decision) 
onger test than a majority 
in no instance would the ma- 
resulted in an optimal deci- 
vod set of synthetic group deci- 
Was then obtained by assuming that the 
Synthetic decision was the same as 
best individual decision. Finally, the 
and Solomon (1955) Model A was used 
ct group decisions on the basis of the 
data. 
al groups outperform the plurality vote 
decision rule, then one might infer 
me positive contribution can be as- 
to face-to-face relationships. Table 4 
that both I-G and G-I groups tended 
h more optimal decisions than did 
y vote synthetic groups. However, 
for G-I groups in Problem 1, neither 
Of real group reached as many optimal 
as would be expected by the best- 
thetic criterion. The latter results 
Pical of studies surveyed by Davis 
). Apparently face-to-face interaction 
Mta sufficient condition to assure adop- 
the best individual’s decision, but it is 
a mere voting process which se- 
ng several individual viewpoints. 


is significant at .0S level. H = 8.3, 9 <.0S, 


The same conclusion is suggested by appli- 
cation of the Lorge and Solomon (1955) 
Model A to the present data. Their model 
asserts that the probability that a group will 
make an optimal decision is 1 — (1 — P) — 
where r is the number of members of the 
group and P is the likelihood that an indi- 
vidual will make an optimal decision, This 
criterion is similar to the best-man-out rule in 
that it is assumed that if an individual reaches 
an optimal decision, this decision would be 
adopted by the (artificial) group of which 
he is a member. One may use the frequency — 
of optimal decisions in the individual I- G 
results as a measure of P. Consistent with 
previous studies (Davis, 1969, p. 46) Model 
A overpredicts group performance in this 


study with one notable exception. In the first, f 


Holigraph problem, G-I groups actually per- 
formed at, or slightly above, the level pre- 
dicted by the model. 


Individuals Versus Groups 5 
These results clearly add support to che 
view that groups tend to make better deci- 
sions than individuals (Davis, 1969, p. 38). 
It is significant to find that these results ob- 
tain in decision situations which are similar 
to those encountered in business (unlike pure 


gambling and many of the other tasks used 


previously). While, with one exception, real : 
groups did not perform as well as the best 
individual, G-I groups did better the per- 
formance of plurality vote synthetic groups. 


Previous studies (Davis, 1969) suggest that 


best-man types of pooling rules, such as the a 
Lorge and Solomon’s (1955) model, overesti- 
mate the quality of decisions real groups will 
achieve. Real groups seem to be generally 
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TABLE 4 


PREDICTED AND OBSERVED PROPORTIONS 
or Orrimat DECISIONS 


Actual 
decisions | 
Problem P .— A í, i 
I-G | G-I a | Best | Plu- 
groups | groups | | man [|ralityb 
Holigraph | | | 
(Problem 1) 108 310 667 -601 | 580 316 
Zirolex I | | 
(Problem 2) 158 158 | 333 | .577 | .580 -000 
Zirolex II | | 
(Problem 3) 179 | -211 278 627 -580 111 
*R =1 (1 — P) where P = Proportion of I-G indi- 


viduals making an optimal decision on a problem and R = 
Predicted proportion of groups making an optimal decision on 
a problem (see Lorge & Solomon, 1955). 

In only one case did a clear plurality for the optimal decision 
exist. In other cases, ties were resolyed in favor of the optimal 
decision in order to provide the strictest possible test. 
incapable of fully using the contributions of 
individuals who have high-quality decisions. 
However, groups do seem to benefit from 
deliberations in the sense that they make bet- 
ter decisions than would be obtained by pool- 
ing individual decisions using a plurality vote 
rule. 

Of course, it is not completely reasonable 

to accept the best-man and Model A ap- 
proaches to accurately predict group perform- 
ance when the solutions to the problems are 
not of the “self-confirming” or eureka type. 
Tn tasks like economic decisions, the relative 
quality of alternative solutions may not be 
apparent to even the best individual problem 
solver, much less to the entire group. Thus 
the common assumption in both the best-man 
and Model A approaches, that if one person 
in the group finds the optimal solution it in- 
evitably is adopted by the group, is an ex- 
tremely strong assumption for the type of 
decision task used in many problem-solving 
studies, as well as in many real-life problems. 
Consequently neither the best-man nor Model 
A pooling rules constitute appropriate expla- 
nations of the difference between individual 
and group decision-making performance for 
noneureka tasks. Adequate explanation re- 
quires both an analysis of how any given 
persons obtain an optimal solution and how 
that solution gains group acceptance. 

In the Holigraph task, comparisons of LG 
individuals with both I-G groups and with 
G-I groups imply that both sets of groups 
were more conservative than were individuals. 
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This result may be due to tl ription of 
the problem which indicated i he firm’s 
board of directors tended to caution. 
In the Zirolex I task group signif- 
cantly more risky (in the va e sense) 
than were individuals. In that ion sub- 
jects were provided with no i ce as to 
the organization’s attitude toy isk tak- 
ing, but they could infer that iould be 
risk neutral. 

If the cultural values interpr may be 
properly applied in this case, do ap- 
pear to be more responsive th: re indi- 
viduals to the orientation towar if this 
orientation facilitates reaching quality 
decisions. 

I-G Versus G-I Groups 

One of the most surprising fin is that 
the inexperienced (G-I) groups better 
decisions than did experienced (1 zroups. 
In the absence of previous stu of this 
issue some speculation as to the ons for 
the relative superiority of ir rienced 
groups is warranted. A likely reasc» uld be 
the fact that I-G group members Already 
arrived at a decision and made sort of 
“public stand” on that decision by handing 
it to the experimenter. Having taken a stand 
with respect to that decision, they become 
committed to it and were reluctant te change 
it (Kiesler, 1971). A similar explanation 
could account for Campbell’s (1968) finding 


of the inferiority of group as compared to 
individuel decisions. The effect of commit- 
ment appeared strongest in the case of Gl 
individuals who seldom deviated from group 
decisions. This finding supports the commit 
ment interpretation since subjects were mak- 
ing their group decisions known to all mem- 
bers of the groups as well as to the exper- 
menter. 3 
Some evidence consistent with the commit- 
ment explanation was obtained from confi- 
dence measures on individual decisions. M 
every decision, individuals who selected the 
worst alternative were just as confident (dif- 
ferences were materially and statistically in- 


significant) of their decision as were subjects 


selecting the best and second-best alterna- 
tives. Since the majority of subjects in 4 


Il 
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three decisi ade nonoptimal individual 
decisions i !-G condition, their confi- 
dence in the optimal individual positions 
probably í a greater expenditure of 
additional t ıd more compromises (of a 


nonoptimal e) in I-G groups than was 
required in I groups where individuals 


had no op ‘ity to develop prior strong 
confidence r own choices. 

Another | which could explain the rela- 
tive inferio: F I-G compared to G-I group 
decisions ir group structure. One would 
expect that groups would develop clearer 
definitions tructure than would G-I 
groups, sin wer group resources are re- 
quired to stand the decision situation. 
But do cle: tructures facilitate better de- 
cisions? On aht expect the answer to be 
yes for I= ups, since the resolution of 
divergent m is a major problem and 
some struc hould facilitate the resolution 
of conflicts wever, for G-I groups the 
resources de ed to clarifying structure de- 


tract from 
tion, evalua 
of the deci 
might expec: 
clarity and 


se devoted to problem defini- 
and other necessary elements 
making process. Thus, one 
hat for I-G groups structural 
cision quality would be posi- 
but for G-I groups the rela- 
be reversed. 
finding remains unexplained. 
ph decision, G-I groups per- 
formed better than any of the synthetic deci- 
Sion rules. It would appear that certain kinds 
of decision theoretic problems are particularly 
amenable to group decisions. The Holigraph 
Problem fits into this category, the Zirolex 
Problem does not. Which features of the two 
Problems determine this bifurcation remain 
Open to conjecture. 
Ik the findings of group problem-solving 
terature are to have any relevance to real 
ganizations and decisions, it appears neces- 
“ary to move experiments from the laboratory 
to the real world. This experiment may be 
"garded as transitional in nature. Business 
School subjects were used and economic deci- 
ee were involyed. If these results can be 
“ihe to continuing groups, this study 
5 Cates that amount of prior exposure and/ 
Commitment to decision alternatives, clar- 
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ity of group structure, and the risk orienta- 
tion of the organization may affect the rela- 
tive quality of decisions of individuals and 
different types of groups. Verification of 
these findings with actual managers and con- 
tinuing executive groups is desirable. 


REFERENCES 


Barron, F., & Mackenzie, K. A constrained optimi- 
zation model of risky decisions. (Working Paper 
41) University of Kansas, March 1971. 

Bem, D. J., Wallach, M. A., & Kogan, N. Group de- 
cision making under risk of aversive consequences. 
Journal of Personality and Social Psychology, 1965, 
1, 453-460. 

Campbell, J. P. Individual versus group problem 
solving in an industrial sample. Journal of Ap- 
plied Psychology, 1968, 52, 205-210. 

Cohen, J., & Hensel, M. Risk and gambling. New 
York: Philosophical Library, 1956. 

Davis, J. Group performance. Reading, 
Addison-Wesley, 1969. 

Deets, M., & Hoyt, G. Variance preferences and 
variance shifts in group investment decisions. 
Organizational Behavior and Human Perjorm- 
ance, 1970, 5, 378-386. 

Dion, K., Miller, N., & Baron, R. Group risk taking: 
A review of the risky shift phenomenon, In L. 
Berkowitz (Ed.), Advances in experimental and so- 
cial psychology. Vol. 5. New York: Academic Press, 
1972. 

Dunrette, M. D., Campbell, J. P., & Jaastad, K. 
The effect of group participation on brainstorming 
effectiveness for two industrial samples. Journal 
of Applied Psychology, 1963, 47, 30-47. 

Green, P., Halbert, M., & Fitzroy, P. Experiments 
on the value of information in simulated market- 
ing environments. Boston: Allyn & Bacon, 1967. 

Hall, E. J., Mouton, J. S., & Blake, R. R. Group 
problem solving effectiveness under conditions of 
pooling vs. interaction. Journal of Social Psy- 
chology, 1963, 59, 147-157. 

Kiesler, C. The psychology of commitment. New 
York: Academic Press, 1971. 

Kogan, N., & Wallach, M. A. Risk taking: A study 
in cognition and personality. New York: Holt, 
Rinehart & Winston, 1964. 

Krantz, D., Luce, R., Suppes, P., & Tversky, A. 
Foundations of measurement. Vol. 1. New York: 
Academic Press, 1971. 

Lorge, I., Fox, D., Davitz, J., & Brenner, M. A 
survey of studies contrasting the quality of groups 
performance and individual performance. Psycho- 
logical Bulletin, 1958, 53, 337-372. 

Lorge, I., & Solomon, N. Two models of group be- 
havior in the solution of Eureka-type problems. 
Psychometrika, 1955, 20, 139-188. 

Markowitz, H. Portfolio selection. New Vork: Wiley, 
1959. 

Siegel, S. Non-parametric statistics. New York: 
McGraw-Hill, 1956. 


Mass.: 


960, ——— when. using 


Tuckman, J., & Lorge, I. Individu 
oi individual and function of group r He 
Unpublished mas- 1962, 15, 45-51. 

Technology, 


(Received April 3, 1975 


Convergent, validation of riik, taking desis or ONDU creative thi: 
'ournal of Abnormal and Social Psy- istrative Science Quarterly, 1958, - 
lity as a 


Relations, 


Po? cow ee ee a = 
draai of App ty 
me, V * 436 


INSTORMING BY INDIVIDUALS, COACTING 
AND INTERACTING GROUPS `+ 


WARREN R. STREET? 
Central Washington State College 


six 3-person groups of undergraduate volunteers worked in three 
t settings to produce answers to three brainstorming problems. The 
s worked independently in separate rooms (individual condition), 
ndently in the same room (coaction condition), and cooperatively in 
room (interaction condition). Conflicting theories predicted lower 
ctivity either in the interaction and coaction conditions, where co-workers 
vhysically present (social facilitation theory), or only in the interaction 
tion, where group norms could be formed. Results showed that the 
dent and coaction conditions produced the same number of solutions 
the interaction condition was significantly (p< .001) inferior to both. 
suggests that public action, not physical proximity, is responsible for 


Several iments performed over the 
last 15 yi ve demonstrated that indi- 
viduals arc erior to groups in providing 
answers to tivity problems requiring di- 
vergent th z (Bouchard, 1972b; Bou- 
chard & H 970; Campbell, 1968; Dun- 
nette, Can & Jaastad, 1963; Meadow, 
Parnes, & e, 1959; Rotter & Portugal, 
1969; Tayl.. Berry, & Block, 1958). There 
are obvious plications of these results, but 


their general ¿ability would be enhanced by 
an understanding of the causes for a given 
individual’s decreased responsiveness in a 
group setting. 
Researcheis in brainstorming have sug- 
gested a number of reactions to the group 
setting that may take place within the indi- 
vidual member and result in decreased pro- 
ductivity. One possible explanation is that the 
individual member perceives the group’s norm 
to be one of noninvolvement in the problem 
at hand. Each person waits for someone else 
do take the lead in producing solutions, and 
the result is low productivity for the whole 
group. Individual passivity in group settings 
has been found in research dealing with by- 
Lander intervention in emergencies (Darley 
S Latané, 1968; Latané & Darley, 1968; 
Latané & Rodin, 1969). If the same norm of 
1— 
oe of this article was presented at the 
Reou, 1 Association, Anaheim, 1973. 
; Sher 5 for reprints should be sent to Warren 
ington 5 of Psychology, Central Wash- 
ollege, Ellensburg, Washington 98926. 


idual-group differences in brainstorming. 
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noninyolvement affects brainstorming groups, 
training techniques or instructions which 
stress involvement and individual participa- 
tion should increase productivity. Two recent 
studies (Bouchard, 1972a, 1972b) demon- 
strate that such techniques are effective in 
partially overcoming the inferior productivity 
of brainstorming groups. 

A second, related explanation offered in 
brainstorming studies is that fear of social 
disapproval causes group members to censor | 
their contributions, regardless of how much 
the experimenter encourages them to report 
every answer that comes to mind. This same 
fear of disapproval may also cause group 
members to perseverate on one class of simi- 
lar, “safe” solutions and avoid plowing new 
ground in the search for solutions. Because 
it perseverates, a group is not likely to pro- 
duce the diversity of solutions that would be 
obtained from the combined output of the 
members working independently. It follows 
that diversity of group-produced solutions 
could be increased by simply increasing the 
number of groups working on a problem. 
Thus, if the perseveration hypothesis is cor- 
rect, the pooled output of many groups Should 
be at least equal to the output of an equal 
number of isolates. Bouchard, Drauden, and 
Barsaloux’s (1974) reexamination of data 
from past studies, however, shows that groups 
are still inferior to individuals when a pooling 
procedure is followed. While perseveration — 


= 


does not appear to account for group-indi- 


n 
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vidual differences, there are still other alter- 
natives worthy of consideration. 

One such explanation for the inferiority of 
groups is offered by an application of Za- 
jonc’'s (1965) theory of social facilitation. 
Zajonc contends that the presence of others 
leads to heightened arousal, or drive. This, in 
turn, causes an increased likelihood of giving 
well-learned responses to stimuli and de- 
creased likelihood of giving poorly learned or 
unusual responses, Since these unusual re- 
sponses are the ones called for in a problem 
requiring many creative answers, the presence 
of others should decrease the number of crea- 
tive solutions. This, of course, is the observed 
result of brainstorming studies. 

It is important to note that decrements in 
creativity, according to social facilitation 
theory, should result from the mere presence 
of others, regardless of whether they are 
interacting or simply working independently 
in close physical proximity (coacting). The 
brainstorming productivity of both interacting 
and coacting groups should be inferior to indi- 
vidual output if the social facilitation model 
is correct. The present study compared the 
productivity of individuals, coacting groups, 
and interacting groups in order to (a) repli- 
cate the group-individual differences found in 
earlier studies and (4) to examine the produc- 
tivity of coacting groups where others were 

present but working independently. 


METHOD 
Subjects 


The subjects were 108 undergraduate psychology 
_ Students who volunteered to participate in the study. 
They formed 36 3-person groups which were homo- 
= geneous with respect to sex and of the same sex as 
che experimenter. The experimenters were 24 under- 
graduate students of experimental social psychology. 
The study was described to the experimenters as an 
_ exploratory study with no specific hypothetical out- 
come. 


Design 


Ihe experiment was a repeated-measures design 
-~ wherein each group of three subjects gave solutions 
for three different problems. Each problem was 
Presented in a different setting. The order of presen- 
tation of settings, order of presentation of problems, 
and the assignment of problems to settings were all 
completely counterbalanced, resulting in 36 different 
combinations. 
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Hrocedure +4 

Settings. Individual, interaction action set- 
tings formed the three levels oí ndependent 
variable of interest. In the indivic tting, sub- 
jects were each placed in isolated controled 
cubicles furnished with a table, pen, and 
paper. They had 10 minutes in w >) Write as 
many solutions to their problem as uld think 
of. In the interaction setting, were all 
Seated in the same room around on table and 
Were to produce solutions by tal! cely and 
working together for 10 minutes. | coaction 
setting, the subjects were all seat: individual 
desks in the same room, separated rout two 
feet. They were instructed not municate 
while they wrote their answers in the nute time 
allotted. In the interaction setting, ubject, 3 
volunteer, recorded the group’s answ: 

This general procedure differs fro ny past 
studies in one regard. In this stud subjects 
recorded their answers by writing, w ost pre- 
vious studies have tape-recorded ora! onses. In 


a study where written responses w: d, Bou- 
chard (1969) points out that a cons response 
mode across treatment conditions is aportant 
requirement for valid comparisons. Ir present 
study, the coaction setting necessitate ten, in- 
stead of oral, responding, and this mod response 
was used in all settings. 

Instructions. After arriving at a room, 
each group was met by an experimenter 0 intro- 
duced himself, thanked them for comin; and er- 


plained the need to read instructions 
pared text. The instructions for each s 
as follows: 


a pre- 
ng were 


(Individual) Each of you is in a scparate 
room. I can talk to all of you through this 
intercom. In a moment, I will give you a prob- 
lem for which there are many creative answers- 
Your job is to come up with as many solutions 
to this problem as possible. Don’t worry about 
whether your answers are realistic or practical. 
Just try to produce as many answers as possi- 
ble. Write each answer down on the sheet in 
front of you. You will have ten minutes in 
which to work on this problem. Now, if you 
have any questions before I read the problem, 
just speak into the intercom and I'll answer 
your questions. 

(Interaction) We've placed you all in the 
same room and we want you to work on this 
task as a group. This means that you should 
talk freely to cach other and work together 
to do as well as possible. In a moment, I will 
give you a problem, etc... .. (the remainder 
is identical to the Individual setting instruc- 
tions). 

(Coaction) We've placed you all in the same 
room, but we want you to work on this task as 
individuals. This means that you should not 
talk to each other or look at each other’s work. 


each à separate desk so that 


We've gis 


you will lependently. In a moment, I 
will give problem, etc. . (the re- 

mainder is cal to the Individual setting 

instructior 

Problems roblems were similar to those 
wed in oth: orming studies and read as fol- 
lows: 

1. Heig lem: We know today that the 
average | id weight of people is increas- 
ing. Supp: overies in medicine and nutri- 
tion ace iis trend and that, 20 years 
from no werage American is seven feet 
tall and t average weight is 250 pounds. 
What wc the consequences? What ad- 
justments this situation require? 

2. Thu problem: This is not likely to 
happen, bi igine for a moment what would 
happen i ryone born after 1980 had an 
extra thu each hand. This extra thumb 
would be just like the present one is, 
but woul located on the other side of the 
hand. It bend inward toward the fingers, 
just as th ent one does. Here is the ques- 
tion: Wh fits or difficulties will arise when 
people st ing this extra thumb? 

3. Tou roblem: Each year a great many 
American ts go to visit Europe. But now 
Suppose t ur country wanted to get many 
more Eu ns to come to America during 
their vaca . What steps can you suggest that 
would ge: iy more Europeans to come to 
this count s tourists? 

RESULTS 


The dependent variable, group productiv- 
ity, was defined as the number of nonover- 
pping solutions produced by a 3-man group 
on a single problem. While subjects in the 
Coaction and individual settings did not ac- 
tually produce solutions as a group, their 
ideas were combined and scored as one group’s 
Output. These are typically called “nominal 
groups™ in the literature. While the tourists 
Problem appears to be of a qualitatively dif- 
ferent nature than the other two problems, 
ere were no significant differences in the 
mean numbers of answers given for different 
problems. 
The data were analyzed by means of a 
Š X 6 repeated-measures analysis of variance 
nove), testing for differences between 
. Settings and six possible orders of set- 
k 8s. Since some previous studies report in- 
‘eased output following a “warm-up” prob- 
em in isolation, it was possible that setting 


} roRMING BY ÎNDIVIDUALS, COACTING AND INTERACTING Groups 
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TABLE 1 


Mean Nuúuner or Resronses To 
Crxariviry Prones 


Setting Mean 
Individual 27.53 
Coaction 27.06 
Interaction i 17.72 


I 

orders may have been related to productivity. 
The mean productivities per group (Table 1) 
in the individual and coaction settings arè 
almost identical and far superior to produc- 
tivity in the interaction setting. The ANOVA 
shows the effects of settings to be significant 
beyond the .001 level (F = 12.85, df = 2/ 
10), and Tukey’s test of honestly-significant- 
differences shows no differences between the 
individual and coaction settings, both of which 
are higher (p < .01) in productivity than are 
groups in the interaction setting. There are no 
effects related to orders of settings (F = 1.03, 
df = 5/30, ns). 


Discussion 


In replication of similar studies, interact- 
ing groups were found to be inferior to indi- 
viduals in the production of answers to prob- 
lems of divergent thinking. The results of the 
present study go on to show that some factor 
other than the mere presence of two other 
workers is necessary to produce the inhibiting 
effect of groups. It is possible that larger 
numbers of co-workers could hamper produc- 
tivity, but the present study indicates that 
working in the same room with others does 
not hamper creativity. It is only when one’s 
actions become public knowledge that de- 
creased performance results. This conclusion 


in a recent review of social facilitation litera- 


p 
coincides with that offered by Cottrell ( Wen 


ture. 
A methodological issue in the present study 


should be mentioned in this report. In the 
coaction and individual settings, all three sub- 


jects could record answers. To promote verbal 


| 


exchange in the interaction setting, only one 
subject, the volunteer recorder, was given 
writing materials. This may have reduced the 
number of answers recorded in the interaction 
groups. In a brief follow-up study, 24 3-man 


groups solved one of the three original prob- 
tems in the interaction setting. Each subject 
was given writing materials, and the group 
was given 20 minutes in which to work. Even 
under these favorable conditions, mean pro- 
| ductivity was 22.53 solutions, still signifi- 
cantly less than individual output (¢ = 1,70, 
f < OS), although improved over the origi- 
nal interaction groups (¢ = 1.78, P < 05). 
be pattern of group-individual differences 
was unaltered by this change in method. 
It seems likely that differences between 
group and individual output do not stem from 
physiological arousal but rather from compli- 
ance with group norms of noninvolvement. In 
_ applied settings, steps may be taken to com- 
pensate for these inhibiting norms. The ex- 
perimental literature indicates that special 
_ motivational, training, and problem-solving 
techniques occasionally result in groups that 
‘are as creative as individuals. Rarely, how- 
ever, do these groups surpass individual pro- 
ductivity. Time-consuming and expensive 
techniques to bolster group creativity do not 
appear to pay their way in numbers of ideas. 
In deciding upon a problem-solving tactic, 
group techniques should not be rejected with- 
out consideration, despite the results of this 
and other brainstorming studies. These stud- 
ies typically employ problems requiring the 
largest possible number of solutions, but 
many practical problems call for the produc- 
tion of a single, best solution. A combination 
of individual, anonymous production of a 
large number of alternatives and group delib- 
erations on the best alternative may be use- 
ful in these circumstances. In addition, situ- 
ations where relevant information is dis- 
tributed among many people may justify a 
Eos problem-solving approach (Bouchard, 
< 1). 
Where creativity and diversity are desired, 
however, individuals working alone should be 
productive. The present study shows 
that the physical proximity of individual 
brainstormers is inconsequential, but that 
privacy or anonymity of response is a major 
fi contributing to productivity, 


Waxaen R. Sreeet 
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. UTILIZATION AND EXPERT JUDGMENTS: 


A COMPARISON OF INDEPENDENT AUDITORS 
WITH OTHER JUDGES 


ROBERT H. ASHTON * 
University of Texas at Awitin 


ilization by independent auditors, a type of expert judge, was examined 
an analysis of variance framework. Alter reading a brief narrative 
ing a hypothetical manufacturing company, 63 auditors judged the 
h of payroll internal control as a function of 32 different combinations 
internal control indicators, The results demonstrated the overwhelming 


© correlation for interjudge consistency was .70, while the average 
udge correlation was .81, The average insight index (the correlation 


was 89. 


s Expert jı are faced with the task of 
interpreting integrating numerous items 
of informa nto an overall judgment or 
decision, T orm this task, a judge must 
weight item nformation differentially and 
must qualif; interpretation of a given item 
when other iderations make such qualifi- 
ation nece (Slovic, 1969). Because of 
the importa of such judgments in psychol- 
Ogy, busines: .nd elsewhere, researchers have 
Tecently con sucted systematic studies in an 
effort to understand the manner in which 
experts use iniormation. An important goal of 
such Studies has been the description of expert 


judgments. Ii such judgments, and the infor- 

Mation-processing activities which precede 
„can be adequately described, then it 

May be possible to develop techniques to im- 

Prove upon human judgment in various ap- 
areas. 

Initially, multiple regression techniques 
Were used in an effort to identify both linear 
and configural cue utilization. Hoffman’s 
(1968) discussion concerning the lack of suc- 
Ss on the part of judgment researchers in 
identifying configural cue utilization with 
multiple regression models has been followed 

a small number of studies conducted in an 
imalysis of variance framework. A typical 

Requests for reprints should be sent to Robert 


. Ashton, Business-Economics Bldg. 324, University 
exas, Austin, Texas 78712. 


cn an auditor’s subjective weights and the statistical weights of the six 
Similarities and differences between the present findings and 
of studies involving other types of expert judges are discussed. 


finding of such studies has been the over- 
whelming of main effects (repre: 
senting linear cue utilization) as compared to 
interactions (representing configural cue uti- 
lization). This research has involved various 
types of judges and judgment tasks; for ex- 
ample, Veterans Administration (VA) hos- 
pital personnel (Rorer, Hoffman, Dickman, & 
Slovic, 1967), radiologists (Hoffman, Slovic, 
& Rorer, 1968), stockbrokers (Slovic, 1969; 
Slovic, Fleissner, & Bauman, 1972), and 
graduate students (Slovic et al., 1972). 

The present article reports a study involv- 
ing a different type of expert judge—inde- 
pendent auditors employed by public ac- 
counting firms. In the process of performing — 
an audit, an independent auditor must fre- 
quently rely upon his professional judgment 
in combining various items of information 
into an overall evaluation. In addition to 
studying cue utilization via the analysis of | 
variance model, the present study assessed 
(a) judgment consensus, or agreement across 
auditors at the same point in time using the — 
same data; (b) judgment stability, or con- 
sistency over time for the same auditor using 
the same data (Goldberg & Werts, 1966); and 
(c) judgment insight, or the extent of agree- 
ment between the statistical description of 
the auditor’s judgments provided by the 
analysis of variance model and a subjective 
description of those judgments provided by 
the auditor himself (Hoffman, 1960). 
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k METHOD 
Subjects 
+ 
y The subjects were 63 practicing auditors who 
’ were employed by four public accounting firms in 
_ the Minneapolis/St. Paul area, The four firms con- 
sisted of two large national firms, a regional firm, 
and a local firm. The great majority of the auditors 
bad 2 or 3 years of experience. A partner or mana- 
ger in each firm was contacted to determine how 
closely the experience levels of the subjects coin- 
= cided with the experience levels of those auditors 
who, in actual audits, make the type of judgment 
t studied in the experiment. All firms indicated that 
che typical amount of experience required before an 
i auditor is allowed to make this type of judgment is 
_ 24 to 3 years. Therefore, most of the subjects were 
_ approaching the point in their careers at which they 
would actually make this type of judgment. 


Task 


The experimental task selected concerns the audi- 
tors evaluation of his client's system of “internal 
control,” which has been described as 


+ + . the plan of organization and all of the 

coordinate methods and measures adopted 
within a business to safeguard its assets, check 

_ the accuracy and reliability of its accounting 
data, promote operational efficiency, and en- 
Courage adherence to prescribed managerial 
Policies [American Institute of Certified Public 
Accountants, 1949, p. 6]. 


The selection, timing, and extent of auditing pro- 
cedures applied in a particular engagement depend, 
in part, upon the judged effectiveness of existing 
internal control (Brocker, 1967; Mautz, 1964). It 
is important, therefore, that the process of judg- 
ment formation in the evaluation of internal control 
be subjected to systematic research.2 


- 


for example, 
inventories, etc. 


characteristics of adequate payroll internal control 
‘should be familiar to all auditors, For the purpose 
‘of this experiment, six indicators of payroll internal 
control were selected. They were presented to the 
auditors in the form of questions. Each question 
Vas answered by the experimenter as “yes” or “no,” 
and a “no” answer always represented a weakness 
in the internal control subsystem. It is important 
ta recognize that the most common instrument used 


portance of internal control evaluation in the per- 
formance of an audit and (b) the necessity of rely- 


i upon judgment in the evaluation of internal 
control, see Ashton (1973). 


3 For a more complete discussion of (a) the im- 


3 
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by auditors for reviewing inter 


rol sub. 


systems is the internal control ques: e, which 
corresponds very closely to the st presenta- 
tions used in this experiment. Of cx e typical 
internal control questionnaire contair han sx 
questions. 

Each subject was given a brief na describ- 
ing a hypothetical manufacturing ny. In- 
cluded in this narrative was a descr of some 
basic structural aspects of the co payroll 
internal control subsystem. In cx n with 
this unchanging background informa ch sub- 
ject judged the strength of interna ol as a 
function of 32 different combination and no 
answers to the six questions. Their ts were 
made on a scale from 1 (extreme!) ) to 6 
(adequate to strong). One of the stin resenta- 
tions (hereafter called a “case”) used experi- 
ment appears in Figure 1. 

Each of the first five questions nts an 
aspect of payroll internal control w! recog- 
nized to be very desirable from the int of 
sound internal control. The sixth questi: cluded 
because auditors may use past results tarting 
point for the examination of the cur: ar; in- 
deed, they may (perhaps erroneously ) jer the 
past to be the best indicator of preser litions. 
Another reason that past results art ant is 
that all aspects of an internal contro system 
may not be examined every year. only a 
few aspects may be examined each yes: ~d that 
information may be combined with th: ults of 
previous examinations in evaluating the eĉ -tiveness 


of the controls. 


Design 


Since six questions were employed, each vi 
could assume two answers, 64 possible p 
answers (2°) to the six questions were availal 
order to enable the experimental task to be com- 
pleted within a reasonable time period, only 32 
patterns were actually used. Specifically, the 32 
patterns were chosen in accordance with a one-half 
fractional replication of a 2° factorial design (Coch- 
ran & Cox, 1957). The estimation of four-factor and 
five-factor interactions is impossible with this Pe 
ticular design because their mean squares are in- 
cluded in the mean squares of main efiects and 
two-factor interactions. This does not present à 
significant problem; much empirical evidence exists 
which indicates that such higher order interactions 
in studies of human judgment are negligible (Gold- 
berg, 1968; Hoffman, 1968; Rorer et al, 1967). 
Since the data were analyzed separately for each 
auditor, there was no within-cell variance from 
which to construct the error term for the analysis; 
therefore, the mean squares from the three-factor 
interactions were combined to form the error term 
(Davies, 1956). The inability to evaluate three-factor 
and higher order interactions is not a significant 
disadvantage; a principle aim of the study was to 
assess the relative importance of main effects (repre- 
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NO. = YES NO 


the tasks of both timekeeping and payment of employees 
“quately separated from the task of payroll preparation? X 


e the tasks of both payroll preparation and payment of 
sloyees adequately separated from the task of payroll bank 
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id two-factor interactions (rep- 
vie 


T Descriptive n s of judgments. Analysis of vari- 
2 Nei to the data collected from each 
text, a significant main effect 
juestion implies that the auditor’s 
f systematically with the value of 
Particular question. In other words, it made a 
ence to the auditor whether the question was 
wered yes or no. A significant two-factor inter- 
on implies that the auditor was responding to 
pa 3 patterus of answers to the questions; that 
a e effect of variation of one question upon his 
onses differed as a function of the level of an- 
et question (Hoffman et al., 1968). After F ratios 
a obtained for the six main effects and 15 two- 
k- interactions, omega-square statistics (.) 
* computed (Hays, 1963). In this context, omega- 
em an estimate of the proportion of variance 
3 ent that is accounted for by the specific 
n effect or two-factor interaction. Therefore, 
5 Bee is used to determine the extent to 
= each auditor used each of the six internal 
az de questions (and their interactions) in formu- 
E his internal control judgments. 
A insight. To assess the relationship be- 
5 statistical description of the auditors’ 
aa aud a subjective description of those 
dit oth types of descriptions were obtained. 
itor's subjective cue-weighting scheme was 


nti ee : 
tified by having him distribute 100 points across 


count reconciliation? x 
the names on the payroll checked periodically against the 
ve employee file of the personnel department? X 
formal procedures established for changing names on the 
roll, pay rates and deductions? x 
the payroll audited periodically by internal auditors? xX 
1s the internal control over payroll found to be satisfactory 
‘ing the previous audit? x 
= 2 
5 E ra g a a 3 
gpa „ 
T 3 E 8 = B 
5 Ä—Hr EAAS 
1 2 3 4 5 6 


cure 1. Example of an internal control case used in the experiment. 


the six questions in such a manner as to reflect the 
relative importance of each question to his judgments 
(Hoffman, 1960). Each auditor distributed his 100 
points after having rated the 32 cases. 

Order of presentation of the cases. The 32 cases 
were presented to the auditors in three different 
orders, with approximately one third of the auditors 
receiving cases arranged in each order. In Order 1, 
the first half of the cases was primarily composed of 
those in which only two questions were answered 
yes, while the second half was primarily composed of 
cases in which four questions were answered yes. 
This pattern was reversed in Order 2, while in the 
third order, a “four-yes” case always followed a 
“two-yes” case, The effect of order of presentation 
upon both statistical weights and subjective weights 
was 8 
Judgment consistency. The two types of judgment 
consistency that were investigated in the study 
were (a) consensus, or consistency across auditors 
at the same point in time using the same data, and 
(b) stability, or consistency over time for the same 
auditor using the same data. Estimates of both 
types of consistency were expressed as correlation 


coefficients. 


Procedure 


The experiment was conducted by visiting the 
public accounting firms and meeting with the audi- 
tors in small groups. The participants were given 
a brief oral introduction to the judgment task and 
were then presented with the case materials. The 
auditors typically worked at the task for 30 to 40 
minutes. The second administration of the experi- 


= 


TABLE 1 


Retative Inporrance Oy Mx Evrrers ax 
Two-Pactox INTERACTIONS TO 


rut Avoirors 
Average a? 
Main effect | No. — 
for w. 
teraction | significant* | Significant | All auditors 
P f ones only | (N = 63) 

@ 274 26.2 
w 26.3 25.2 
46 8.7 6.7 
a“ 98 7.1 
5 10.5 8.5 
27 14.5 6.6 
15 3.5 1.1 
6 27 4 
8 23 6 
6 3.6 5 
7 1.9 4 
23 8 2.7 6 
24 8 3.2 7 
5 2.8 4 
26 7 1.5 4 
34 3 3.7 4 
35 1 1.8 4 
36 1 1.7 a 
45 2 2.2 4 
46 + 1.8 2 
56 1 6 3 


„ Significant at the .0S level. The great majority of significant 
main effects were significant well beyond the -O1 level, but 
| almost no interactions were significant at that level. 


ment, which was necessary in order to assess con- 
sistency of judgment over time, was conducted in 
the same manner as the first administration. The 
time interval between the two administrations 
ranged from 6 weeks to 13 weeks. Of course, the 
_ tactical procedures involved in making such judg- 
ments in actual auditing situations differ from 
_ those described in this study. For example, auditors 
> do not make a series of similar judgments within 
such a brief time span, nor is a subsequent audit 
performed on the same client 6 to 13 weeks later. 
These simplifications were made necessary by prac- 
tical considerations. ue 


P REsuULTS 

Main Efjects and Interactions 

An indication of the relative importance to 
che auditors of the main effects and interac- 


. * An exception was made in the case of the re- 

gional firm, in which it was necessary to conduct 

_ the second administration by mail. Five of those 

_ auditors did not respond; therefore, only 58 of the 
_ 3 auditors repeated the experiment. 
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tions is presented in Table 1.‘ ercentage 
of variance in judgment accou: r by the 
six main effects combined ran om 48% 
to 96%, with an average of 8( while the 


range for the 15 two-factor int ons com- 


bined was 0% to 17%, with erage of 
6.4%. (On the second admir ion, the 
average for main effects and in ions was 
79.9% and 5.5%, respectively vast five 
of the six main effects were ant for 
two thirds of the auditors, whi! number 
of interactions reaching signifi was be- 
tween zero and two for almos! of the 
auditors. On the average, the “,. portant 
main effect (representing Qui 6) ac 
counted for six times as much ance in 
judgment as did the most impos interac- 
tion (representing Questions 1 ). Ap- 
parently the auditors did not | ior pat- 
terns of answers to the six questi ; rather, 
they evaluated the effect of ea: question 
independently of the effects of r ques- 
tions in the same case. ; 
Individual Differences in Cue Użi eon 

A point of agreement between the | sults of 
this study and those of previous sit es con- 
cerns the extent of individual dife nces in 
cue utilization. On the average, main effects 


representing Questions 1 and 2 accounted for 
a total of 51.4% of the total variance in 
judgment. This represents a sizable propor- 
tion of the 80.2% accounted for by the six 
main effects. Notice that both quesiions deal 
with separation of duties, an internal control 
characteristic which is crucial in the payroll 
subsystem because of the risk factor. The two 
questions were almost equally important; 
Question 1 accounted for 26.2%, and Ques- 
tion 2 accounted for 25.2% of the total vari- 
ance. On the average, the other four ques- 
tions were approximately equally important. 
Although Questions 1 and 2 were considered 
most important overall, such individual dif- 
ferences existed among the auditors that each 
question was considered most important by at 


»Unless otherwise specified, the data reported 
are taken from the first administration of the ex- 
periment. The results from the second administra- 
tion were usually very similar and are presenti 
where differences (or particular similarities) are of 
interest. 
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ind each question was con- 
tant by at least one audi- 
ndividual differences in cue 
sented in Table 2. 


cx was computed for each 
x took the form of a corre- 
jective weights and statisti- 
the six questions. Before 
relations, statistical weights 
include the effects of inter- 
169). It must be emphasized 
dex was computed in order 
al indication of degree of 
ment). One must be careful 
much confidence in the spe- 
‘he insight indexes because 
uted over such a small num- 
likely to be highly unstable. 
sreement between subjective 
tistical weights was quite 
ge value of the insight index 
nirds of the indexes were in 
s, and only three were below 


‘ation of the Cases 


portance of total main effects 


.ctions was not influenced by 


TABLE 2 
EXAMPLES OP D:FFERENCES IN THE IMPORTANCE OF 
THE QUESTIONS TO THE VARIOUS AUDITORS 
Percentage of total variance in 
„ Most x judgment accounted for by 
important Auditor question 
question 1 
1 2 3 456 
— C 
` 24 7 1-2) a A 
$ 44 74 7) 41 1 1 
2 2 ns | 67 | ms ns 9 7 
$ 21 5167 13 „ 33 
3 29 7 24 20 10 4] as 
> 37 | 16 | 12 | 33 | ns | 16 | ns 
$ 27 14 712 | 28 | ns | ms 
5 57 3 |14 | 14 | 33 | 14 | ns 
5 56 6 6| 3) 12 45) 8 
6 59 11 6 3 3 | 29 | 29 
6 22 ns 2 2 1 | 18 | 66 
42 ns | ns | 3 | ns | 26 | 48 


` z 
Not significant at the .05 level. 


TABLE 3 


Errect oF Ozone or Parsextanox or mE 
Casts ox trax Coume Iurortaxce 
or Questions | axp 2 


Average statistical Average subjective 
weight ight 
eevee Eee 
| 
Admini- | Admini- | Admini- Admini- 
siration | | stration 2 | stration | | Strata? 
1 84.5 84 48.7 $23 
2 454 423 aaa 42.2 
3 53.8 52.8 | 49.2 Sii 


order of presentation, the importance of Ques- 
tions 1 and 2 differed substantially. For those 
auditors who received Order 2, in which rela- 
tively strong cases of internal control ap- 
peared early in the sequence, Questions 1 and 
2 were apparently less important than for 
those auditors who received the cases in a 
different sequence. The same effect occurred 
for subjective weights and statistical weights 
on both administrations of the experiment. 
Data are presented in Table 3. This implies 
that the importance of an internal control 
cue is partially determined by the context in 
which it appears. It also emphasizes, in an- 
other manner, the relatively strong agreement 
between the statistical weights attached to 
the questions by omega-square and the sub- 
jective weights assigned to them by the audi- 
tors. 


Judgment Consistency 


Concerning judgment consensus, the corre- 
lations between the ratings of all pairs of audi- 
tors on the 32 cases were computed as an 
estimate of the extent of agreement between 
their internal control judgments. The average 
correlation was approximately .70. The range 
of correlation coefficients was generally from 
the lower .40s to the higher .80s, but a small Er 
number of coefficients were between .06 and 2 
40 (about 3%) and between .90 and .93 
(about 1%). Generally, the more discrepant — 
were the cue-utilization schemes of two audi- — 
tors, the lower was the correlation between 
their resultant judgments. Consensus differed — 
very little as a function of firm affiliation or 
experience level. y 


Nn 
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Concerning judgment stability, which was 
assessed for each auditor by correlating his 
ratings from the two administrations of the 
experiment, the coefficients ranged from a low 
of .43 to a high of .96, with an average of 
81. Only four of the 58 coefficients were be- 
low .70. Again, the coefficients differed very 
little by firm or experience level. 


Discussion AND CONCLUSIONS 


The present study has revealed some simi- 
larities, and some differences, between judg- 
ments made by independent auditors and 
other types of expert judges. These similari- 
ties and differences are discussed below. Pos- 
sible implications of the present findings for 
the auditing profession are discussed else- 
where (Ashton, 1973). 

The most striking similarity concerns the 
overwhelming importance of main effects as 
compared to interactions when the cue-utili- 
zation strategies were represented by the 
analysis of variance model. In the present 
study, it is perhaps unrealistic to expect siza- 
ble configural cue utilization. Since a yes 
answer to any question represented a favor- 
able aspect of internal control (and, likewise, 
a no answer always represented an unfavor- 
able aspect), one would not expect different 
patterns of answers to reverse the implica- 
tions of particular cues. On the other hand, if 
judgment processes of experts really include 
a sizeable configural component, the present 
study should have allowed the demonstration 
of that component to a greater extent than 
previous studies. The judgment task used in 
this study was simpler than those which have 
typically been used. The probability that the 
descriptive models which emerged from this 
study may represent the judges’ simplifica- 
tion of an overly difficult task would appear 
to be lower than has been the case in previous 
studies. 

The fact that very small configural effects 
were found, even in this relatively simple 
judgment task, reinforces even more strongly 
the results of previous studies, Hoffman et 
al. (1968) found that approximately 90% 
of the variance in judgments could be ex- 
plained by main effects. The average per- 
centage of judgment variance explained by 
interactions in the study by Slovic (1969) 
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was 6% and by Slovic et al. ) was 4%, 
Even smaller percentages v eported by 
Rorer et al. (1967). In thos idies, as in 
the present study, it appears small con- 
figural effects may be identifi zen certain 
patterns augment or attenu he impor- 
tance of particular cues (H in et al, 
1968). This situation is so es called 
“ordinal interaction” as oppose disordinal 
interaction” (Digman, 1966). 

Some differences between pri udies and 
the present study should be n first, the 
present study involved a lar; umber of 
subjects (63) than did any > studies 
previously mentioned (their ra was 2 to 
24). While it is true that analy variance 
was applied to the data of subject 
separately, the consistency of alysis of 
variance results over a larger nu of sub- 
jects lends additional credibilit, the find- 
ings. Second, the format of the ulus ma- 
terials used in this study corre: led more 
closely to the “real world” tha: | formats 
used in some previous studies: example, 
whereas stockbrokers do not, in al prac- 
tice, make evaluations based u; dichoto- 
mized variables ub versus down, 
“good” versus poor, “many” yersus “few” 
(Slovic, 1969; Slovic et al., 1972 )-—auditors 


do evaluate internal control effectiv: 
questionnaire containing a series 3 
no answers.“ Third, the variables of experience 
level and firm affiliation were given explicit 
attention in this study. The data were ana- 
lyzed to determine any differences related to 
these variables. No significant differences con- 
cerning configural processing were found. 

In this study, as in previous studies, judg- 
ment stability was greater than judgment 
consensus. When data for individual auditors 
were examined, it was found that stability, 
like consensus, was related to the use of par 
ticular internal control questions. Auditors 
who relied upon the same one or two ques- 
tions to approximately the same extent on 
both testings exhibited relatively high stabil- 


50n actual questionnaires a third category of 
“not applicable” is sometimes included, It was not 
included in the experiment because only six ques- 
tions were used and all were presumed relevant. As 
usual, it is difficult to generalize beyond the levels 
actually used in the experiment. 
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ity. Those w! anged their patterns of reli- 
ance exhibit: latively low stability. 
Other jud t studies of this type have 
not assessed ity over a significant period 
of time. Hofi t al. (1968) had each judge 
make two ji nts of each stimulus case, 
apparently < ed by only a few minutes, 
and found t} > judges’ reliabilities ranged 
from .60 to with a median of .80. In 
another stuc offman (1960) found relia- 
bilities in t»! sid-.80s. Length of time be- 


tween judgn 
in judgment 
a slightly n 


may be an important factor 
ility. For the present study, 
e relationship between sta- 


bility and L of time was found. As the 
time interva! reased from 43 days to 94 
days (in ro 10-day intervals), average 
* con ently decreased from .83 to 
+10, 

It is also esting to note that judgment 
consensus w ‘eater in this study than in 
Studies of o types of expert judges (see 
Goldberg, 1 For example, in the Hoff- 
man et al. | study, the range of corre- 
lation coeffi: s was —. 11 to .83, with a 
median of. ia that study, approximately 
60% of the «ficients were below .40. 

The degre: of judgment insight reported 
for other typo: of expert judges appears to be 
much lower thn that reported in the present 
Study (Hoffman, 1960; Slovic, 1969; Slovic 


et al., 1972), Caly one specific average value 
for an insight index has been presented; it 
Was 34 and was based upon the magnitude 
of effect index (Slovic et al., 1972). The same 
study found a negative relationship (Spear- 
Man r = —.43) between length of experience 
and degree of insight. In the present study, 
‘sight did not differ significantly as a func- 
tion of experience, Of course, the relative 
Simplicity of this particular judgment task 
may have contributed to the magnitude of in- 
Sight indexes obtained, 

The data collected in the present study can- 
pol Provide an explanation for the findings 
that independent auditors exhibited greater 
consensus and greater insight than have other 
pes of judges. One may speculate, however, 
that the educational experiences of auditors 
a m college and in preparation for the 

“tified Public Accountant examination—are 
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at least partly responsible for these differ- 
ences. The study of internal control in col- 
legiate auditing courses is not likely to differ 
substantially among reputable colleges and 
universities. Characteristics of “good” internal 
control are defined quite clearly and are dis- 
cussed in numerous publications with which 
the great majority of auditors should be very 
familiar (e.g., American Institute of Certified 
Public Accountants, 1949, 1963). 

The extent to which the internal control 
evaluation situation is well defined may be 
contrasted with that pertaining to the medical 
diagnosis situation (Hoffman et al., 1968) and 
the stock evaluation situation (Slovic et al., 
1972), in which the meanings of particular 
cues are likely to be much more ambiguous in 
relation to the variable to be judged. This 
may provide a somewhat more basic explana- 
tion for the observed differences between 
auditors and other types of expert judges. 

The importance of individual judgment in 
accounting and auditing, and in business gen- 
erally, should not be overlooked by judgment 
researchers. In business, as in other areas, the 
consequences of decisions which include indi- 
vidual judgment as a major input are often 
quite significant, and current techniques of 
judgment research are generally applicable. 
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< for performance appraisal are frequently made by supervisors. In the 
study, judgments of effectiveness for hospital nurses were obtained 
e nurse herself and her peers in addition to her supervisor, using the 
iting form. Factor analysis indicated that cach rating source could be 
identified and characterized. The data reaffirmed the notion that inter- 
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ROI 
disa 
sive appraisals are suggested. 

The prol errors, and dilemmas in- 
herent in ra have been well documented 
(eg., Barre 66, Guion, 1965; Schmidt 
& Kaplan, and suggestions have been 
made for ci changes, specific formats, 
and trainin; . Kavanaugh, 1971). Simi- 
larly, numer uthors have been quite criti- 
cal of curte: actice (Wallace, 1965), and 
some (Odi 1964) have suggested the 
elimination dgmental criteria entirely. 
The latter se. = unlikely to occur for as long 
a there are -ople supervising others it ap- 
pears there be subjective (albeit covert) 
evaluations. :.rthermore, there are those who 
agree with Emn and Naylor (1968) that 
Subjective judements are “a very rich source 
of data [p. ic8|” that should not be aban- 
doned but somehow gathered or transformed 
to meet its tial (see also Campbell et 


al, 1970, p. 123). Moreover, a number of 
these advocates appear to concentrate their 
ellorts and optimism not on the development 
s specific rating forms per se, but instead on 
the judge, the rater, or as he sometimes is 
called, the observer, 

The ideal rater who observes and evalu- 
ates what is important and reports his judg- 
ment without bias or appreciable error does 
0 exist, or if he does, we don't know how 

Separate him from his less effective col- 
———_—- 


— 5 authors wish to thank Milton D. Hakel and 
a Pai - Wherry, Sr. for their helpful comments on 
Vice on a draft of this article. Robert J. Wherry’s ad- 

Re data analysis was invaluable. 
I. oo for reprints should be sent to Richard 
aig Department of Psychology, College of 

í and Behavioral Sciences, Ohio State Univer- 


Sit 
ens West 17th Avenue, Columbus, Ohio 


greement may reflect systematic rater bias as well as meaningful 
es in the ways in which judgments are made. Implications for com- 


leagues [Barrett, 1966, p. 7].“ While this 
may be true, Barrett feels he can continue 
and offer guidelines which, if followed, tend 
to maximize the potential accuracy of ratings. 
In doing so he places great stress on the 
importance of the organizational position of 
the rater relative to the person being rated, 
for “The position of the rater . . . determines, 
in part, the extent and nature of his oppor- 
tunity to observe, the quality of his judgment 
and the appropriateness of his point of view 
[1966, p. 102].” The positions that he con- 
siders are those of self, superiors, peers (those 
who function on approximately the same or- 
ganizational level as the person rated), sub- 
ordinates (those who are supervised by the 
person rated), and the field review specialist 
(an outsider trained in performance measure- 
ment). And, after reviewing the strengths and 
liabilities of using ratings made by observers 
in these various positions, Barrett concludes 
that for certain purposes the less frequently 
used raters (e.g., peers or subordinates) ap- 
pear to offer great benefits to an appraisal 
program. The focus of the present study is 
on judgments made by persons in the first 
three of the positions listed—supervisors, 
peers, and self. : 
By far, supervisors’ judgments comprise 
the bulk of the ratings generated in industry 
for appraisals (cf. Campbell et al., 1970), 
and it is probably safe to say that most re- 
search on appraisals tends to be directed 
toward them (e.g., Barrett, 1966; Whisler & 
Harper, 1962). However, a growing body of 
literature on the potential usefulness of peer 
ratings exists that commands attention. Re- 
search on peer judgments, starting in the late 
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forties (eg, Hollander, 1954; Weitz, 1958; 

Wherry & Fryer, 1949) and continuing 

through the present (¢.g., Amir, Kovarsky, & 
Sharan, 1970; Gordon & Medland, 1965; 
Mayfield, 1970, 1972) is impressive. Excel- 
lent reliabilities and, when used as predictors, 
good validities have been obtained by nu- 
merous authors in several types of organiza- 
tions (Campbell et al., 1970, p. 112). In 
contrast, self-ratings have a less well-docu- 
mented history and have been used in indus- 
try as an adjunct to other effectiveness data, 
primarily providing a vehicle for more effect- 
ive appraisal interviews (Whisler & Harper, 
1962), Current thinking is that no one posi- 
tion or organization vantage point can provide 
the information necessary to determine a 
person's effectiveness. Thus, recent literature 
Suggests that one should gather ratings from 
several judges, at times from all three posi- 
tions mentioned (Kavanaugh, MacKinney, & 
Wolens, 1971; Lawler, 1967). Indeed, this is 
a strategy in recent construct validity re- 
Search. Yet despite this advocacy it is not 
clear why combinations of various ratings 
should be effective. 

A few authors have offered some plausible 
arguments. For example, it appears that su- 
periors and peers perceive different aspects of 
a person’s behavior. Disagreement in this 
case should not be viewed as a mark of unre- 
liability but instead as a potentially valid 
indicator that differing aspects of a worker's 
actions are being accurately perceived and 
reported (Norman & Goldberg, 1966). The 
personality of the rater (Schneider & Bay- 
roff, 1953) and the rater’s effectiveness on his 
own job (Kirchner & Reisberg, 1962) have 
been shown to differentially affect ratings. 
Guion (1965, pp. 472-473) has suggested 
_ that raters in different positions may, in fact, 
be using different percepts or dimensions in 
__ their evaluations of a worker. Thus, the addi- 
_ tional validity obtained may be due to the 


more complete coverage of a true or “ulti- 


mate” criterion which, by nature, is complex. 
In particular, Guion feels that evaluations 
reflected in peer nominations may be fairly 
well differentiated (factorially complex) and 
= suggests that “If so, peer nominations might 
well be subjected to dimension analysis for 
possibly greater predictive value [ Guion, 
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1965, p. 473].” This is exac 
field (1972) did in his study 
pothesized, only one of the d 
ally predicted Mayfield’s crit 
and production of life insuranc: 
supporting this type of analy: 

In addition to increasing t 
the job behavior domain, 
various ratings should be mor. 
single ratings by allowing the 
systematic error variance due 
sponse tendencies and responsr 
identifiable rater errors includ 
dency, leniency, and faking dis: 
giving variable and inconsistent 
nette, 1966). In Cronbach’s ( 
such temporary or lasting, spe 
can then be classified as rando: 
ever, temporary, general va. 
mains as systematic error. Sy 
variance due to changes in ti 
formance and to the situation 
ered by analyzing multiple rati 
two or more intervals in time. 
temporary, general variance could 
as random error allowing last 
variance to be isolated. Whet! 
ratings are made at more than or 
time or not, it is necessary to st 
tematic rater bias from differences ir 
havioral elements observed by the ra 
their respective vantage points. À 

The present study addresses itself to @ 
issue of dimensional complexity of perfon 
ance appraisal judgments. Specifically, it 
amines ratings made at one point in time b 
supervisors, by peers, and by self-ratings 
an attempt to discover both the nature aM 
the complexity of these judgments. In doit 
so it was felt that the results of such 
analysis could contribute to our understa 
ing and usage of alternatives in building pe 
formance appraisal systems. 


METHOD 


The data were collected as part of a larger SUW 
concerned with the effects of possible moderators | 
the relationship between performance and satisfat 
tion (London & Klimoski, 1973a, 1973b). One hur 
dred and fifty-three registered nurses (RN) fro 
four Columbus, Ohio area hospitals served as * 
jects. No attempt was made to split the sample a 
cording to hospital or functional unit (e.g., medic 


diatrics, obstetrics, etc.) be- 
mber of subjects that would 
All RNs working on the day 
were required to fill out the 
f a research study carried out 
he School of Nursing at the 
In the study 202 RNs par- 
juestionnaires were eliminated 
to lack of self-ratings, super- 
ratings for each subject. 
it the research questionnaire 
in small groups. Head nurses 
onnaires at other times. The 
st to the present study con- 
s of effectiveness. For each 
ked to simultaneously rate and 
hemselves (head nurses had to 
subordinate RNs on the same 
| continuum, The nurses first 
ve RNs in their units next to a 
letter corresponding to each 
riate place on the scales. It 
vethod would encourage greater 
tent and, hence, greater care in 


pd sargical v 
ame of the » 
peat in each c 
f hih in each } 
geriennaires 

ie conjunctior 
Ohio State Ur 
tkipated ; how 
trem the sam; 
vitor ratings, 

The RNs í 


utter and pl 
mame on the 
was hoped th 
attention to sc 
making rating 
The measur 
were 
forms used by 
pied. The 
Bitis (both b 
the range of 
forms. The ch: 
format of the 
different from : 


effectiveness used in this ques- 
ed from performance appraisal 
ferent hospitals including those 
zators tried to select character- 
s and traits) which represented 
eness covered in the appraisal 
istics are listed in Table 2. The 
scales used in the study was 
ppraisal forms used for admin- 


Mirative decisio. dhe hospitals; included were 18 
Fistics © performance plus a measure of 

Sverall effectivencss. Instructions to the subjects 
_ emphasized that iiie questionnaires would be used for 
arch purposes only and that complete anonymity 
Would be maintained in recording the ratings. Al- 
Perfor ratings used for research pur- 


poses are likely be different from those used for 
My raises or promotions (Lawler, 1967; Oberg, 
1972), they may be less likely to be plagued by 
„and therefore underlying dimensions of effec- 
performance will be more evident. Such an 
‘alysis may aid in constructing more valid per- 
formance 


appraisal forms. 
Analyses 


e intercorrelation matrix of 20 variables 
* total score of these ratings, 18 ratings on indi- 
il characteristics, and a single overall rating of 
"veness) for each of the three rater / sources was 
15 This matrix was then submitted to Wher- 
80) hierarchical method of factor analysis. It 
r that the number of subjects (153) is 
than T, relative to the number of variables (60), 
s usually considered desirable for a factor 

a ziad by statisticians. However, since the aim is 
lying cal one of understanding dimensions under- 
Yatiables various ratings in order to determine which 
Might be used for performance measures, 
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multiple R squares are wed.) Factors are extracted 
by the principal-axis method until residual are 


ing corrected factor loadings are subjected to a 
varimax rotation. à 


ij 


on which they have the highest loading. 

are then used to creste a theoretical 
the corrected 

the Thurst 


b 
f 


: 
| 
: 


“il 
i 
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i 
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RESULTS 


A summary multitrait-multirater matrix 
with median intercorrelations for each tri- 


rater- group specific bias differentiates self- 
ratings, supervisor ratings, and peer ratings 
from each other. 

The results of the hierarchical factor analy- 
sis are presented in Table 2. One general and 
five subgenerals were extracted. (In this 
analysis original communality estimates 
ranged from .45 to .85, with a median of .71. 
Residuals ranged from +.149 to —.157.) 
This factor solution was selected by compar- — 
ing a number of hierarchical solutions of the 
60 K 60 intercorrelation matrix in which 
three through seven factors were extracted 
and by examining the results of individual 


Personal communication to the authors from 


Robert J. Wherry, Sr., October 17, 1973. See 
Wherry (1959) for a similar discussion. 


TABLE 1 


LTITRAIT-MULTIRATER MATRIX WITH MEDIAN INTERCORRELAT 


Summary Mt 
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Measure 1 


2 


1. Self-ratings 


2. Supervisor ratings 


3. Peer ratings 


pesa z 
Note, N = 153; (C) = validity diagonals; | "Ss 


monorater triangle. 
2 C. G. 
p 01. 


factor analyses performed on the 20 X 20 
matrices of each rater group. (These data are 
not reported in this study.) In each factor 
analysis of the 60 * 60 matrix, three or- 
thogonal factors representing self- ratings, 
supervisor ratings, and peer ratings emerged. 
Similarly, the separate factor analyses of the 
data of each rater group consistently re- 
sulted in a strong general factor made up of 
rater-group specific bias and interrater bias. 
(These two sources of error, however, are 
inseparable in the individual factor solutions.) 
This general factor accounted for more than 
85% of the common variance leaving very 
low loadings on the subgenerals in all cases. 
At least one subgeneral factor emerged from 
every solution. The six factors extracted from 
the larger analysis combining the self-ratings, 
supervisor ratings, and peer ratings distin- 
guishes between rater-group specific bias and 
interrater bias. It allows a comparison be- 
tween the specific factors which emerged 
from the individual analyses. This indicates 
that factors interpretable in terms of traits or 
behaviors can be separated from rater halo. 

It is evident from Table 2 that there is con- 
siderable halo common to supervisors and 
peers while strong rater specific biases also 
exist. Factors 1, 2, and 3 clearly indicate that 
self, supervisors, and peers each rate per- 
formance from a different perspective. Self- 


ratings account for 96% of the ‘otal varianee 
of Factor 1; supervisor ratings account i 
97% of the total variance of Factor 2; all 
peer ratings account for 99% of the tota 
variance of Factor 3. Factor 1 accounts tol 
23% of the common variance of the fa to 
analysis, while Factors 2 and 3 account 1 
15% and 29%, respectively. 4 


for 11% of the common variance. It 
cates that supervisors and peers overlap 
the way they make their ratings. 

supervisor ratings and 19 peer ratings 


to note that the total scores for the sek 
ratings (& = 267.5, SD = 44.3) were signi 
cantly higher than supervisor ratings (& 
246.7, SD = 60.0, t = 3.44, df = 304, P= 
01) and peer ratings (X = 248.6, SD 
50.5, £ = 3.48, df = 304, p < .01), wheres 
supervisor and peer ratings did not diffe 
significantly. The general factor indicates tha 
the significant differences are not merely dil 
ferences in degree between self-ratings, P% 
ratings and supervisor ratings, but also a 4 
ference in dimensions evaluated by the ratei 
This could be interpreted as indicating that ! 
form of “justifiable halo” exists insofar 3 
two sets of raters responded to the same 8“ 
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of stimuli on the same instrument in similar 


. 15% and 
7% of the common variance, respectively, 
demonstrate that raters made many discrimi- 
nations among the characteristics rated. This 
indicates that systematic variation exists 

“apart from the rater effect. 

Factor 4 has been labeled “competence, 
skill, and ability.” Supervisor ratings of tech- 
nical competence, ability to organize and 
schedule work loads, skill in planning nurs- 
ing care, adaptability in emergencies, de- 
pendability, quality of work, completing 
work on schedule, acceptability of completed 
work, and attendance and promptness all 
have meaningful loadings on this factor. Self- 
ratings load highly on five of these items and 
peer ratings load highly on six items. The 
high loadings of supervisor ratings on five 
additional items indicate that supervisors are 
less able to distinguish between items reflect- 
ing competence and the remaining items than 
axe nurses rated by themselves or their peers. 

Factor 5 has been labeled as “effort con- 
formance, and interpersonal qualities.” Ob- 
Servance of rules and regulations, effort ap- 
plied, and impressing visitors have note- 
worthy loadings on this factor for all raters. 
In addition, self- and supervisor ratings of 
personal appearance, observance of rest and 
lunch periods, and acceptance of responsibil- 
ity for own behavior load highly on this fac- 
tor. The factor also includes self- and peer 
ratings of willingness to perform duties, self- 
tatings of amount of work performed, and 
Supervisor ratings of communication skills. 


Discussion 

The results of the factor analysis confirm 
the evidence cited in the introduction that 
‘self-rater, supervisor rater, and peer rater bias 
exists. This bias is due not only to systematic 
halo error which may be common to all raters 
but to rater-group specific bias. The factors 
which can be interpreted in terms of traits 
are weak compared to the general and rater- 
group specific factors, Yet it seems, as Guion 
(1965, p. 473) suggested, that raters in dif- 
ferent positions use different dimensions. For 
example, supervisors may be less able to dis- 
criminate between items related to compe- 
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tence from those related to , whereas 
nurses rating themselves and can make 
this distinction. 

Of course, it is possible th factorial 
results do not completely refl re extent 


of the discriminations made.“ 
ties of the ratings were not de 


reliabili- 
red, it is 


impossible to ascertain the uni s of each 
variable. Also, it should be n that the 
items on which the subjects w ated and 
the procedure used to make th ings were 
not developed with the intent o° maximizing 
their relevance or reliability. No ystematic 
attempt was made to sample otal per- 
formance domain and develo; des that 
yielded high reliability. This have re- 
sulted in decreasing systematic ance at- 
tributable to differences in ra and in- 
creased the systematic error var attribu- 
table to raters. 

The factor-analytic approach iltitrait- 
multirater analysis demonstrate: he pres- 
ent study may help the researc! explain 
the factors which are associate: h being 
evaluated differently by different rs, One 
technique for doing this has been © «ing Ghi- 
selli’s (1963) method of examini:.: the off- 
quadrants in the relationship be en the 
different raters’ evaluations (e.g., „ rating 
by supervisor, low rating by peers; low rating 


by supervisor, high rating by peers: and the 
same rating given by supervisor and peers) 
and searching for moderator variables (Law. 
ler, 1967). However, the factor analysis of 
multitrait-multirater data can aid in this 
process by uncovering dimensions upon which 
ratings are made and using these dimensions 
in the quadrant analysis. For example, high 
and low supervisor ratings of effort variables 
may be compared to high and low peer rat- 
ings of skill variables on the basis of such 
moderators as age, tenure, need for achieve- 
ment, etc. This would result in an even 
greater understanding of the basis upon which 
ratings are made. 

Since rater bias may be expected, any sub- 
jective evaluation of performance should be 
submitted to dimensional analysis. The hier- 
archical method allows dimensions which are 


The authors are grateful to a reviewer for mak- 
ing this point. 


t of this bias to be uncovered. 
er, it may be that different factor 
may be predictive of different criteria. 
his approach should be helpful in assess- 
pnt center evaluations or periodic perform- 
‘appraisals used to predict rate of ad- 
future salary increases, or future 
ctive evaluations for selection or pro- 
purposes. Similarly where ratings are 
as criteria, data irom the incumbent 
sor and peers also should be obtained 
Mi analyzed. (Subordinate ratings, although 
analyzed in this study, may also be use- 
Thus, ratings of the same traits from 
points of view may be predictive of 
Criteria, or themselves be differen- 
predictive. Which judgments are ob- 
d should depend on their intended use. 
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Realistic information about a situation into which new organizational men 
were about to enter was presented to the new members after they had de: 
to join the organization, but before they reported to the organization. W 
compared to a control group which received no such information, the e 
b mental group showed significantly lower turnover. The results are d 
in reference to the functions served by realistic job information available 
new members after they have accepted a position with an organization. 


Any organization becomes concerned when 
apparently committed and carefully selected 
members voluntarily resign. Typically, at- 
tempts are made to improve selection prac- 
tices and/or to alter the job environment in 
order to reduce turnover. In a few cases, 
attempts have been made to alter the beliefs 
and expectations of prospective organiza- 
tional members in order to improve the fit 
between the individuals’ beliefs and the or- 
ganizational environment. Work with insur- 
ance agents (Weitz, 1956) and entering cadets 
at the United States Military Academy 
(Macedonia, 1969) has demonstrated that 
applicants, who received realistic information 
in booklet form about the organization they 
were considering, more frequently decided to 
join the organization and less frequently re- 
signed voluntarily. Katzell (1968) also found 
realistic expectations about the situation re- 
duced turnover among nursing students. 

Three explanations seem plausible for the 
reduction in turnover when applicants receive 
candid information prior to their decision to 


The research reported in this article was con- 
ducted while both authors were members of the 
Office of Military Psychology and Leadership at the 
United States Military Academy, West Point, New 
York. While the support of the Academy was 
greatly appreciated, the ideas expressed in this 
article are solely the opinions of the authors and 
are not to be considered as the position of the 
Academy or the U.S. Army. 

The authors gratefully acknowledge Robert D. 
Pritchard for his comments on an earlier draft of 
this article. 

Requests for reprints should be sent to Daniel R. 
Igen, Department of Psychological Sciences, Purdue 
University, West Lafayette, Indiana 47907. 


join the organization.“ First, | upon the 
information in the booklet, son mbers of 
the applicant sample may deci to join 
the organization. Although som« hose who 
reject the organization at this ; may have 
remained on the job and pe zed quite 
well, many who reject the or: tion may 
realize that they would eventu: ind their 
position unacceptable and re Conse- 
quently, the sample that joined uld con- 
tain those individuals who were re likely 
to remain with the organization 

Second, the applicant’s pos: n of ac- 


curate information prior to making a decision 
to accept or reject the position may lead to 
a greater felt responsibility for the decision, 4 


greater commitment to the decision, than 7 
the decision were made on the basis 0 
faulty or incomplete information. The increase 


in commitment to the decision could be re- 
flected in a decrease in the probability of resig- 
nation as a possible means of coping with 
unpleasant aspects of the position. The in- 
dividual had been forewarned of these un 
pleasant factors prior to his decision to Jom 
the organization. 

A third possible reason for the turnover 
reduction is that knowledge of what to ex- 


3 The insurance agents in Weitz’s study received 
the candid information in booklet form when ba 
applied for the job. Macedonia sent out a bookle 
to all cadets who had been accepted for admissions 
two months before the cadets were to report to 
Academy. Therefore, Macedonia’s sample cone 
both those who had already decided to join (or 115 
to join) the organization and those who were in Fe 
process of making a decision, There was no way 
identify these two groups. 
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Rear 

pect from t! iation may better prepare 
T individuals t œ with it, and thus reduce 
frustration. s is the case, individuals 
with realistic mation about the situation 
should be b. able to cope and less likely 
to leave thc ition than those who have 
unrealistica] tive information about it. 
The pres: mn of realistic information 
prior to the icant’s decision to join the 
organization ounds the effects of self- 
selection, ini d commitment, and aid in 


coping. An; ll three of these processes 


may be ope z to reduce turnover. How- 
ever, informe un presented after the appli- 
cant makes i> decision to join the organiza- 
tion but befo he reports to it should elimi- 
nate the first interpretations. The present 
study inves’ d this situation. It was hy- 
pothesized t ew members would be better 
prepared t e with the difficult setting 
when they | what to expect, even if they 
were told to expect after they had 
decided to the organization. Since vol- 
untary resig us were used as an indica- 
tion of abili cope with the situation, it 
was hypothe that fewer individuals would 
resign from group which received this 
information. pport for this hypothesis 
Would not im:iy that only aid in coping 


Operates w 
Presented, but 
in coping is 
the absence of 
commitment io 


predecision information is 
t would demonstrate that aid 
icient to reduce turnover in 
self-selection and increased 
the decision. 


METHOD 
Settin g 


The research was conducted with new cadets 
pane the United States Military Academy in the 
ummer of 1971. New cadets reported on July 1, and 
“Pent approximately two months prior to the be- 
Se of the academic year in an intensive train- 
T period. This training was designed to familiarize 
t 5 a cadets with the Academy and with some of 
eae military disciplines. The period 
tad 105 a quick adjustment on the part of the new 

et and often was very stressful, both psychologi- 


j cally and physically. 

3 Procedure 
the booklet was prepared that candidly described 
dail; Wo-month summer-training period and the 


i : ; 

E during the academic year. This booklet 

(1969) revision of a booklet prepared by Macedonia 
Material included was based upon inter- 
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views and critical incidents solicited from cadets 
and officers closely associated with the summer 
program and the day-to-day conditions during the 
academic year. Care was taken to include mundane — 
and also stressful activities that are often omitted 
from booklets primarily aimed at attracting indi- 
viduals to the organization. Drafts of the booklet 
then were resubmitted to other cadets and officers 
for ratings of the accuracy and relevance of the 
material. 

The booklet was distributed, by mail during the 
first week of Junc, to a random sample equal to 
one fifth of those individuals who had stated that 
they would accept their appointments. Of the 246 
booklets distributed, one did not reach the cadet 
due to a change of address, and 11 individuals did 
not report on the first of July. This left a sample 
of 234 new cadets who received the booklet. From 
the remaining sample of new cadets who reported 
on the first of July but who had not received the 
booklet, a control group (n= 234) was randomly 
selected. 


RESULTS 


Voluntary resignations over the two-month 
summer-training period were recorded for 
both the experimental group (the group that 
received the booklet) and the control group. 
Since some resignations which may have re- 
sulted from the summer experience were not 
processed until a few days after the end of 
the summer-training session, all resignations 
before the first day of classes in the fall were 
considered. During this time period, 27 re- 
signed from the control group and 14 resigned 
from the experimental group. This difference 
was statistically significant (x? = 4.52, df = 
1, p<.05) and supported the hypothesis 
that candid information presented after the 
decision to participate but before entering 
the organization reduced the probability of 
voluntary resignation. 


DISCUSSION 


Although the data presented supported the 
hypothesis, the strength of support is tem- 
pered by the appropriateness of the control 
group. The most appropriate control group 
for this design would be a group equal in size 
to the experimental group and randomly se- 
Jected from the applicants who had accepted 
their appointment by June first. With a con- 
trol group selected at that time it would have 
been possible to compare the number of those 
jn each group who had accepted their ap- 
pointments but did not report on July 1. 


the 2 

However, only the latter possibility would be 
to the validity of the interpretation 

f the number of those who did 


l 


trol group, the alternative interpretation of 
self-selection is possible. One could argue that 
those who did not report after accepting their 
appointments still had some doubts, and the 
booklet convinced them not to come. The 
“doubters” were still in the control on July 1 
but subsequently resigned, thus creating the 
Significant difference reported. 
A consideration of the Macedonia (1969) 
data and one additional analysis led the au- 
thors to conclude that the possibility of a 
significantly smaller number of the applicants 
who did not report in the control group than 
in the experimental group was highly un- 
likely. Macedonia (1969) found that a sig- 
nificantly greater number of cadets reported 
the first of July from the group that received 
a similar booklet than from the group that 
did not receive the booklet. His sample in- 
cluded both those who had and those who had 
not accepted their appointments. In his sam- 
ple, it was not possible to separate those who 
had accepted their appointments from those 
who had not. However, it is highly unlikely 
chat the significant difference was due pri- 
_ marily to those who had accepted their ap- 
_ pointments. In a similar vein but with a less 
_ comparable sample, Weitz (1956) found that 
it was easier to fill insurance agent positions 
When candid information was presented to 
applicants than when it was not. 
An additional set of analyses applied suc- 
_ cessive chi-squares to discover the critical 
number of persons necessary in order to have 
number not reporting in the experimental 
T significantly greater than that of the 
control. Of the 245 who were sent booklets, 
II did not report July 1. Assuming an equal- 
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sized control group had been n 245), 
in order to obtain a signific ference at 
the p< 05 level between mber not 
reporting to the control group he Ll pot 
reporting in the experiment: up, only 
three or less of those in the ol group 
would have had not to report three of 
245 over a one-month period d highly 
unlikely in light of the norma tion prior 
to the July 1 reporting date 1 light of 
the Macedonia data. Therefo authors 
were willing to accept the inte: tion pre 
sented as the most plausible on 

Finally, the question of the. to which 
these results may generalize t settings 
should be addressed. Most new ets prior 
to their first day at the Academ t that the 
summer training was going to ery dif- 
cult, although they had many iceptions 
about specific aspects of the ng. The 
information in the booklet spe! t specific 
factors of which they may n ve been 
aware, but this information did ntradict 
their previous general impressio he train- 
ing. We would hypothesize tha general- 
izability of this study may be enced by 
the degree to which candid inform. ion about 
the job fits the general impres» of the 
newly accepted applicants. Pera if the 
is a large discrepancy between tho informa- 
tion provided after the decision to join the 


organization and what the applican: was 

to believe when he accepted the position, the 
new employee may feel he has been deceived 
and candid information may not reduce vol 
untary resignations. 


CONCLUSION 


The effect of realistic expectations, estab- 
lished after the decision to join the organiza 
tion had been made, demonstrated that rè- 
duced resignations need not be due to M- 
creased commitment to the decision or t° 
self-selection by the applicants. Self-selectio® 
has been the concern of those whose prima 
interest is recruiting. Recruiters believe that 
potentially valuable organization member 
who would remain with the organization may 
be turned away by the initial information? 
that information does not present a vet 
positive picture. Although both Weitz (1956) 
and Macedonia (1969) showed that it Was 
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new members if the new 

! candid information, and 
i no significant differences on 
red for acceptance (eg., high 
olastic Aptitude Test scores, 
hose who declined their ap- 
hose who accepted their ap- 
nizations often are reluctant 

ribe positions to applicants. 
ta from the present study in- 
ven if one is reluctant to de- 
tion completely prior to the 
sion to join the organization, 
in benefit from this informa- 


in their adjustment to the job should 
even before they report to work. 


Welte, J. Job — and survival. Journal of 
Applied Prychology, 1986, 40, 248-247. 
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PROBABILITY AND UTILITY ASSUMPTIONS UND! ING 
USE OF THE STRONG VOCATIONAL INTEREST | K 


FRANK L. SCHMIDT ?* 
Michigan State University 


Decision implications of Strong's assumption of equal base rates for his 
in-general (MIG) and individual occupational groups are explored. | 
the Strong Vocational Interest Blank (SVIB) Physician’s scale as an exan 
it is demonstrated that when realistic base rates are assumed, total de< 
error rate using the SVIB—even when optimal cutting scores are employ: 
is at best about equal to that resulting from prediction using base rates al 
It is shown that use of the SVIB as recommended by Strong impli: 
assumes that false negative errors are much more serious than are false po 
tives. Finally, it is shown that when “reasonable” utility or gain matrices : 
assumed, use of the SVIB can reduce total error rate despite the limitati 
imposed by low occupational base rates, and that this effect is greater wh 
statistically optimal cutting scores, rather than those recommended by Stron 


are used. 


In many practical decision-making tasks 
faced by psychologists, scores on some con- 
tinuous test or variable are used to make 
dichotomous decisions. In a study that is now 
“a classic, Meehl and Rosen (1955) demon- 
strated that cutting scores used in such deci- 
sions must take into account population base 
rates, that is, the relative frequency of the 
category or group of interest. They showed 
that a device which usefully reduces decision 
or classification errors when the base rate of 
the dichotomous phenomenon of interest (e. g., 
“schizophrenic versus normal or shoplifter 
versus honest shopper) is near .50 may be 
useless—or may even lead to increased er- 
rors—when the base rate is very high or very 
low. They presented conditions that must be 
met for the test to achieve a greater degree of 
accuracy than would result from use of the 
base rates alone. Dawes (1962) later pro- 
vided a more easily understood analysis in 
terms of conditional probabilities. Rorer, 
Hoffman, LaForge, and Hsieh (1966) pre- 
sented an analytic (as opposed to empirical) 
solution to the determination of optimal cut- 
ting scores, and Rorer, Hoffman, and Hsieh 
(1964) provided extensive tables of optimum 
cutting scores for groups differing both in 


i 


F Requests for reprints should be sent to Frank 


L. Schmidt, Department of Psychology, Michigan 
State University, East Lansing, Michigan 48824. 
The author is grateful to John Hunter and Robert 
Forsythe for helpful comments on an earlier draft of 
this article. 


base rate and variance. These 3 assume 
that all errors are equally cost! 1 all cor- 
rect classifications are equally eficial; a 
subsequent study (Rorer, Hoff * Hsieh, 
1966) presented a method o ermining 
optimal cutting scores when positives 
and negatives are unequal in and true 
positives and negatives are une in bene- 
fit. 

The decision theory principles splored in 
these studies can be applied to an evaluation 
of the Strong Vocational Interest Blank 


(SVIB)—perhaps the most commonly used 
instrument in occupational counseling. The 
purpose of the present study is to show that 
application of these principles leads to con- 
clusions about the usefulness of the SVIB 
that are quite different from those reached 
by Strong (1955), Strong and Campbell 
(1966), and Campbell (1968, 1971). The 
following analysis applies equally to the 1966 
revision and to the earlier form. Similarly, 
while it is in terms of the men’s form, the 
analysis generalizes easily in all its essentials 
to the women’s form of the SVIB. 


Impact or Base RATES oN EXxPECTANCIES 
AND ON TOTAL Error RATES 

Meehl and Rosen (1955) and Dawes 
(1962) have pointed out that criterion vali- 
dation all too often results in a statement o 
test validity in terms of P(R;/G:), that E 
the probability of Response j for subjects in 
Group i (eg., the probability that schizo- 
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above 70 on a test). The 
predictive efficiency, how- 

the probability of group 
n the response (e.g., the 
e respondent is a schizo- 
he scores above 70). These 
ire related by means of 


P(R/GIP(G)) 
P(R;) 


cessfully avoided this com- 
sented validity evidence for 
form P(G,/R,), where G; is 
question and Ry is a letter 
IB scale for the occupation 
immarizing the results of his 
study of 884 Stanford Uni- 
Strong (1955, p. 43) pre- 
jilities (expectancies) shown 
erages across 16 occupations. 
f being in a given occupa- 
one obtained an A—, for 
scale for that occupation 18 
74. These probabilities are 
and this summary is widely 
ence of the SVIB’s effective- 
stasi, 1968, p. 472; Super & 
40), But, as pointed out by 
uch evidence is both er- 
| highly misleading. 
ale is designed to discriminate 
een that occupation and the 
men. in-genera! (MIG) group, a large hetero- 
genous group of men in various professional 
paraprofessional occupations. Because 
base rates in Strong’s (1955, pp. 40-41) MIG 


Latter Rating Score | 


As 55-70 88 
A= 45-54 74 

B+ 40-44 62 

35239 49 

8 30-34 36 

C Below 30 7 


O 10 20 30 40 50 60 70 80 90 100 
Chances in oo of Employment in Occupation 


1 1. Chances in 100 of employment in oc- 
Pation given specific letter grade on the Strong 


os tional Interest Blank 18 years earlier (Strong, 
955, p. 43). 
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O 10 20 30 


40 8 8 70 60 


Faun 2. Distributions of the men-in-general and 
specific occupation groups under the assumption of — 
equal base rates. 4 


group were different for different occupations — 
and because Strong felt that within occupa- 
tions, these base rates would vary as between 
universities and geographical regions, he de- 
cided to express expectancy ratios for all 
scales “in terms of samples of equal size.“ 
The implications of this procedure can be 
seen by comparing Figures 2 and 3.“ In com- 
paring each occupation with the MIG group, 
Strong assumed the situation shown in Figure 
2, that is, an occupational group equal in size 
to the MIG group. This assumption is obvi- — 
ously highly unrealistic; none of the occupa- — 
tions scaled on the SVIB even approximates — 
the relative frequency of all other occupa- 
tions combined—and this holds true whether — 
we consider base rates in Strong’s MIG 
group or in the United States male employed 
professional-technical population. Figure 3 
illustrates a more reasonable assumption; it 
depicts the base rate for the occupation as 
being much smaller than that of the non- 
occupation, or MIG group. Operating on the 
basis of this assumption leads one to esti- 
mates of P(G,/R,) much smaller than those 
shown in Figure 1, with the degree of reduc- 
tion in size depending on the estimate chosen 

of the base rate of the occupation. As an 
example, consider the physician’s scale. Phy- 
sicians constituted 6% of the original MIG 
group (Strong, 1955, p. 68); they make up 
5.1% of the United States employed male 


professional population (U.S. Bureau of the 4 
Census, 1963). When considered in relation 


2 Scaling of the abscissa in Figures 2 and 3 ap- k 
proximates that of the typical SVIB scale de, 
Fura = 30, Xoceup. +50, and ai =e 


pitted the oe 


0). 

5 census figures; at the time of this writing, 
national 1970 census breakdowns by occupational 
group were not yet available. But it is unlikely that 
the 1970 base rates will be significantly different 


from 1960 figures, 


Strong, 1955, p. 37) and the standard devi- 

10 for the MIG and physi- 
groups, respectively (Strong, 1955, p. 
68), application of Bayes’ theorem‘ yields 
the expectancies shown in Table 1 for the 
base rates of . 50 and .20, and the rounded 
base rates of .05 and .02. (The base rate .060 


being a physician 18 years later. However, 
these expectancies drop off sharply when more 
realistic base rates are entered into the equa- 


using normal curve tables. P(G;) is, of 
assumed physician's base rate. P(R;) = 
FN - P(G,) plus the probability of the re- 
the MIG group times the assumed base 
MIG group, that is, 1—P(G,). These 
those for Table 2, assume normal- 
th there are some apparent departures from 
in some occupational scales, they are not 
affect the figures in 
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TABLE 1 


NOR ror tux Srzoxo Voca 
Biaxk Paysictax’s Scars Us 
Base Kare Assumes 


Letter 


10 Score 

| 0 N 
At 55-70 92 ‘ 
A- 18 84 80 0 
B+ ou 00 28 
B 35-39 4l 
B- K | 0 
C Below 30 OF 01 


Nate. Ementially, PiGY/R)) is the pr 

LN iven that be obtained 
* years earlier, 

tion. When the base rate is ta} 
student scoring A— has even cl 
becoming a physician. When .0 
the .02 which is perhaps mor: 
most situations, is employed as 
none of the expectancies is as | 
No matter how high the studen: 
probability is always much highe: 
become a nonphysician than a ph 
problem is not peculiar to the ox 
physician, In fact, most of the 
on the SVIB that could be found 
sus figures showed smaller base rai 
physician category. 


Another approach to this probi: 


an examination of the total error 


percent of all decisions that are in 


TABLE 2 


NTOREaE 
ta 


res, the 
he will 
in, This 
tion of 
ipations 
he cen- 
han the 


ı is via 


rate—the 
error—as 


PERCENTAGE OF ALL DECISIONS INCORRECT FOR 
STRONG VOCATIONAL INTEREST BLANK 


PHYSICIAN’S SCALE as FUNCTION 
CUTTING SCORE AND Base RATE 


Cutting score j 
50 | 20 | 05 
45(A-) | 21.7 | 162 | 134 
40 (B+) 19.0 20.8 21.2 
40.135 18.5 = — 
40.5 — 15.7 — 
65.39° = — | 4.9 


* Optimal cutting score for base rate = .50. 
> Optimal cutting score for base rate = .20. 
5 cutting score for base rate = .05. 


4 For base rate = .02, there is no cutting score that result# 
in fewer errors than prediction using base rate alone. 


ing score and assumed base 

5, p. 54) has advocated 
iously consider” an occupa- 

y «ore B+ or above, thus 

cs to tentatively classify 
such an occupation rather 

G group. The more stringent 

\— is also sometimes used. 
statistically optimal cutting 
ermined for each assumed 

2 presents total error rates 
scores as a function of four 

en a base rate of .50 is as- 
recommended cutting score 

to optimal; in addition, all 
educe the error rate consid- 
at resulting from use of the 
(509%). When the base rate 
\— is close to optimal as a 
while the B+ suggested by 
an error rate above the 20% 
When the base rate is taken 
‘ter grade cutting scores pro- 
considerably above the 5% 
result from use of the base 
n the optimal cutting score 
ce error by only .19%. When 
che base rate, no cutting score 
which produces fewer errors 
er rate prediction that no one 
« physician (29% error rate). 
willing to assume a .05 base 
future physicians in most counselee 
, the reduction of .1% in total 
rror y makes use of the SVIB appear 
Ørthwhile. Clearly, if use of the Strong can 
be justified, justification must be on the basis 
inequality (i.e., differential weighting) of 


SSification errors. 


the physician's scale, X =50, SD=10; for 
IG distribution, X 30, SD=13 (Strong, 
P. 68). For each cutting score, total error rate 
nt of false positives in the MIG group (de- 
ed from normal curve tables) times the as- 
MIG base rate plus the percent of false nega- 
in the physician's group (also from normal 
tables) times the assumed physician base rate. 

Cutting scores for all base rates except .20 
Te calculated by interpolation using the Rorer, 
Aman, and Hsieh (1964) tables. Since the base 
f .20 is not contained in these tables, this 
Bimal cutting score was calculated using Equation 
e Rorer, Hoffman, LaForge, and Hsieh (1966). 
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of the SVIB, etc., seem to indicate that true 
positives are of greater value than true nega- 
tives. Nevertheless, it might be argued that 
there are three sets of assumptions about true 
positives and negatives that are plausible. 
Assumption 1 states that gain in utility from 
a true positive (G.,) and gain from a true 
negative (G-) are both zero; true positives 
and negatives merely function (and function 
equally) to avoid losses in utility that result 
from false positives and negatives. Assump- 
tion 2 holds that true positives and negatives 
produce equal positive gains. Assumption 3 
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assumes that both gains in utility are posi- 
tive, but that the value of a true positive 
exceeds that of a true negative. These three 
assumptions can be expressed algebraically 
as follows: 

1. Gy =G 0 

2 0 

0 


Sets of assumptions other than those three 
(e.g., G., < G_ and G,, = G- < 0) can be 
eliminated as implausible. The decision prin- 
ciples presented by Rorer, Hoffman, and 
Hsieh (1966) can now be used to shed light 
on the extent of inequality that is implicitly 
being assumed between false positive and 
false negative errors under each of these 
three sets of assumptions when the SVIB is 
used in counseling as recommended by Strong 
(1955), Strong and Campbell (1966), and 
Campbell (1968, 1971). That is, we can re- 
cover some of the properties of the gain or 
utility matrix implicitly assumed by Strong 
(1955). 

Given a gain or utility matrix such as that 
shown in Table 3, Rorer, Hoffman, and 
Hsieh (1966) showed that 

FCC 
E 


_ where B = the true base rate for the occupa- 
tional group and B' = the base rate adjusted 
for the inequalities in the gain or utility 
matrix. B’ is the base rate needed to enter the 
table of optimal cutting scores, given a par- 
ticular B and utility or gain matrix. For 
convenience let 


Grr- Gy 
8 
TABLE 3 


GAINS MATRIX FOR AN ĪNDIVIDUAL OCCUPATIONAL 
SCALE ON THE STRONG VOCATIONAL 
INTEREST BLANK 


G. 


Predicted 
Actual 
Men-in- = 
3 general (-) | Occupation (+) 
=- Men-n-general (—) GIS G- 
Occupation (+) G4- Gag 


Note, G = gain, 


ScHMIDT 
Then, solving for G in Eq 1 gives 
1 
In computing all expectan ios on the 
basis of equal base rates, Sti as in effect 
assuming B! = .50. Using t lue for B. 
Equation 2 yields G = 4 wl - 20, G= 
19 when B = .05, and G when B is 
taken as .02. Then under A tion 1 that 
5 
G. 20 = a 7 
(ea 
G.o5 2 Ga) = 
and 
G4- 
gS cuss =a 
where 20, 05, and .02 refe base rates. 
Thus, if true positives and ne; s are con- 
sidered to each represent zero , then use 
of the SVIB physician's scale nes that 4 
false negative is 4, 19, and 49 s as harm- 
ful as a false positive error if true base 
rate is taken as .20, .05, and .G2, -«spectively: 4 
(The reader should recall that h G,- and 
G are negative numbers.) 
If we assume G. = G. > 0, thon when the 


base rate is .20, 
G — Gy Dor 
S 


Substituting G, for G__ and solving, we get 


6.02 = 


Since G_, is a negative number (a negative 
gain“), as G,, and G- increase a false 
negative comes to be considered more than 
four times worse than a false positive. Simi- 
larly, when the base rate is taken as .05, 


5 
G+ 5 

And, finally, when the base rate is O2, we get 
Gi- 40 8027 
8 


icrease, false negatives be- 
relative to false positives. 
er true positives and nega- 
ive and equal value, use of 
n’s scale as advocated by 
umes that false negatives 
times as serious as false 
base rate is taken as .20. 


As G,, and G 
come more cc 
Thus, if we cí 
tives to have 
the SVIB ph 
Strong (1955 
are more tha 
positives whe 


When the b: ue is taken as .05, use of 
the SVIB as ‘s that false negatives are 
more than 19 s worse than false positives. 
When the b: te of 02 is used—the most 
realistic of t! se base rates for most high 
school and e counselee populations 


the implicit 
counselee int: 


ption is that classifying a 
MIG group when he will in 


fact enter ti -cupation is more than 49 
times as seri s the opposite error. 
Finally, if sume (Assumption 3) that 
Ga G.. > hen, when the base rate is 
taken as .20 
G Cre = Ges =4. 
G 
Solving for G_‘G_., we get 
G- | Gi AC 
G y+ 
< Since G, is tive number, G / G will 


be less than 


aly when G,, >4 G. For 
all other pe 


ible values of G,,, a false 
hegative will considered more than four 
times as serious as a false positive. When the 
base rate is taken as 05, 


Here the value of a true positive (G,,) must 
be considered to be more than 19 times the 
value of a true negative (G.) for the cost of 
à false negative (G) to be considered less 
than 19 times as great as a false positive 
(G-). When the base rate is 02, 


Gre G — 49G 
~~ = 49 4 “th 8 
6— ie CR 


Th this, the most realistic of the base 
a the cost of a false negative (G,-) is 
1 5 ; ered greater than 49 times the cost of 

alse positive (G_,) unless the value of a 
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true positive (G.,) is considered to be equal 
to or greater than 49 times the value of a 
true negative (G__). 

On logical and rational grounds, both As- 
sumption 3 and a base rate of .02 appear the 
most realistic for most situations in which 
the SVIB is used. On the same grounds, it 
would appear unlikely that true positives are 
in fact considered more than 49 times as 
useful as true negatives. The conclusion indi- 
cated, then, is that use of the SVIB physi- 
cian’s scale as recommended by Strong (1955, 
p. 54) implicitly assumes that false negative 
errors are approximately 49 times as harmful 
as false positive errors.“ 


Accuracy GIVEN A “REASONABLE” GAINS 
MATRIX 


Strong’s (1955) assumption that the ad- 
justed base rate (B’) is .50 leads backward 
to implicit gains matrices with G = 4, 19, and 
49 when the true base rate (B) is taken as 
.20, .05, and .02, respectively. Ordinarily, 
of course, the process is reversed: The four 
values in the gains matrix are specified, and 
these values, along with B, are entered into 
Equation 1 to solve for B’. B is then used to 
enter the Rorer, Hoffman, and Hsieh (1964) 
tables to find the optimum cutting score and 
the adjusted total error rate. An important 
question then, is whether, given gains matrices 
containing “reasonable” values, use of the 
SVIB leads to an adjusted total error rate 
lower than B’, the error rate obtaining when 
prediction is on the basis of B’ alone. The 


6Some counselees may expect the SVIB to indi- 
cate not the specific occupation that would be opti- 
mal for them but rather a cluster of family of occu- 
pations, most of which would in practice be suitable 
from the viewpoint of interests. When the SVIB is 
used this way, the appropriate unit of analysis 
would be the 11 occupational families on the SVIB. 
With this kind of analysis, the base-rate problem 
would be reduced (few families would have base 
rates as low as .02), and the number of decision 
errors, especially false positives, would be reduced. 
While this may be a realistic model for the use of 
the SVIB for many counselees, especially in view of 
the high occupational scale intercorrelations within 
most occupational families, it does not seem to be 
that of Strong (1955). Strong’s focus is on the indi- 
vidual occupations. Nevertheless, a subsequent study 
is planned using the occupational families as the 
unit of analysis. 


Y 
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TABLE 4 


A REASONABLE” Gaiss MATRIX FOR INDIVIDUAL 
Sreowe VOCATIONAL INTEREST BLANK SCALES 


= = = = 


Predicted 


<— Men-in | 
general (—) Eeyan (+) 
ri 
Men.in-general (—) +1 —1 
Occupation (+) 10 +10 


gains matrix shown in Table 4 might be rea- 
sonable for many counselees. Table 4 as- 
sumes that correct identification of the indi- 
vidual’s future occupation (Cell ++) is 10 
times as useful as the correct identification 
of the occupation as inappropriate for the 
individual (Cell ——). Similarly, incorrect 
rejection of the occupation as inappropriate 
(Cell +—) is 10 times as harmful as incor- 
rectly labeling the occupation as appropriate 
(Cell —+).* In an individual decision-mak- 
ing process of this sort, actual gains matrices 
would, of course, vary from counselee to coun- 
selee, depending on individual differences in 
value systems (Cronbach & Gleser, 1965, pp. 
7-9). For other counselees, a “reasonable” 
gains matrix might be produced by substitut- 
ing +20 for the +10 in cell G,, of Table 4 
and a —20 for the —10 in cell G., Such 
counselees would consider false negatives 
more costly and true positives of greater value 
than individuals characterized by the gains 
matrix in Table 4. This we refer to as the 
second or alternative “reasonable” gains 

For both these “reasonable” gains matrices, 
Table 5 shows for each of three base rates: 
(a) B’ (from Equation 2), (b) total adjusted 
error rate, and (c) percent reduction in error 

using (a) optimal cutting scores (from Equa- 
* These statements, and the analyses of the previ- 
ous section, assume measurement of the utilities on a 
ratio scale. However, as pointed out later, empirical 
estimation of the utility of the SVIB for a given 
individual requires measurement of the four utilities 
in his gains matrix only on an interval scale. As 
can be seen in Equation 1, this follows from the 


fact that it is the ratio of differences rather than 
the direct ratio of utilities that is used in computing 


2 Both total adjusted error and percent reduction 
in error are, of course, averages that would be ex- 
‘pected over a large number of individuals with iden- 


* 
x 
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tion 1 of Rorer, Hoffman ch, 1966) 


and (5) Strong’s (1955, | suggested 
cutting score of 40 (B+) the base 
rate is taken as .20, use of t! IB reduces 
total adjusted error rate be evels that 
would result from predictior B“ only, 
but these gains are consider: rger when 
optimum cutting scores, rath a Strong's 
suggested score, are employe difference 
is especially large for the a! tive gains 
matrix. In this instance the r n in total 
error that would result using s cutting 
score is so small (.5%) as t negligible. 

When the base rate is taken „ the cut- 
ting score advocated by Stron s to a sig- 
nificant reduction in total er: r the first 
gains matrix and is only marg less eff- 
cient than the optimal cutting For those 
counselees whose values cor d to the 
alternative gains matrix, S cutting 
score is essentially identical optimal 
cutting score at this base rate. ar this set 
of circumstances, procedure mmended 
by Strong (1955, p. 54) for se of the 


SVIB are ideal. 
The optimal cutting score, ` 
works out to 51 for the first gai: 


B = 02, 
vatrix and 


produces a total adjusted error of 13.2% 
(versus the 17% expected wh: predicting 
using adjusted base rate alone). is consti- 
tutes a 22.4% reduction in error. However, 
when the cutting score of 40 * ommended 
by Strong is used instead of the optimum 
cutting score of 51, total adjusted error rate 
is 21.0%—a greater error rate than that 
which obtains when predictions are made 


tical gains matrices. The model used in this study— 
which is general statistical decision theory with dif- 
ferential utilities—was developed for situations u 
which many decisions are made under similar Of 
cumstances. In the case of individual decisions, each 
individual’s gains matrix is potentially unique 
(though it is likely in practice that some in 5 
viduals will have essentially identical matrices): 
When applied as above, the model implicitly * 
sumes, for each gains matrix, a hypothetical pop 
lation of individuals all with the same gains mattis 
it then gives the average error that would 1511 
over this population. For any individual with th 
gains matrix, this is the best estimate of the value 
of the SVIB for him. This estimate must be base 
on a model which assumes a number of similar a 
sions; each individual's decision is a unique 113 
which never recurs and thus cannot be modeled i 
is sense. 
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TABLE 5 
AL ADJUSTED ERROR Rate, AND Pxacext Repucrion ox Error rox Two 
“REASONABLE” Garxs Martes at Turce Base Rares (B) 
Optimum cutting score Strong's (1955) cutting score 
$ Total | æ A x Total | 
‘ te adjusted | reduction Cutting adjusted | Je reduction 
error rate f on or | score error rate Í in arer, 
“Reasonable” gains matrix in Table 4 (G = 10) 
— — — — 7 — 
34.4 150 483 40 = aot 39.0 
44.0 18.6 45.3 40 20.2 | 40.5 
51.0 | 13.2 22.4 40 21.0 | -B5 
1 
Alternative “reasonable” gains matrix (G = 20) 
j] m: 1 g; 7 J; = 
| “30.8. e 38.8 40 16.9 5 
40.4 į 18.9 614 40 18.9 61.4 
45.7 | 17.7 39.0 40 20.3 30.0 
Aae B’ > 50, error rate for base-rate prediction is 1 ~ B’, ers r in 5 
the adjusted base rate alone. On the CONCLUSION 


other hand, when the alternative gains matrix 
$ assumed, use of Strong’s cutting score leads 
0 a useful reduction in total decision error, 
gh not as great a reduction as is pro- 
duced by the optimal cutting score. Of the 
base rates examined in the present con- 
02 is probably the most realistic for 
of the high school and college counselee 
ations with which the SVIB is used. 
ty of this instrument when used with 
g’s suggested cutting score in these pop- 
will be negative for those individuals 
subjective gains matrix corresponds to 
‘Matrix in Table 4. As the counselee’s 
5 Matrix moves away from the Table 4 
and toward our alternative gains 
Ix, the positive utility of the SVIB for 
increases. 
In conclusion, when the utility of the 
B is calculated using more realistic base 
fates, the instrument still shows a positive 
Baur in many situations, given a “reason- 
gains matrix. Utilities are highest, 
ver, when optimal cutting scores rather 
those recommended by Strong are used, 
id under certain, possibly common, combi- 
tions of base rate and gains matrix, use of 
5 5 cutting score can lead to greater 
or rates than are produced by prediction 
“tom base rates alone. 


5 


In summary, the SVIB, if carefully used, 
probably has a significant positive utility for 
most individuals, despite the limitations on 
predictive efficiency imposed by low occupa- 
tional base rates. This utility, however, can 
be cancelled or even rendered negative under 
some circumstances if nonoptimal cutting 
scores—including those recommended by 
Strong—are employed. Whether or not this 
latter effect occurs depends directly on the 
relative negative values placed on false posi- 
tives and negatives and the relative positive 
values placed on true positives and nega- 
tives, that is, on the gains matrix of the indi- 
vidual. These conclusions point to a need for 
the measurement of actual counselee gains 
matrices. If gains matrices of individual 
counselees could be assessed, optimal cutting 
scores and expected utility of the SVIB could 
be assessed for each individual. 


o Measurement problems in the present context 
are not as complex as they might seem at first 
glance. The four utilities in an individual’s gains 
matrix need to be measured on only an interval 
scale. A ratio scale, that is, a scale with a rational 
zero point, is not necessary (see footnote 6). Stan- 
dard psychometric procedures are generally consid- 
ered to give adequate approximations to interval 
scales (Nunnally, 1967, chap. į): 
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of consumer dissatisfaction can 
uses in the marketplace, such 
malfunctioning of products or 
of illegitimate pricing. How- 
today’s discontent with mat- 
ption may possibly be viewed 
n frustration in life roles other 
sumer role. Support for this 
mes from Hulin (1969), whose 
‘> satisfaction and its relatedness 
“ving conditions in the community 
drawn attention to the possibility that 
faction and/or dissatisfaction in one life 
le may be transferred into other life roles. 
application of this principle to the cur- 
nt problem of consumer dissatisfaction, 
However, is notably absent in the literature. 
et, in view of what is known about the 
Various phenomena of adjustment to frus- 
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pplication of Alderfer's existence, relatedness, and growth (ERG. 
y investigated whether frustrated relatedness needs resulted in addi- 

consumer existence needs which, in turn, produced consumer dissatis- 
ion. Studies of 306 randomly chosen subjects indicated that a decrease of 
:mers’ ratings of satisfaction in thelr relations with “significant others” 
with people at the place of employment was correlated with an increasing 
„ for more items of material value in the consumer role. Some correla- 
s reached higher statistical significance when respondents of the lowest 
sme group and those of both the oldest and the youngest age categorics 
« eliminated from the sample. Higher incomes correlated positively with 
tence need satisfaction. The results supported Alderfer's theory. 


trations, particularly projection and scape- 
goating, it is quite possible that consumer 
satisfaction and/or dissatisfaction originates < 
from satisfaction or frustration experienced 
in various life roles, not merely in the con- 
sumer role. In fact, personnel working in 
consumer complaint centers could perhaps 
testify to the face validity of this proposition. 
On a theoretical basis, such a cause/efi 
relationship for consumer dissatisfaction 
also be constructed from the concepts 
Alderfer's (1969) E.R.G. theory; this cons 
ceptual framework classifies human needs into 
three categories: (a) existence, (b) related- 
ness, and (c) growth. 3 
Existence needs include all goods and ser- 
vices offered in the marketplace. In accord- 
ance with the E.R.G. theory, existence needs, 
the lowest of the need categories, are ordi- 
narily pursued only to the point where the 
individual’s physiological and psychological 
survival is assured. In our North American 
setting, this state can perhaps be reached by 
a majority of consumers with relatively little 
effort. Simultaneously with the pursuance of 
existence needs and predominantly after the 
existence needs are met, the individual works 
at the satisfaction of the next higher level of 
needs: relatedness needs. 4 
Alderfer defines relatedness needs as needs 
involving relationships with significant other 
people. People are assumed to satisfy related- 
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ness needs by mutually sharing their thoughts 
and feelings. If the social environment is 
tier, the relatedness needs will soon be 

relatively satisfied, and the individual can de- 
vote his time and effort to the pursuance of 
growth needs. However, if the social climate 


is unfavorable, the pursuance of relatedness 
needs may soon lead to a state of frustration. 
As a result of his being unable to satisfy his 
relatedness needs, the individual may then 
regress to the creation and pursuance of addi- 
tional existence needs in the form of expanded 
consumer wants. (The E. R. G. theory’s Propo- 
sition 2: The more relatedness needs are 
frustrated, the more existence needs will be 
desired.) Since most consumers’ income is 
inevitably limited, dissatisfaction is likely to 
arise soon out of their inability to satisfy all 


d the newly created existence needs. 


* 


The general possibility of using Proposi- 
tion 2 of the E. R. G. theory to explain some 
of the problems of consumer dissatisfaction 
5 has already been advanced elsewhere (Scherf, 
: 1974). The purpose of the present study is to 

test empirically several hypotheses which can 
be deducted from such a frame of reference. 

E following hypothesis is derived from the 

_ E.R.G. theory’s Proposition 2: 

e 


Hypothesis 1. The less consumers’ related- 
ness needs are satisfied, the more existence 
needs in the form of income and consumer 
goods they desire. 


a 


Low levels of relatedness satisfaction would 
eventually be expected to cause low levels of 
existence satisfaction, due to the consumer’s 
inability to satisfy the new existence needs. 
Such a proposition is not readily assessed in 
an unbiased way by means of the present 
study. This project could, however, be ex- 
tended to explore possible causal relationships. 
On the other hand, it is very likely that an 
È income differentiation will exist with regard 
to the extent to which consumers can satisfy 
ditionally created existence needs: It should 
easier to meet them with higher incomes 
an with lower incomes (although the actual 
level of income does not necessarily affect 
consumers“ desire for more existence needs). 
The following hypothesis is therefore ad- 
vanced: 
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Hypothesis 2. As the con 
to meet additional existence 
due to higher actual income 
satisfaction with existence n 
accordingly. 


Finally, it is proposed to ve 
cability of Proposition 1 of th: 
to man in the consumer role. 
of the E.R.G, theory states th 
istence needs are satisfied, the 
be desired. The hypothesis“ 
context is formulated as follow 


Hypothesis 3. The less consur 
needs in the form of income 
goods are satisfied, the more the: 


sition 1 
less ex- 
they will 
present 


existence 
onsumer 
desired. 


This application of Proposition 
ered especially important in vii 
that the approach used in the 
for the assessment of existence n 
as a consumer differs significant 
used previously for studies of n 
ganizational setting. 

In view of the hierarchically d 
tions occupied by existence an, 
needs, it is necessary, furthermore 
the possibility that a person’s ac: 
not only governs his satisfaction 
ence needs as postulated in Hypo: 
also affects the fundamental 
between existence and relatedness ie 
pecially for the lowest income group it is 
possible that the deprivation of essential ex- 
istence needs is so intense that the related 
ness needs are hardly yet considered. This 
problem is examined by selectively excluding 
the poorest segment of the sample in 4 
separate analysis. Similarly, it is consider 
necessary to examine whether age factors 
might also place limits on the applicability 
of the theory. 


METHOD 
Subjects and Procedure 


The subjects, approximately 306 consumers, Wel 
drawn from a small city and one of its neighborini 
towns in the province of Ontario. Respondents for 
the town population were selected by means of a 
random process using the name lists of the Region 
Assessment Office. They were contacted personally g 
the research assistant, who provided them with 
questionnaire, and asked that it be returned b. 
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TABLE 1 


M xos or Consumers’ Desire ron More Existence Nexps (D-E) at Varvio 


LEVELS OF SATISPACTION with RELATEDNESS NEEDS (S-R) 


St 
1 | eae E. 
—⁊ ͤ ee 
1, Total san 
, 89 154 45 
J 5.64 5.97 5.58 
S 1.70 1.70 1.61 
1 
2 Age group only 
r 78 140 40 
3 5.64 6.04 5.55 
S 1.75 1,65 1.59 
J. Age grou; ) only | 
N 62 120 31 
5.55 6.21 5.61 
1.51 1.60 1.65 
4. Income g ove 
$5,000 on 
n 59 100 30 
J 5.58 5.99 5.60 
1.70 1.74 1.56 
5. Total sam 
n 27 116 | 64 
* 5.22 5.69 5.84 
` 1.62 1.69 1.38 


S-R Rating 


— = = = = — = 
Analysis 
| Fest | P 
> eee | | 
„57 8.00 | 7.50 n | OS 
59 | 0 | 1.50 | | 
| S 
i 
d 
i 


7 

6. 

1 

5 2 1 

6.50 8.00 9.00 | p< 05 05 
1.30 0 0 

| 

4 P, 1 

6.25 8.00 900 |p<.01 Al 
1,09 0 0 


5 2 2 | 

6.20 8.00 7.50 | p<.0l| 08 

1.42 0 1.50 
37 11 3 | | 2 
6.05 7.00 733 | p<.05| Us 
1,86 1.81 1.25 ] 


Reeds with people at the place of employment. 
mail. A total of 59 consumers completed the ques- 


ae satisfactorily (61% of the questionnaires 
a uted). Respondents for the city population 
kate selected by randomly identifying small dis- 
15 within which every family was then contacted 
y the research assistant. The assistant handed the 
* to the respondent and asked that it be 
m y for personal collection by a specific time the 
lowing day. A total of 247 consumers in this 
se answered the questionnaire satisfactorily 
15 of the questionnaires made available). Homo- 
— 1 tests (F tests) were carried out with re- 
ae 0 the response patterns of both the existence 
the ee and the relatedness needs ratings of 
<i o groups before they were combined into a 

ingle sample. 
ted consumer was given assurance of complete 
ie pu both verbally and in writing on a cover 
ate = ich accompanied the questionnaire. The 
8 ent ormation regarding the project given to 
85 jects was that its purpose was to assist the 
5 of Consumer Studies “in finding possible 
ih tod 9 — many of the problems which people face 
ay’s markets and in our society in general.” 


relate satisfaction with relatedness needs with significant others; Study S relates satisfaction with relatedness — | 


Measures 


Alderfer (1969) assessed man’s existence needs in 
the role of producer (employee) primarily through 
questions which related to the strength of desire 
for, and satisfaction with, income and fringe bene- — 
fits. Man in his role of consumer, however, has 
principally two alternatives for expressing his ex- 
istence needs: He may perceive his level of con- 
sumption to depend on his income and/or on his 
fate in the marketplace. The desire for existence 
(D-E) need was therefore measured by giving equal 
weight to the rating of the respondent’s desire for 
more income from employment sources and to the 
rating of the desire for “more” in terms of wants 
related to the marketplace: D-E = Dincome + 
Daarxet. While the desire for more income could be 
assessed in one direct question, the desire for more 
consumption opportunities from the marketplace 
was measured as the average response to five ques- 
tions: The desire for (a) better prices, (b) better 
quality, (c) better information, (d) the ability to 
replace outstyled goods more frequently, and (e) 
the ability to replace worn-out goods more fre- 
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that those included in signiñcant others are a part 
of a relatively durable system of interaction for a 
person in his consumer role, while the contact with 
_ gales personnel, in contrast, is rather transitory by 
7 and relatively brief in duration, and hence is 
unlikely to have a pronounced influence on a con- 
^s desire for more“ or his satisfaction with 
It was felt that the salesman is 
more likely to influence a consumer's satisfaction 
‘ C 
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f 
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were stratified according to | ating of 
satisfaction with relatedness in two 
different contexts, satisfaction related- 
ness needs with significant ot (Studies 
1-4) and satisfaction with re! ess needs 
with people at the place o‘ loyment 
(Study 5), and the mean ra of each 
group’s desire for more exi (D-E) 
needs were calculated. In evt udy the 
D-E means increased quite sys ically as 
the satisfaction with relatedne eds de- 
creased (increasing satisfaction related- 
ness ratings), in apparent agree: with the 
hypothesis. Particularly notew is the 
fact that the last three groups, es 2, 3, 
and 4, that is, those groups re ng “dis 
agreement” responses on the t scale, 
contained very few respondents invaria- 
bly showed higher means for dc- or more 
existence needs than did thos: he first 
three groups, that is, those ing an 
“agreement” response on the | -t scale. 


h series 


However, the F test performed fo: 
sample 


of means showed that for the 


statistical significance at the el was 
obtained only in Study 5 for satis on with 


relatedness needs with people at place of 
employment. 

Studies 2-4 were carried out i: 
establish whether qualifications rez% ding age 
and income would affect the data sought for 
the support of Hypothesis 1. Accordingly, the 


order to 


A S-R Rating 


Graphical representation of the data of Studies 3 and 5. 


MER DISSATISFACTION AND INTERPERSONAL RELATIONSHIPS 
TABLE 2 < 
ds oF Consumers’ SATISFACTION wrin Existence Nexos (S-E) ov 
Vanyine Income Classes IN Sruny 6 
Income group | Analysis 
Measure — — 
$5,000 | $5,000-10,000 s10000-15,000 | Above $15,000 | F test | 5 
" | 109 4 2 | 
M 0 7.71 6.19 $38 |p < Ol 16 
SD 9 2,08 2.16 1.09 | | À 
means anal the satisfaction with re- appeared that the significant differences (p < 
latedness ne th significant others strati- .05) occurred almost exclusively when thè 
fication use tudy 1 was repeated after comparison involved one group whose satis- 
selectively uing those groups of re- faction with relatedness needs rating was de- 
spondents \ seoretically may be consid- rived from an “agree” response of the Likert 
ered as exc = to the assumed need hier- scale, and one whose satisfaction with re- 
archy. The Studies 2 and 3 indicated latedness needs rating was derived from a 
that the sta al significance expressed by “disagree” response of the Likert scale. 7 
the F test red the .05 level when re- Study 6 was carried out for the examina- 
Spondents | n the ages of 20 and 65 tion of Hypothesis 2. Respondents were 
only were c red, and it improved to the stratified according to their reported income 
Ol level wt be age groups included only group, and the mean response to their satis- 
those betw: ) and 50. A similar improve- faction with existence needs was calculated. 
ment was ned in Study 4 when the re- The data, recorded in Table 2, agreed with 
spondents o lowest income group were the hypothesis: The satisfaction with exist- 
eliminated fru: the sample. ence needs increased systematically with 
A closer ction of the data suggested higher income levels. The corresponding — 
that the as- ion between the two varia- graphical presentation of the data, shown in 
bles, satisfacion with relatedness needs and Figure 2, suggested the existence of a curvi- 


desire for exisience needs, may be of a curvi- 
linear rather than linear nature. A graphical 
illustration oi this trend is shown in Figure 
l 1 for two selected studies. In order to ob- 
tain an adequate measure of the strength of 
the total association between both variables, 
the corresponding correlation ratios, n° (Hays, 
1963) were calculated. The / values con- 
firmed the conclusions drawn from the 
means / E tests regarding the effect of age and 
income on the satisfaction with relatedness 
needs/desire for existence needs relationship. 

For those studies where the F test indicated 
that the differences among the means were 
Statistically significant, yet the total associa- 
tion revealed by the „? reached only rela- 
tively small values, it was of further interest 
to determine where the main differences would 
occur. Multiple ¢ comparisons were therefore 
carried out ior Studies 2-5 of Table 1° It 


Detailed results are available from the author 
"on request. 


these variables. 
in Study 7. 
to 


linear relationship between 
Hypothesis 3 was examined 
Respondents were stratified according 


their desire for existence needs rating, and 


the mean responses for satisfaction with ex- 


istence needs were calculated. The data, 
recorded in Table 3, supported the hypothe- — 


sis: The desire for existence needs increased — 


as the satisfaction with existence needs de- 
creased. A closer inspection of these data 


g 7 
ys 
is 
S 5000-10000 10900-15000 >15000 
Income 
Ficure 2. Graphical representation of the data of 
Study 6. 
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TABLE 3 


RESPONDENTS’ MEAN RATINGS OF SATISFACTION WITH EXISTENCE NEEDS (S-E) AT \ 


Dese For More Existence Neens (D-E) IN Srupy 7 


D-E Rating 
Measure 7 ——— Fh) 
2 3 4 5 6 7 | s 10 
n 4 21 4 
M 4.00 | 4.94 | 5.66 | 7.26 | 698 | 8.07 | 8.95 | 9.14 | 9.50 
SD 0 | 1,03 1.77 | 191 | 1.32 1.66 1.65 | 2.09 1.06 


Suggested that the trend towards curvilinear- 
ity may be less pronounced for the desire for 
existence needs/satisfaction with existence 
needs association than for the satisfaction 
with relatedness needs/desire for existence 
needs associations discussed previously. 


Discussion 


The finding that Proposition 2 of the 
E.R.G. theory is applicable to man in the 
consumer role, although with some limitations 
_ as to income and age, is perhaps the most 
important result of the study. The fact that 
_ the more relatedness needs are frustrated, the 

more existence needs will be desired may add 
an entirely new dimension of strategy to 

those concerned with making consumers more 
= satisfied in today's environment. It would 
appear that real improvement of satisfaction 
Could occur only if man's meeting of his re- 
latedness needs is facilitated along with the 


improvement of his economic welfare. 

In accordance with most need hierarchies, 
the most elementary and vital existence needs 
have to be reasonably satisfied before higher 
needs such as relatedness needs, take on 
major importance; hence it is not too surpris- 
ing to find that the interdependency of de- 
sires for more existence needs and interper- 
sonal satisfactions was more pronounced when 


persons of very low incomes were excluded 
from the sample. 


a The exclusion of the youngest and of the 
most senior segments of the sample also 
produced an increase of the satisfaction with 
3 relatedness needs/desire for existence needs 
i relationship. This finding might be tentatively 
A explained by assuming that to persons of a 


very young age, the opportunity to translate 


frustrated relatedness needs 
existence needs is not yet a 
of their still limited earning 
other end of the scale, pers 
senior segment of the sample : 
find themselves in the same 
cial position; alternatively, 
possible that senior persons, 
life experience, no longer cons: 
sive transfer of frustrated re 
into additional existence neco 
this group, future studies whic 
to assess the effect of frustrat 
needs on growth needs will 
terest). 

It is acknowledged that the 
measuring existence needs in 
role may still be incomplete. y 
ferent items, for example, wer: 
express the strength of desire ior 
deals“ in the marketplace, while a mu 
larger number would be needed if one W 
to exhaust all possible alternatives. 

The combination of measuring satisfa 
items on a 6-point Likert scale and des 
items on a 5-point scale has been adop 
without renewed qualification from the omgi 
nal work on the E.R.G. theory (Alde 
1969). As all the items were phrased so thá 
satisfaction was at the 1 end and the dest 
scales had high ratings on the 5 end, 
possibility exists that some responses : 
biased by an individual’s mere preference 1 
high numbers. ] 

Furthermore, the present work shares 
Alderfer’s work the limitation that both 
correlational studies and hence do not pet 
causal inference. 


more consumption opportunities. 

manuscript, University of Guelph, 1975, 

Scherf, G. W. H. Consumer education as a means of 
alleviating dissatisfaction. Journal of Consumer 
Affairs, 1974, 8, 61-75. 
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determinants of usage. 


The present study is part of a project aim- 
ing at investigating variables relevant for the 
use of seat belts in private automobiles. 
Knowledge of such variables should be of 
value in designing campaigns with the pur- 
pose of raising the level of seat belt usage by 
motorists. 

In a previous study it was shown that indi- 
i vidual variation was responsible for a larger 
Proportion of belt use behavior variance than 
was situational variation (Fhanér & Hane, 
_ 1973b), This finding led to a search for indi- 
_ vidual variables related to belt usage in the 
hope of finding variables which could be in- 
fluenced by campaign measures. 

As was pointed out in a literature survey 
(Fhanér & Hane, 1973a) the striking fact is 
chat belt users and nonusers alike express a 
very favorable attitude to seat belts, the belt 
users being only somewhat more favorable. 


* This study is part of a research Project con- 
cerning factors influencing the use of seat belts in 
private automobiles. The Project was carried out by 
the authors under the Swedish Road Safety Office 
Contract Inf. 32, and it was supported by a grant 
from the Swedish Renault, Inc. (AB Svenska 
Renault). The complete report, No. 25, Dnr 93- 
782100 u, is available at the Swedish Road Safety 
Office, Driver's License and Research Departments, 
_ Fack, S-171 20 Solna, Sweden. 
‘ 2 The authors are equally responsible for the in- 
vestigation and its presentation. 
Requests for reprints should be sent to Gunilla 
> Kristina Hards vag 9, 8-443 00, Lerum, 


SEAT BELTS: 
RELATIONS BETWEEN BELIEFS, ATTITUDE, AND 


GUNILLA FHANER® axo MONICA HANE 
University of Uppsala, Sweden 


A sample of 368 car owners answered a mailed questionnaire constru 
the basis of a model of attitudes toward seat belt use. A factor 
yiekled a belief pattern interpreted in terms of five factors, labeled “d 
fort," “worry,” “risk,” “effect,” and “inconvenience.” The model a; 
useful since an independent measure of attitude to seat belt use (A.) c 
predicted (r=.84) from a linear combination of these factor scores 
correlation between the attitude measure and reported use of seat belt 
-56, or about the same as that between the belief factor combinatio: 
reported use. The combination of the “discomfort” and “effect” factor 
near optimal predictions (r=.52). The results were replicated on a 
sample. A model of seat belt use was suggested in which conceptions 
discomfort of belt usage and effects of belts in an accident were regard 


practically 
item at a 


However, a serious limitati: 
all studies so far is that onl: 


time has been used as an in on of atti- 
tude, probably resulting in | ~ reliabilities 
and, hence, low correlations + usage. 
The popularity and extens ise of the 
attitude concept has led to l~ conceptual 


on Allport 
O different 


precision. As early as 1935 G 
(1967) considered more than 
definitions in his review of the xeneral ares 
of attitude theory and researc: (Allport, 
1967; see also McGuire, 1969). in the pres 
ent study the concept of attitude is regarded 
as a latent variable referring to the amount 
of affect for or against a psychological object 
[Thurstone, 1931].” Cognitive and action 
variables are seen as determinants or conse 
quents of attitude. j 

The attitude model adopted in this study 
is consistent with that proposed by Rosen- 
berg (1956, 1960) within a cognitive con- 
sistency theoretical framework, and arrived ât 
by Fishbein from a behavior theory pee 
view (Fishbein, 1967a). Rosenberg predict A 
that the affect attached to an attitude objec 
will be highly related to (a) “the perce? i 
instrumentality of the attitude object, tha 
is, the judged belief or probability that the 
attitude object would lead to, or block, 
attainment of “valued states,” and (b) E 
“value importance,“ that is, the amount 0f 
affect expected from these “valued states 
Similarly, Fishbein stated that an individual 
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y object is a function of 

Í his beliefs about the 
evaluative aspects of those 
a). This was expressed 


N 
r * Bia; 


=! 


toward object o, 

of belief i about o, that is, 
bility” or “improbability”’ 
ssociated with some other 


ive aspect of By, that is, the 
f X45 
r of beliefs about o. 


attitude regarding seat belt 
seen as a function of the alge- 
products obtained by multi- 
ngth of his beliefs associated 
by the evaluative aspects of 


literature search, formal and 
sions with motorists of many 
egories, cy ts and nonexperts, as well as 
Me prelin ay studies based on question- 
res and views at a gas station yielded 
Variety of beliefs related to seat belt usage. 
The injury preventive effect of seat belts 
ed to be a salient belief of most people. 
ever, concern about seat belts aggravat- 
injuries was also frequently expressed. 
instance, some people expressed the be- 
that a seat belt might be difficult to un- 
in the event that the car caught fire or 
ed in the water. 

feeling of constraint and discomfort 
to be rather widespread, and was a 
mon excuse for nonuse. For some people 
Mis feeling seemed to be a general and deep- 
aversion against being tied up, but for 
it appeared to concern the restricted 
of motion, the difficulty to reach, etc. On 
other hand, some people felt increased 
fort from wearing a seat belt, either by 
Physical support offered by a belt, es- 
y on curbed roads, or by the genera- 

n of a feeling of security. 


BELTS: RELATIONS BETWEEN Bexrers, Arrrrupe, axo Use 


mo 

Tt was also held that the use of seat belts 
could influence the risk of accidents, either 
making people feel too secure, leading to 
driving, or by making drivers 
leading 


careless 
anxious, to safer driving. 
The 


The image of a driver also seemed to be 
affected by belt use, To some people seat belt 
usage was a sign of being a good and careful 
driver and of being responsible and having 
good judgement, while to some others a seat 
belt user was an anxious and insecure driver. 

Using Fichbein's (1967a) model, the atti- 
tude to seat belt use (A.) for an individual 
can be estimated from a simple regression 
equation involving these beliefs related 10 
seat belt usage. 

Ay = Biti +. . Ba. 
We shall make the simplifying assumption 
that each evaluative aspect, a, has the same 
value for all individuals. The variation in As 


can thus be predicted from variation in each 
B, or belief strength. ` 

According to Fishbein (1967b) and Fish- 
bein and Raven (1967) the strength of a 
belief can be measured in probability terms, 
The strength of the belief concerning the rela- j 
tion between seat belt use and injuries can be È 
regarded as a conditional probability. First, 
there is the probability of an accident, and 
second, the probability of a seat belt reduc- 
ing or aggravating injuries, given an acci- 
dent. The strength of this belief could thus be 
measured by a two step process, whereby the 
perceived accident risk is first obtained and 
then combined with the belief about seat belt 
effects, given the accident. 

As a strategy for finding variables causally 
related to belt usage, it would be wise to 
start by investigating existing correlations 
between belt usage, attitude, and the beliefs — 
discussed previously. 4 

STUDY 1 


The primary aim of the present investiga- 
tion was the identification of attitudinal and 
cognitive variables related to the use of seat 
belts. A subordinate goal was the construc- 3 
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tion of an instrument, tapping important be- 
lieis, which could be used in further studies. 


I Method 
Measures 


Attitude to belt use (A.). To get a “direct” mea- 
sure of attitude to belt usage (A.) seven statements 
intended to tap the evaluative aspect of belt use 
were constructed, such as “Everyone ought to use 
a seat belt when driving a car.” The respondents 
indicated on a S-step verbal scale their degree of 
agreement with each statement: completely agree 
(1), mainly agree (2), partly agree (3), mainly 
disagree (4), and completely disagree (5). “No 
opinion” could also be checked. 

Beliejs. On the basis of the beliefs about seat 
belts and belt usage that seemed most salient, ac- 
cording to the previous discussion, 52 Likert-type 
items were constructed. The respondents indicated 
their degree of agreement by using the aforemen- 
tioned 5-step verbal scale. The content of the items 
b given in Table 1. 

Reported seat belt use. To minimize possible influ- 
ence of behavior reports on the answers to the 
belief and attitude items information on the re- 
spondent’s actual seat belt usage was obtained two 
weeks after the questionnaire had been returned. 
The degree of influence of the questionnaire re- 
Sponses on the behavior reports was assumed to be 
less than in the reversed direction (see Wicker, 
1969). The respondents were contacted by telephone 
and were asked to indicate their habitual seat belt 
usage in city traffic and on the highway. The re- 
Sponses were coded in four categories: never or 
almost never (1), sometimes or seldom (2), often 
or generally (3), and always or almost always (4). 
According to the one dimensional model presented 
in a previous study (Fhanér & Hane, 1973b), mea- 
Sures of seat belt usage in city and highway driv- 
ing can be summated to an index of overall belt 
usage. Level of belt usage is thus given in terms of 
@ 7-step scale, a value of 2 indicating nonuse and of 
_ 8 indicating consistent use in both highway and city 

driving. Those lacking seat belts in their cars (10 

respondents) were coded as nonusers. The reliability 
of self-reports of belt usage in terms of homo- 
_ geneity has been shown to approach unity (Fhanér 
4 & Hane, 1973b), and the validity of such reports 
has been demonstrated to be relatively good 
(Fhanér & Hane, 1972, 1973b). In the following, 
“seat belt use” or “belt usage” refers to the sum- 
mated. 7-step scale. 
Background variables. Beside the belief and atti- 
5 tude items the questionnaire also contained ques- 
_ tons concerning type and age of car, type of belt, 
age of drivers license, mileage driven each year, 

sex, level of formal education, and size of home 
commun ty. 


Subjects 


' D 


r 


* 


* 
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From the automobile registration files in Örebro 
5 county (län) covering 80,000 cars, a sample of 600 
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car owners was randomly drav 
sion of business firms, official 
sample consisted of 545 private 
ers. Örebro county is located ir 
parts of Sweden and is often 
“Miniature Sweden,” since its dem: 
sition quite closely resembles that 


Procedure 


The questionnaire was mailed 
on December 8, 1970. Enclosed w 
letter pointing out the scientific 
study and stressing the inaccessi 
vidual’s answers to people outside 
increase the motivation to respon 
arranged among those who return 
questionnaires. Complete informati 
for 368 subjects (68%). 

One third of the nonrespondent 
by telephone. Thirty-two were re 
refused to answer any questions, 
sponded showed roughly the same 
reported belt usage as did the re 
lation statistics for car owners ir 
were available for the age and se’ 
tistiska Centralbryän, 1970). In t 
bles the distribution of the pres 
almost identical to that of the | 
even though subject loss probabi, 
random, no reason was found fo: 
biased resuits due to subject loss. 


Results 


The index of seat belt use a: 
ground variables were cross-tai 
was the only variable that was 
related to belt usage, women 
higher level of usage than men 
df =2, p< .001). 


Attitude to Seat Belt Use (As) 


The intercorrelations between the sev! 
items ranged from .25 to .62. The scores 
these items were added for each individual 
get a measure of his attitude (Ae). The smi 
number of respondents (mean 4.1%) N 
had checked the “no opinion” alternative) 
included in the middle response category i 
“partly agree.” The reliability of the attitil 
measure in terms of homogeneity was © 
mated with the Kuder-Richardson (KR) 4 
formula and was found to be .84. 


Beliefs about Seat Belt Usage 


Factor analysis. To investigate whel? 
the hypothesized belief structure with resp 
to seat belts and the use of them was 


BELTS: RELATIONS BETWEEN Bevrers, Atrrrupe, AND Use 


TABLE 1 
Rorarep Facrok Matrix 
a = 
) and content 
u II | v 
N2... í so} — — — 
174 ... fc aining 67 — — a 2 
19 ... n irritated ... worse driver A — — — —,20 
155 ... fe sted 56 — — æ 
HE.. sit mfortably -48 — 30 — = * 
. . ci idents AS — — —.2 — * 
Bi... a »vercautiousness H ~ 24 — — 48 
28... al more nervous 42 — < — -22 4 
N . vi c road ... difficult 39 5 — =. 32 AS 
10 ... dif » reach 38 — — — -37 37 
210 ... fai of security 35 5 = -H — 37 
164... in irivers use 32 = 20 —.29 — H 
n5. of ry ... accident * 81 — — sze — 
20 Before ... often worry p A — sas — 5 
Th... co p worrying = 59 — = = 37 
„ al orry ... family is driving — 58 — — —.22 0 
M8... x never think of risk = —49 27 = 1 33 
128 Durin: awful to drive — 47 — — — 24 
156 ... ne: ok of being injured — — 41 27 — — 24 
130 ... an can happen ... ? — — 4⁰ “æ — —.23 22 
173... ne ak of accidents mb as ed eel Nea * — 
20 eas esist ... by the arms — = 47 = = 39 
171 . pr fes to resist ... by his arms — = e = 28 
AI su the body —40 — At z= Ta 4 
160 . fee f security ... drives safely — 34 — 43 >, ~~ 47 
a People c> gerate the risks — = — p. p 1227 
oe dents avoided ... by judgment = = ~ 5 * 7 
142 ust as safe as erana = — 27 34 — 25 2⁰ 
M3 .. oup in accidents = — 34 21 — 10 
135 |.. driving carefully ,.. avoids accidents = ra -33 yt a 12 
267 ... a group of accident prone drivers — ae 33 — — 18 
258 s of risk ... drive carefully — 29 31 — = 22 
26 . ape injury by getting down 20 — 31 — — 23 
261 It has been proven ... escape injuries — — 60 — 55 
by preventing being thrust out = 2 = 0 * 38 
B. prevent casualties = aes <a a 2 85 
effective st brain dam P 7 > p à 
r ae 357375 
136 It has not been proven . prevent injuries = = 28 ~Al ae 27 
= ay lighter injuries prevented — =r 30 —.39 — 43 
better be thrown clear out 21 = 2¹ —.3⁴ = 30 
10 +++ Satisfactory protection a Eri 27 * a 5 
27 . trouble putting on — — — — —.83 71 
163 . rather simple to put on = ae ye 1 a w 
x trouble to adjust se cay a p7 = 72 155 
150 +++ gets caught in the door 26 FA = =a $ 
ie <+- wrinkle the clothes 27 iy Ej N F 39 
difficult to undone . . if one fire 29 = = = —.36 43 
ie One out of three . injured on 22 — Tä — v 
50 Even .. experienced drivers ... involved Ke on Pi ES 18 
1 won't drive an inch... cautious 29 oo zj 25 2 23 
Bays 15 . Poin to wear ory ay oe q! 2 EDE 26 
e 5 e 


ide collisions . no help 
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adequate description of that of our respond- 
ents, the product-moment correlations among 
the items attempting to tap such beliefs were 
factor analyzed. A principal-factors solution 
with squared multiple correlations as com- 
munality estimates was used, Oblique rota- 
tion was performed with the rotation cri- 
terion for simple loadings suggested by Jenn- 
rich and Sampson (1966). Before the rota- 
tion “Kaiser normalization” of the com- 
munalities was performed. Five factors 
emerged that had eigenvalues greater than 
1.0. These five factors accounted for about 
36% of the total variance, and the extraction 
of additional factors contributed very little to 
that proportion. Of the 54 variables 41 had 
only one loading above .30, and no variable 
had more than two such loadings. The ro- 
tated matrix and communalities are presented 
in Table 1. The items are arranged in ac- 
cordance with the factor pattern to facilitate 
interpretation of the factors. Loadings below 
an arbitrary limit of .20 have been omitted 
to increase the readability of the matrix. 

The intercorrelation between Factors I 
and V was —.42, between I and IV —.36, 
between I and III, as well as IV and V, just 
above .20. The remaining intercorrelations 
were close to zero. 

Interpretation oj the factors. The items “I 
feel uncomfortly confined wearing a seat belt” 
and “The seat belt feels constraining, I 

think” had the highest loadings in Factor I 
and low loadings in the other factors. The 

largest negative loading in the first factor was 
_ received by the item “One sits more com- 
_ fortable wearing a seat belt.“ Thus, Factor I 
_ seems to cover feelings of discomfort. 
The second factor had its highest loadings 
in items expressing worry, for example, “I 
often worry about getting involved in a car 
accident.” Large negative loadings in this 
factor were typical of items denying thoughts 
about accidents and injuries, exemplified by 

“I never think of the possibility of being 
seriously injured in a car accident.” An ap- 
_ propriate label for Factor II appears to be 
a “worry.” 
* The items with rather pure loadings in 
Factor III concerned the low risk of getting 
into accidents and of being injured without 
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wearing a seat belt. Hen or THT seems 
to represent feelings of ri 

The fourth factor can e e interpreted 
in terms of seat belt efie ess. “It bw 
been proven that the use o t belt is om 
of the best ways to avoi es in a oh 
lision” had the highest lo and the ne 
gation of that statement ha irge negative 
loading. Factor IV will acco: ly be labeled 
an “effect” factor. 

The interpretation of Factor V also seems 
rather straightforward. “It i lot of trouble 
putting on and taking off a se elt,“ and the 
obverse “Seat belts are rath imple to put 
on and take off” had th: est absolute 
values. Thus, inconvenience idling seems 
to be the common denomin Í Factor V. 

Three of the hypothesized s can easily 
be identified, namely the Ei actor (IV), 
the Discomfort factor (1) the Inom 
venience factor (V). The i constructed 
to cover the experience of ri re evidently 
not homogeneous but seen cover two 
dimensions not closely relat ssibly Fac- 
tor II, Worry, rather represe: personality 


trait such as anxiety, while © III, Rist 
appears to have a more cog 
The items concerning the i: 0 
user and the items on seat 
the accident probability did noi 
rate factors, but had moderai 
several factors. 2 
Individual factor scores. The most simple 
way of estimating a factor score for an! 
vidual is to select the variables loaded only 
in that factor, and to sum the scores of thos 
variables. The variables are given 
weights thus enabling the estimation of 4 
factor score in new studies with different 
viduals without the computation of a nes 
factor analysis (Sjöberg, 1972). Four market 
variables were chosen for each factor. ; 
agreement between the matrices of interco! 
relations between factors and factor scoresy 
respectively, was quite close. The reliabilit® 
of the factor scores in terms of homogene! 
were computed using the K-R20 1 
and ranged from .86 for Factor I, Discomtes 
to .55 for Factor III, Risk. iy 
Since it could be suspected that not on 
the reliabilities of the factors but also 


Sample 1 
(m = 184) 


82 
$1 
83 
+1 8 
+I 83 
E+I 84 


Fat belt use, 

ied among individuals with 

belt usage, separate factor 
factor score variables were 
he constant users (n = 62) 
90), respectively. Both 
pattern that was similar to 


different le 


analyses oí 
performed 

and nonus 
analyses yit 


the origina! ‘n. Inspection of the com- 
munalities : ed no systematic differences 
between the ‘xtreme groups. It was, thus, 
concluded th: either the pattern of beliefs, 


Ror the relia! 


dependent uj 


ties of the factor scores were 
n level of belt usage. 


Correlation between Attitude to Seat Belt 
Use, A., and Reported Use 


A cross-tabulation between the attitude 
measure, A,, and the index of belt usage 
mdicated that the regression was approxi- 
mately linear. Thus, the product-moment 
Correlation coefficient of .56 would be a fair 
ee of the relation. Due to imperfect 
A ‘abilities of the variables it is of course an 

nderestimation of the “true relationship.” 
results are interesting in light of the 

oad Studies showing a lack of relationship 
1 seat belt use and attitudes to belts 
lt usage (Fhanér & Hane, 1973a). 


Predict; 
rediction of A, from Beliefs about Seat 
Usage 
— investigate how well the attitude to belt 
€ could be predicted from beliefs in terms 


Berts: RELATIONS wee es, Aru, axo Use 


TABLE 2 
Bites CORRELATED wit A, AND wire Keromtnp Use 


a7 
z ' — — 
| Reported we 
Cross- valida- l — 
tion, Sample 2 | Samoei d Sample 2 
(a = 184) ! } (= = 184) 
2 
. 16 A 
3l -0 as 
SS 4 28 
—4¹ -4 -3 
a ss st 
2 st 49 
* S E 
78 00 48 
27 9 2 
75 ói w 


of factor scores, multiple correlations were 
calculated for one half of the sample (m, = 
184). The regression equations thus obtained 
were then used for the other half (m = 184), 
and the product-moment correlations between 
the predicted and obtained values were com- 
puted. The predictions were made on the basis 
of single factors as well as on those factor 4 
combinations that seemed most promising. 
The results can be seen in Table 2. 

The most striking result presented in Table 
2 was the high correlation between Factor I, 
Discomfort, and attitude to seat belt use, Ap. 
Combining Factor I with one or more fac- 
tors barely improved on the predictions for 
Sample 1, although it tended to reduce the 
i in Sample 2 somewhat. Maximal 
values of the multiple correlation coefficient 
were obtained with a linear combination of 
all five factors, but summing the scores of 
Factor I, Discomfort, and Factor IV, Effect, 
gave a very good approximation. The coef- 
cients for the total sample were .82 and .80, 


respectively. 
Prediction of Reported Belt Use from 
Beliefs about Usage 
As in the prediction of seat belt attitude, 
A,, correlations between beliefs and the belt 


use index were calculated on Sample 1 and 
compared to those achieved for Sample 2, 


S 


; 
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using the regression equations obtained for 
Sample 1. The results are presented in Table 
2. 

Factor I, Discomfort, was the best single 
predictor of belt usage. Factor V, Inconveni- 
ence, rated second, but the addition of Factor 
V to Factor I did not improve on the predic- 
tion due to a high intercorrelation between 
those two factors. Factor II, Worry, and 
Factor III, Risk, had both very weak rela- 
tions to usage, while Factor IV, Effect, and 
usage had a somewhat higher correlation. 

The regression equation that yielded the 
best prediction after shrinkage included Fac- 
tors I, II, IV, and V. To obtain optimal esti- 
mates of the regression coefficients they were 
recalculated for the total sample (n = 368) 
yielding a multiple correlation of .56, when 
the four factors were combined, and .52 for 
the Discomfort-Effect combination. On the 
whole, reported belt usage could be predicted 
equally well from beliefs in terms of factor 
scores as from the attitude index, A,. This 
should be expected on the basis of the high 
relation between beliefs and A,. 

As was stated in the introduction the be- 
lief, BI, that is, the instrumentality of seat 
belts with respect to injuries, was conceptual- 
ized in terms of a conditional probability. The 
probability of a car accident should, hence, be 
combined with the probability of a seat belt 
reducing or aggravating injuries, given an 
accident. This line of thinking implied that a 
conjunctive model for weighting the accident 
probability and the Effect would give better 
predictions than a linear model. Predictions 
based on a conjunctive model were thus 
computed and were compared to those based 
on a linear model. Since it was not obvious 
whether the Worry or Risk factor best re- 
flected the accident probability, Effect was 
combined with both factors. However, using 


the conjunctive model did not improve on 
the predictions, 


Stupy 2 


The aims of the present study were two- 
fold: first, to determine the generality of the 
seat belt belief pattern obtained in Study 1 
across individuals; second, to check the mea- 
surement characteristics of the factor instru- 
ment, particularly its predictive validity in 
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terms of self-reports of bi 
behavior observations. 


as well as 


Method 
Measures 
Beliefs. A questionnaire was icted on the 
basis of the results in Study 1 isisted of the 
four marker variables on eact e Factors I, 
Discomfort, II, Worry, IV, Efi id V, Incom 
venience that had yielded the be liction of re- 


ported belt usage. The responsi 
1 to 5 as in Study 1. 

Reported seat belt use. Th 
questionnaire contained 17 questi 
in various situations (see Fhané 


> coded from 


part of the 
on belt usage 
Hane, 1973b). 


Included, also, were the items frequency ol 
usage in city traffic and in his traffic, which 
were used to form the seat belt us- dex in Study 1. 
As in Study 1 a 7-step belt u ‘ex was com 
structed by summation of the = on the items 
concerning frequency of usage in d in highway 


driving. 


Observed seat belt use. Ths rations were 


carried out in the rush hours or srkday mom- 
ings chosen at random from No 17, 1970 to 
February 3, 1971. The observer hidden in & 
workers hut at a road intersect here around 
300 cars passed every morning. ! 2 plate num- 
bers and seat belt usage of the s were read 
into a tape recorder. Thus, for car the fre- 
quency of passage and of drive: t belt usage 
could be obtained. A reliability che, <f the observ: 
tions showed that two independ observers Te 
ported the same number of belt u hut that only 
73% coincided. The reliability of = observations 
was thus not perfect. Especially registrations 
of seat belt use in a series of n for an indi- 


vidual should be regarded with caution (see 
Fhanér & Hane, 1973b). 
Subjects 

The sample consisted of 105 drivers. Of these, 15 


were defined as seat belt users as they had a 
observed wearing a seat belt at least once out ofa 
minimum of five observations. The consistent z 
nonusers were randomly drawn among those 1115 
had been observed at least eight mornings. 3 
mean number of available observations exceede $ 
for the “users” and 10 for those defined as nor: 
users.” The average proportion of seat belt 10 
instances for the users was .61. Thus, there Was 
basis for assuming that the present sample was oF 
resentative of drivers in general. However, it a 
out to be highly similar to the sample in Stu 7 a 
with respect to background variables. The T 
were identified through the registration files. 


3 As an exception Item 141, with the fifth high! 
loading in Factor I, was included instead 
159 ranking third. The choice was based on ii her 
liminary analysis which had resulted in a hig 
loading for Item 141 than for Item 159. 


i em, cum, mmm, 
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TABLE 3 
ROTATED FACTOR MATRIX FOR THE VALIDATION SAMPLE 
Factor 

content ates —. — + = — Commu 

nality 

I TE | erga IV j 
nfortably —.37 20 — 30 i 30 
ted = —6 | 20 49 
ining AT —.21 —.62 21 69 
s 31 — | —.77 21 77 
. accident — 70 4 
p worrying 40 — 41 | Al 
ry ... family is driving = a9) 1) - —.28 - 33 
. often worry 5 62 = = AS 
.sualities A = | — e .68 49 
ainst brain damage = — — (eS 57 35 
ven ... escape injuries = == = —57 39 
ing being thrust out = =v — 50 27 
it in the door 22 = — .33 16 
le to put on —.90 85 
ing on 70 — |} —,30 — 60 
adjust 64 = aw 46 

| 


Were notified 


\ n and later on they were visited 
in their hom 


interviewers. 


Results 


Beliefs about Sai Belt Usage 


E factor analysis of the 16 items, consti- 
Ing the four factor scores that gave optimal 
7 0 ported usage in Study 1, was 
3 T factors were extracted in the 
Way as in Study 1, but in this case 


bene rotation with the rotation cri- 
Ee called Kaiser’s varimax yielded the 
a that was most easily interpreted. The 
¿ated matrix and the communalities are 

Sven in Table 3. 
is 1 pattern showed close agreement 
regard : original pattern in Study 1 with 
5 = the marker variables. Only two 
11 A os from the previous outcome. 
ien 155 its salient loading in the Incon- 
Retor 10 actor instead of the Discomfort 
158 the r Which it was chosen, and for Item 
: e was true. Thus, on the whole, 
9 9 oe obtained in Study 1 could 
a alized to the present sample. As in 
Y 1 the factor scores were obtained by 


Sum: 5 $ 
10 mation of the scores on the four variables 
each factor. 


Prediction of Seat Belt Use from 
Beliefs about Usage 


Reported use. On the basis of the factor 
analysis for the present sample the inclusion 
of Items 141 and 158 in the final instrument 
was regarded as questionable. A recalculation 
of the multiple correlations for the sample in 
Study 1 was made with the two items re- 
moved. The correlations were at least as high 
as for the original 16 items. 

In Study 1 the prediction that was made on 
only two factors, Discomfort and Effect, was 
nearly as good as the optimal prediction based 
on the four factors. The regression equation 
thus obtained was used to predict reported 
use for the present validation sample. The 
correlation between predicted and actual val- 
ues was .61, which was at least as high as 
that which resulted when the prediction was 
based on all the four factors. 

Observed use. To investigate how well be- 
liefs could differentiate between the users and 
the nonusers in terms of observed usage, dis- 
criminatory analyses were performed. The 
point-biserial correlation between the opti- 
mally weighted sum of the four factors (14 
items) and use (dichotomized) was .39. When 
only the two factors Discomfort and Effect 
were used the point-biserial correlation was 
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Users (N=25) Weighted sum Non- users (N=80) 


of D and E 


10 20 30 40 50 


Ficure 1. The power of the discomfort (D) and 
effect (E) combination to discriminate between users 
and nonusers, 


practically the same, or .38. The result for 
the two factors is illustrated in Figure 1. 

The difference between the two groups is 
clear, although the overlap between the dis- 
tributions is considerable. The shortcomings 
of the behavior measures, which have been 
pointed to, and the small size of the sample 
of users should be kept in mind when evalu- 
ating the outcome. Variations in especially 
such situational factors as driving environ- 
ment and distance could be important, even 
though the situation “driving to work” was 
assumed to apply to all drivers. 

The weights for the optimal combination 
of Discomfort and Effect for the prediction 
of actual use were similar to those obtained 
for the prediction of reported use. Hence, a 
high correlation (7 = .83) between the two 
predicted values was obtained. 


Discussion 


The organization of beliefs with respect to 
seat belts and belt usage deviated rather little 
from what had been expected and showed a 
considerable stability over individuals. Simi- 
lar patterns were obtained for groups of car 
owners differing widely in reported belt usage. 
Also, the belief pattern of the validation 
sample was almost identical to the original 
pattern in spite of the differences in pro- 
cedures for subject selection and data collec- 
tion. 

The model of attitude as a sum of products 
of beliefs and evaluative aspects of those be- 
liefs appeared to be useful. An independent 
measure of attitude (A,) could be predicted 
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almost as well as the relia 
sures would permit from 
the discomfort of wearing s 
injury preventive effects. 
The multiple correlation 
usage and the combination Discomfort 
and Effect factors was almo high as the 
correlation between A, and ed usage, A 
small improvement of the pre ons could be 
made by adding informatio: cerning ex- 
perience of inconvenience indling the 
belts and of worry about ge. injuried in 
a traffic accident. 


of the mea- 
ions about 
ts and their 


en reported 


For the validation sample tudy 2 the 
correlation between reported predicted 
use, based on the combination de Discom- 
fort and Effect factors, was rt must be 
kept in mind that it was no adom sam- 
ple, but as it showed the s listribution 
of background variables as sample in 
Study 1, and the reported lev belt usage 
barely exceeded that of the 1 sample, 
the coefficient can be taken as zer support 
for the predictive validity o? e weighted 
sum of these two factors. The «alue of the 
coefficient is considerably hig! han values 
that have generally been rer. ed for the 


and belt 
variance 
(see 


correspondence between attit: 
usage. Obviously, some com 
could be due to methods s 
Campbell & Fiske, 1959), sinc 
were used in all cases, although the 
identical in form. The Discomfort-Effect 
combination also had some power to dis- 
criminate between users and nonusers ob- 
served in the situation “driving to work.” The 
usefulness of the obtained relationships fol 
the construction of a model of seat belt use 
was thus further demonstrated. s 

An illustration of the relationships 
variables that seem most important 0n : 
basis of the present studies is given in Figu" 
2. 

On the basis of the intercorrelations 5 
in Figure 2 (a), a great number of oF 
models could be suggested, even if recipro 
causation and the operation of additum 
variables were not allowed for. The mo 5 
suggested in Figure 2 (b) is in line with 1 
rationale behind many attempts to influer 5 
public behavior in various areas, à ration® 


and 
the 


given 
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which rarel) een subjected to empirical 
test. In this the latent variable, “atti- 
tude to seat se,” is regarded as an inter- 


ediating between beliefs and 
riance being explained by 
tions about the discomfort 
vearing seat belts and about 
usage in a car accident. 


vening varia 
actual usagé 
variation in 
experienced 
the effects 


The attitud n, is assumed to be a de- 
terminant hypothetical variable, “dis- 
position fo: use’ (see Fhanér & Hane, 
1973b). 

Whethe: on in a given situation will 
put on his not will be dependent upon 
the relatio: een his value in the “dispo- 
sition for b e” variable and the charac- 
teristics of tuation. A situation can be 
described i ns of “difficulty,” that is, the 
proportion drivers using the belt in that 
situation ér & Hane, 1973 b). In this 
sense, city ing is a more difficult situa- 
tion with to belt usage than is high- 
way drivin ace level of belt use is lower 
in the city. 

By experi» ontal manipulation of the Dis- 
comfort ar ‘fect factors the value of the 
model prop.se4 in this study could be as- 
sessed. It would, however, be realistic to rec- 
ognize the possibility of additional determi- 
nants of the variable “disposition for belt 
use,” even if the nonperfect correlations in 


part are duc to measurement shortcomings. 
A consequence of a research strategy focusing 
on correlations as they exist in nature is, of 
course, that important causal factors can be 
disregarded due to low correlations. It is pos- 
sible, for instance, that worry about getting 
Involved in an accident is a determinant of 
disposition for belt use.” If the users reduce 
their worry by putting on the belt, feeling 
confident that the belt will reduce the nega- 
tive consequences of an accident, then a zero 
correlation could be obtained, in spite of an 
existing causal relationship. It is even more 
Teasonable, perhaps, to assume the existence 
of feedback loops in the proposed model. 
Behavior change, induced by means other 
than by changing attitude, for example, by 
law enforcement, or monetary rewards, could 
thus be assumed to influence the Discomfort 
and Effect factors. 


(a) 


(b) 


Ficure 2. Relations between discomfort (D), effect 
(E), attitude (A.), and reported use (Use). (a) Em- 
pirical relations obtained in Study 1. (b) Suggested 
causal links. 


In summary, the relations between beliefs, 
attitude, and usage obtained in the present 
studies were substantial and definitively large 
enough to merit further investigation into 
their nature. The traditional paradigm be- 
hind traffic safety campaigns would thereby 
be given an empirical evaluation. 
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FECTS OF CAFFEINE ON ALERTNESS IN 


SIMULATED AUTOMOBILE DRIVING * 
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minutes after ingesting 200 milligrams of caffeine or a placebo, each of 
~ale subjects drove an automobile simulator for 90 minutes. Immediately 
iter, the subject ingested a supplemental dose of the medication taken 
„ (200 milligrams of caffeine or placebo), and then drove for another 
nutes. The simulator provided a comprehensive and coherent set of 
us inputs which produced a degree of realism not usually found in 


atory studies. Both the initial 
ficantly enhanced performance be} 
yur measures of alertness. 


The be! rat caffeine enhances mental 
and physi -rformance is widespread and 
deeply en d in our folklore, but experi- 
mental su for this belief is neither ex- 
tensive no quivocal. In a comprehensive 
review of mental studies, Weiss and 
Laties (1: eported that caffeine seems to 
enhance si spects of performance but that 
the evider carding the effect on human 
performan not conclusive. Fleishman and 
Bartlett (( who reviewed recent research 
on the effect of drugs on human performance, 


Pointed out the need for experimental studies 
evaluating the effects of specific drugs and 


dosage on specific classes of human perform- 
ance, 

In view of the prevalence of the use of 
caffeine (often for the specific purpose of 
enhancing alertness upon awakening in the 

‘This work was performed at the Injury Control 
Research Laboratory, U.S. Department of Health, 
Education, and Welfare, Providence, R.I., and was 
supported in part by United States Public Health 
Service Grant GM-15904 and in part by a grant 
from the Bristol-Myers Company. 

The authors are grateful to William G. Cochran 
and Allen Donner for their guidance in statistical 
matters, and to Marion C. Oviatt and other stafi 
members at the Department of Health, Education, 
9 Weltare Injury Control Research Laboratory 
or their help in this study. 

Requests for reprints should be sent to Gene M. 
thy Massachusetts General Hospital, Boston, 

Assachusetts 02114, 
8 at the Department of Transportation, Al- 

ol Countermeasures Program, Washington, D.C. 


and the 
yond that found with placebo, on each 


tal doses of cafeine 


morning, when studying, or driving an auto- 
mobile), scarcity of systematic information 
about its effects on performance is somewhat 
surprising. For example, in searching the 
literature, we encountered only three well- 
controlled studies of the effects of caffeine on 
complex performance related to operation of 
vehicles. Two of these studies (Hauty & 
Payne, 1955; Payne & Hauty, 1954) demon- 
strated that caffeine can enhance monitoring 
performance in an aviation-training device. 
The other study (Baker & Theologus, 197) 
enhance perform- 


of performance in an automobile- driving 
simulator. The indices were based on time of 
response to accelerations and decelerations of 
a lead car the subject was following and on 
the speed and dependability of his responses 
to a high-beam signal. The unique facilities 
available at the Department of Health, Edu- 
cation, and Welfare (DHEW) laboratory 
where this study was conducted made it possi- 
ble to combine the control and precision of 
laboratory research with a highly realistic 
simulation of actual highway driving. : 


3 METHOD 
Subjects 5 


The subjects were 24 healthy males ranging in 
age from 21 to 26 years, whose vision was mea- 
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Automobile 
Chassis 


Ficure 1, Automobile driving simulator. 


sured as 20/30 or better. All were experienced 
drivers, and none drank more than five cups of cof- 
f fee per average day. The subject was told that the 
experiment concerned the effects on driving perform- 
"ance of a moderate dose of a commonly used, non- 
prescription agent. After completing a 45-minute 

practice drive in the simulator, the subject was in- 
Vited to participate in the study on each of two 
experimental days. Subjects were asked to get a 
normal night’s sleep before each experimental day, 
to avoid use of mood-altering drugs (including cai- 
feine) the evening before and the day of each ex- 
_ perimental session, and not to eat for a period of 
two hours prior to each session. 


i Apparatus and Experimental Tasks 


The automobile-driving simulator (see Figure 1), 
which was constructed around the chassis of a 1965 
Rambler with automatic transmission, presented a 
realistic situation in which the driver performed 
the operations required in actual automobile driv- 
ing. The driver received sensory input simulating 
that which would be experienced by an individual 
driving at twilight along a straight, continuous, flat 
road in a rural environment. The-driver interacted 
Wich a lead car by accelerating, decelerating, and 
braking as he believed the situation required. In 
addition, he responded to aperiodically appearing 
high-beam signals, as described below, and carried 
out these activities while continually steering the 
X Rambler to maintain correct lateral position on the 
_ roadway. 

k Softly lighted instruments and hidden light sources 

Provided low-level interior lighting. Visual displays 
mounted on the side and rear windows of the 
chassis provided a static simulation of a highway 
environment at twilight. 

_ The chassis stood at one end of five adjacent 
movable belts, each of which was a 70-foot loop. 
The belt system simulated two driving lanes, the 
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Optical 
Lens System 


centerline, and the roadsides (si 
highway. An optical lens system 
front of the car made the scale mo 
appear life-size to the driver, T 
display simulated approximately 

highway distance. 

Simulation of forward motion 
through the rotation of the cente 
belts toward the driver. The rotati: 
controlled by the operation of th 
the brake of the car, produced a 
of the “approach” of road and sce: 
actually be seen in a moving car; t 
of the Rambler’s accelerator and 
appropriate changes in the rate f 
road and scenery. Coordination of | 


speedometer readings provided 
inputs that simulated real driving. 
A plastic model of an automobile was mounitt 
on the right-lane belt and, by virtue the optic 
system, this model simulated a life-size lead = 
with which the driver interacted. The simula 
speed of the lead car (controlled by 
recorded on magnetic tape) and the 
Rambler (controlled by the drivers o 
the accelerator and brake) determined the appa 
distance between the lead car and the Rambler. ; 
the resulting visual display, the driver could 0 
take the lead car, maintain a constant distance a 
hind it, or drop back from it. The perceived 
ing and separation rates between the two 
simulated those experienced in actual driving 
corresponding speed differences. During aaa 
perimental drive, the lead car traveled at simu 5 
speeds ranging from 45 to 65 miles per hour, md 
five accelerations and five decelerations aperio 0 
during each 30-minute period. The subjects Moi 
was to follow the lead car at a distance © 
car length for each 10 miles per hour of spee 


el 


Er P CAFFEINE ON ALERTNESS IN SIMULATED AvTomonrte DW N 
TABLE 1 
RESPONSE Trak TO Leap Car DECELERATIONS 
— = — — — — — — = 7 = O — 
a’ X (seconds) . 
Dose Un, Sm 1 
i Logarithm | Antilogarithm =N 
Initial affeine 616 41 | . | 
14 2.32° 2 
Placebo 670 4.68 
Suppleme: alleine 614 4.11 
04 4,22°° 17 
Placebo 695 4.96 
* The stan don of eaffeine-placebo diik logari A ont anal towover dean 
(Cochran & oO ad a Lites det exterted from the enionate of e ee 
percent i reported in terms of antilogarithms of means following the formula : one (100), 
38055 
to correct wed whenever he recognized an especially attractive for our study, but the consid- 
increase or ase in the speed of the lead car. erable demand for its use limited its availability for 
High-bear us were controlled by a mechani- any one particular study. Initially, we had intended 
cal progran hich activated the high-beam indi- to select an experimental design which permitted a 
cator light Rambler's dashboard. Depressing test of two or more dose levels, in order to obtain 
the dimmer ch on the car’s floorboard extin- information about dose-response relationships and 
guished the Ii not extinguished by the driver, also to reduce the risk of failing to find real but 
the light w ymatically turned off after 3.2 sec- subtle caffeine effects. However, the restricted avail- 
onds, The ic appearance of the high-beam ability of the simulator forced us to limit our study 
light was cd to the subject by stating that to approximately 50 experimental trials. Thus, with 4 
a simulate: ct in his headlights would occa- a two-dose design (¢.g., 200 400 milli- 
sionally cau. em to switch from low beam to grams, and placebo), we could have studied only ' 
high beam. matically and noiselessly. He was 16 subjects; while with a one dose design (200 milli- 
instructed tc =x his lights as rapidly as possible, grams of caffeine and placebo), we could have 
whenever th -beam indicator light appeared. studied 24. Although we chose the latter alterna- 
During cach nutes of driving, eight high-beam tive, we did so realizing that our experiment micht 
Signals were presented. be too insensitive to reveal a statistically significant 
Desi effect with the 200-milligram dosage. For that rea- 
sign son, we adopted a dose-supplementation procedure 


In this comparison of orally administered caffeine 
and placebo, cach subject served as his own control 
in a double-blind, counterbalanced study in which 
12 subjects received caffeine first, and 12 subjects 
received placebo first. 

Drugs were administered in the form of white 
Peppermint-flavored tablets taken with water. Ap- 
Proximately 30 minutes after the initial dose of 
medication (tablets containing either 200 milligrams 
of anhydrous caffeine or placebo), the subject began 
a uninterrupted drive of 90 minutes. After this, 

ere was a break of approximately 10 minutes, 

which the subject left the car, took a sup- 
Plemental medication (repeat of the initial dose), 
and was instructed to void. Thereupon the subject 
ey to the car and began a second 90-minute 


Because the dose-supplementation procedure rep- 


„sents an unconventional experimental design, its 


Motion for this study merits comment. The un- 
aoe Combination of realism and laboratory pre- 
on provided by the driving simulator made it 


which made it possible to study a somewhat larger 
dose (as well as the 200-milligram dose of caffeine) 
without increasing the number of experimental 
sessions required. By administering 200 milligrams 
of caffeine as a supplemental dose two hours after” 
the first dose, it was possible to obtain information 
about the effects of 200 milligrams of caffeine dur- — 
ing the first 90-minute driving period and to obtain 
information about a higher level during the final 90- 
minute driving period. Although the effective level 
of caffeine during the final 90-minute drive cannot 
be specified in this study, data reported by Axelrod 
and Reichenthal (1953) concerning the fate 
caffeine in man, made it reasonable to assume that 
blood plasma levels of caffeine during the final 90- — 
minute drive would be higher than those during the 
first 90-minute drive. a 


Performance was evaluated according to 


= 
four indices of alertness: (a) response time 


RESULTS 


 —_ * Standard deviation of caffcine-placebo 
(Cochran & Cox, 1950, p 115) with variance due to order ex 


tp < as. 
“p <w, 


to lead car accelerations; (c) response time to 
“presentations of the high-beam signal; and 
(d) the number of missed high-beam signals. 
be effects of caffeine were assessed after an 
initial 200-milligram dose of caffeine (the 
st 90-minute drive) and after a supple- 
mental 200-milligram dose (the second 90- 
‘minute drive). In all analyses, response times 
ere transformed to logarithms. 
Lead car decelerations. Response time to 
lead car deceleration was measured from 
start of the deceleration to the first sub- 
_ sequent release of the Rambler's accelerator 
by the subject. For both the initial and the 
‘supplemental medication treatments, responses 
lead car decelerations were significantly 
er after subjects had received caffeine than 
placebo (see Table 1). The improve- 
ment in response time was 12% after the ini- 
i dose of 200 milligrams of caffeine and 
179% after the supplemental dose of 200 milli- 
g of caffeine. 


Lead car accelerations. Response time to a 
d car acceleration was measured from the 
: of the acceleration to the first subse- 
quent depression of the Rambler's accelerator 
by the subject. For both drug levels, responses 
to lead car accelerations were significantly 
after caffeine than after placebo (see 
able 2). The improvement in response time 
Was 13% after the initial caffeine dose and 
% after the supplemental dose. 

1 
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TABLE 2 
Krall Tom rO Leap CAR ACCELERATIONS 


differences (in logarithms) ie based on analysis of variance ‘ 
tracted from the estimate 


> Percent improvement le reported in terms of antilogarithms of means following the formula: SS 


to lead car decelerations; (b) response time 


variability; 4/ 


High-beam signals. Respo: 
beam signal, as measured by | 
between the onset of the high 
light and activation of the 
was significantly faster afte: 
after placebo (see Table 3). 
did not contribute to the sul 
time score.) There was an 11 
in response time after the in 
gram dose of caffeine and a 
ment after the supplemental & 

Missed high-beam signals. CG 0: 


ie interval 


30% fewer signals after the imiia 
caffeine and 50% fewer signals afier the sup- 
plemental dose. Both differences were sta- 
tistically significant (see Table 4). 
Comparison of 90-minute periods. AS al- 
ready indicated by the percentage of improve 
ment figures, the drug effect (caffeine-ph 
difference) was somewhat greater during 
second 90-minute drive than during the first 
This was not surprising in view of the dose. 
supplementation feature of the experimen 
design. Although each of the four indices 
showed a trend toward a larger effect during 
the second than during the first 90-minute 
drive, the difference was statistically sien 
cant with only one of the four; namely; 5E 
sponse time to lead car accelerations US 
10.72, df = 1/115, p < .005). 10 
Analyses jor 30-minute comparisons. 
shown in Table 5, none of the four caffeine 


Rá 


Drug 
(Ut 


eiue | 
cebo 
alleine 


'lacebo 


cons for the first 30 minutes 
inute drive was statistically 
he 20 remaining 30-minute 
were in the direction of caf- 
vlertness, and 14 were statis- 
it. These analyses indicate 
‘lect was definite enough to 
caffeine-placebo differences 
time units, that is, 30 min- 
he statistical significance of 
reported in Tables 1—4 rests 
appearing throughout the ex- 
the first 30-minute interval of 


the first dris 
The break vn of the data into 30-minute 

tervals aiso permits inspection of changes in 
he drug efiect (caffeine-placebo difference) 
Over time. Table 6 summarizes the analysis 
Variance performed on each of the four 
indices to test these changes. With two of the 
four indices (response times to lead car ac- 
felerations and to high-beam signals) signifi- 
t differences in the drug effect among the 
Six intervals were found. This was due mainly 
10 a progressive increase in drug effect over 
me (perhaps caused by progressive fatigue, 
increased absorption of caffeine, or both) 
‘4nd partly to changes in drug effect over the 
ak. Comparing the drug effects (caffeine- 
Placebo difference) obtained in the 30-minute 
intervals immediately preceding and immedi- 
Stely following the break showed results 
ch were not uniform from index to index 
e Table 5). The drug effect increased 
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TABLE 3 


Kesroose Tiue ro Hicon- Bean Seonata 


SS oe SS — — 


caffeine-placebo differences logarithms) ls based on analysts of variance for a creerover 
pretended mir a cn y Sh nba . 22. 
is reported in terms of antilogarithine of means folhowing the formula: ~ 7 uo. 


SD 


N 


after the break with two indices (response 
time to decelerations and missed high-beam 
signals), remained about the same with one 
(response time to accelerations), and de- 
creased with the fourth (response time to 
high-beam signals). The last mentioned in- 
dex was the only one for which change across 
the break was statistically significant (F= 
4.28, dj = 1/115, p < .05). 

In general, the analyses of separate 30- 
minute intervals failed to reveal simple time- _ 


TABLE 4 

Misst Hian- Bau SIGNALS d 

p a ee J 

x 2 

Drug ber | SDs t 

Dose condition | m proves 

Initial Sacto} 528) 10. | 203° |) 
Placebo 4.63 

Supplemental | Caffeine | 2.75 | || ae 
Placebo | 5.50 


Standard deviation of caffeine-placebo differences is based 
1 — pa variance for a cross-over design (Cochran & 
Cox, 7250.7 rot) ake ar to order extracted from 

te of varia! yi = 22. $ 
s Percent improvement is reported following the formula? 


oof (100). 


+p <.05. 
+p < 01. 
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) TABLE 5 
Combe Axo Pracepo Means ror Eacn or Srx 30-MINUTE COMPARISO) 
f INDICES OF ALERTNESS 
l | Driving intervals after initial dose* Driving intervals afte ental dose* 
Drug Minutes 
0-30 30-60 60-90 0-30 30 60-90 
J — — 
Response time to lead car decelerations 
Caffeine 887 ⁊ k -600 642" | .596 604 626 
Placebo 639 | -663 | 684 -696 | 666 O84 
Response time to lead car accelerations 
Caffeine | 671 | 642" .649* .562** .625* 628" 
Placebo 602 | 725 753 .702 784 801 
Response to high-beam signals 
Caffeine 145 177 * 149. 188 142 129*˙ 
Placebo 155 237 228 -198 235 .220 
is 4 — 
: Missed high-beam signals 
i — = P — 
Caffeine 83 1,04* 1.38 1.12* 67** 90. 
N Placebo 1.25 1.71 1.67 1.88 1.88 1.75 
r 8 time means ate in logarithms. The error term used for each t test was obtained as indicated in ti: ote to Table Hi 


1 8.05. 
22801. 
* <.001. 


changes in drug effect that might be suggested 
by the results reported in Table 5. This study 
clearly demonstrates that caffeine, when com- 
pared with placebo, did produce improved 
performance on all indices, but inferences con- 
cerning more complex caffeine effects (such as 
interactions among effects of drug, time, and 
performance index) are, we believe, unwar- 
ranted. 


Discussion 


The results of this study are consistent 
With reasonable expectations and with em- 
_ pirical findings of three earlier studies (Baker 
& Theologus, 1972; Hauty & Payne, 1955; 
Payne & Hauty, 1954) concerning the effects 
of caffeine on tasks involved in vehicle op- 

eration. Perhaps the most important aspect of 
the present study is that the effects of caf- 
feine were evaluated in a situation which 


provided the advantages of lal 
trol without undue sacrifice of 5 
oratory findings are often limited by thelt 
questionable transfer to evenis outside the 
laboratory setting. On the other hand, a 
tempts to study phenomena in natural set 
tings often sacrifice manipulative control and 
precision of measurement. š 
The present study was conducted in æ 
environment that provided a comprehensive 
and coherent set of stimulus inputs whi a 
produced a degree of realism not usualy 
found in laboratory studies. The subject 5è 
inside a full-sized automobile and recelV 
visual, auditory, and tactile inputs associat 
with being so situated. This was augment 4 
by a simulation of the exterior environme? 
involving such things as motor sounds, uae 
effects, and a visual field that included o 
rushing scenery and a continuously movil 
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ABLE 6 


vor THE CAPYEINE—PLACEBO 
Over Six INTERVALS 


Asacyses ov V 
Dur 


Source di MS* F 
Respor lead car decelerations 
Intervals (1 5 010 48 
Subjects (5 23 037 1.76 
Error (I X 115 021 
| — 
Resp to lead car accelerations 
I 5 107 3.34˙ 
8 23 050 1.56 
IXS 115 .032 
l Res: ne to high-beam signals 
I 5 085 2.69* 
8 23 O15 115 
IXS 115 -013 
A high-beam signals 

1 5 | 2.48 1.23 
8 23 3.86 1,91 
IXS 115 2.02 
* Response t a ae in logarithms, 

<.05. 
"> <.01. 


highway cen, ine. Within this setting, the 
caffeine-enhanced performance was broad in 
nature, encompassing responses to both grad- 
ually appearing signals (changes in lead car 
Speed) and signals which were discreet (on- 
off high-beam signals). Moreover, the lead car 
Signals came from the roadway in front of 
the driver's car, while high-beam signals, 
which were relatively unobtrusive, came from 
the instrument panel inside the car. 

e somewhat unconventional two-stage, 
Ose-supplementation procedure was selected 
tor this study with the expectation that caf- 
feine-induced alertness would be easier to 
detect later than earlier during the three 
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hours of driving, because of progressive in- 
creases in caffeine concentration in the blood. 
The observed tendency for the caffeine- 
placebo difference to be greater during the 
second than during the first driving period is 
consistent with that expectation. Moreover, 
the fact that the caffeine-placebo differences 
for the two 90-minute drives were consistent 
with each other adds reassurance to the con- 
clusion of a dependable caffeine-placebo dif- 
ference. The results for the second drive 
cannot be said to comprise an independent rep- | 
lication of those for the first, but the consis- 
tency observed does provide a kind of cross- 
validation, and this adds to the confidence 
with which we conclude that the behavioral 
parameters investigated in this study are 
dependably altered by caffeine. Similarly, the 
fact that 14 of the 20 thirty-minute com- 
parisons made after a 60-minute absorption 
period showed statistically significant su- 
periority of caffeine over placebo also adds to 
the confidence of that conclusion, 
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In an earlier series of experiments Laugh- 
© ery, Alexander, and Lane (1971) explored the 
| effects of several task variables on facial rec- 
< ognition. The paradigm of those studies re- 
embled the situation in which a witness who 
has seen a criminal tries to identify that per- 
_ Son’s picture from a set of alternatives, The 
results indicated that one variable in particu- 
lar, target position, had a strong effect on 
recognition performance. Recognition per- 

rmance decreased as the number of decoy 
pictures preceding the target increased. As 
_ Laughery et al. (1971) pointed out, the ef- 
fects of target position could be accounted 
for either on the basis of interference due to 
the number of decoy faces intervening be- 
tween the original target exposure and the 
appearance of the target in the search series, 
or on the basis of the greater amount of time 
elapsing when the target appeared in a later 
position. 
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Four experiments investigated the ability of undergraduate students t 
nize a target person (whom they had previously seen in slides or a filn 
test series of pictures. The first study, which manipulated time delay | 
seeing the target and starting the test series (from four minutes to onc 
demonstrated that delay had no effect on recognition performance. Th 
three studies manipulated similarity between the target person and the < 
pictures preceding the target in the search series. The greater the target 
similarity, the poorer the recognition performance. A significant target 
tion-similarity interaction indicated the presence of a larger perforn 
decrement due to position when similarity level was high. 
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The issue of which of th 
tions more appropriately acı 
sults of the position effect 
long-standing problem in p 


vo explana 
for the re 

course, & 
ry. Studies 


that have manipulated the it of inter- 
vening material between tra and testing 
in retention studies have t: nally con- 
founded interference and ti: ig (decay) 


lend itself 
lution, there 
gain some 
rtance of the 
uld be noted 
these factors 
the design of 


factors. While this issue doe 
to any clearcut experimental re 
are procedures which can hel; 
insight into the relative imp: 
two factors. Furthermore, it sh 
that the relative weighting of 
has important implications for 
criminal identification systems. 
Most current criminal identification files 
are organized strictly on the basis of sex 
race. One development that might take place 
involves designing a personal appearance cod- 
ing system that would permit faces to be 
classified on the basis of other characteristic 
(color of hair, bushiness of eyebrows, etc.). 
The size of the search set could then be 
decreased by additional descriptors of the 
criminal. Note, however, that such a Pr 
cedure would result in a search set where the 
decoys are more similar to the target an 
presumably would produce a greater amount 
of interference. If interference is the major 
contributing factor, the additional interference 
caused by the more similar decoys might ofi- 


naller search set. Ii time 

hand, were the more 
0 mphasis would be placed 
itness to the identification 
„ delimiting of the search 
vith less concern for the 


set the value 
delay, on th 
potent factor 
upon getting 
task quickly 
set could pri 


increased int e due to greater similar- 
ity. 

This artic ats the results of four ex- 
periments, í ding with time delay and 
three conce: cial similarity in the iden- 
tification tas different similarity studies 
employed d definitions of similarity. 

ERIMENT | 

In this ex nt the time delay between 
the initial re to the target and the 
start of the ch series was manipulated 
over six vali nging from four minutes to 
one week. 

Method 

Subjects. T biects were 292 undergraduate 
students cnro n an introductory psychology 
course at the te University of New York at 
Bufalo, Partic in the study partially fulfilled 
a course requ it 

Design. Th: in this experiment was basically 
the same as i ose reported by Laughery et al. 
(1971). The s ts were first exposed to a target 
person, vie win ther four sequentially presented 
Slides of the tarvt person in different candid posi- 


tions or viewing 2 one-minute movie film clip? of 
ene in which the i 

tant role. The 
„point scale (dcñnitely not, probably not, possibly 
Rot, possibly yes, probably yes or definitely yes), 
Whether cach side in a subsequent, seri pre- 
" Sented test series was or was not the target. The 
pn: picture appeared only once in the search 


In the four candid position slides, the target was 
hown in positions ranging from left side to right 
Side, full length, and bust views. The candid posi- 
tions were obtained by taking a large number of 
Pictures of the target, and selecting those that 
Seemed least posed. One target was shown with 
Süsses in both the four candid position slides and 
the test series; while a second target appeared 
without glasses, The slides were projected so as to 

approximately life-size on the screen. 

The film depicted a check-cashing episode at a 
department store. It was prepared by WCNY tele- 
Vision at Syracuse, New York. Five characters ap- 
thea in the film: three female customers, a = 
Somen and a male clerk. The male customer an 


Further details of the apparatus and procedures 
“mployed may be obtained from the author. 
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A separate group of subjects was used in cach con- 
dition, with the number of subjects per condition 
A point to be noted is that the two targets in the 


ibj in each of the 24 experimental con- 
PEER as a group, with groups is sub- 
j mly assigned to the various conditions. 
e carried out in a normal uni- 


The room was darkened 
to insure good vision of the slide or movie, but with 
sufficient light to read and 
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the target picture 
in Position 104. Although the 
only 180 slides in the test series, the 
were numbered to 200 in order to 
bias that might result from antici- 


iH 
iz 


in this experiment, the time 
introduced between the presentation of 
tte target and the second part of the instructions. 
0 minute condition there was no delay, and 
the second part of the instructions was presented 
immediately (it was these instructions that filled the 
4-minute interval). In the 30-minute and 1-hour 
delay conditions the delay period was partially filled 
by running the subjects in a separate memory span 
_ for letters task. In the 4-hour, 1-day, and 1-week 
_ conditions, the subjects were run in the span task 
and then excused. They returned, of course, at the 
appropriate later time. 

It was decided that if any subject knew the par- 
_ ticular target person to whom he was exposed, he 
Would be excused from the experiment. No subjects 
indicated knowing any of the target persons. The 
_ subjects were asked to indicate on their answer sheet 
if they knew any of the decoys. There was a negli- 
gible number of responses indicating any subject 
knew a decoy face. 


Results 


The responses to the target picture in the 
test series are referred to as the hit-miss 
(H-M) scores. Similarly, the responses to the 
149 decoys are referred to as false alarm- 
correct rejection (FA-CR) scores. For a 
given subject the H-M score could be a single 
value from 1-6, and the FA-CR score could 
_ range from 149 to 894 (the sum of all other 
response scores). A higher H-M score and a 

lower FA-CR score reflect better performance, 
Io analyses were carried out on the re- 
sults, The first analysis was based upon each 
subject’s response to the target picture, the 
H-M score. The mean H-M score for each of 
the 12 experimental conditions, collapsed 
across targets, was calculated. An analysis of 
variance was carried out on this data to de- 
termine the effects of the delay and mode 
Variables. The presentation mode had a sig- 


F 
> 


LAUGHERY, FESSLER, Lenorovitz, AND YOBLICK 


nificant effect (F = 50.) 
001) with performance 
tion where the exposure t 
of viewing four casual 
overall means for the c 
movie scene were 4.4 and 


1/204, p< 
in the com. 
rget consisted 
graphs. The 
pictures and 
spectively, A 


significant delay by mod action (F= 
2.93, dj = 5/204, p < .05 orobably due 
to the fact that perform: as especially 
poor in the 30-minute de! ie-mode com 
dition. (1.8 versus 4.6 in rinute delay, 


slide condition) 

The second analysis w 
FA-CR scores. The mean 
(the total FA-CR score di; 
each of the 12 experimen: 
computed. The mode of | 
was again significant (F = 
$ < .005) with performan 
FA-CR scores) in the fou: 
dition. The means for the s! 
were 1.19 and 1.32, respecti 
delay effect was not signifi 
mode interaction did reac! 
4.22, df = 5/204, p < .005 
is probably due to the fa 


ised on the 
e per decoy 
by 149) for 
nditions wat 
tation effect 
df = 1/20, 
erior (lower 
1 slides con 
1 film modes 
\Ithough the 
the delay by 
icance (F= 
is interaction 
it in the + 


minute delay condition thers a reversal of 
the mode effect; in other 5, with this 
delay the film mode led to etter FA-CR 


score (1.3 film versus 1.5 slid 
Discussion 


The focus of interest in this «> periment wa 
on the delay variable. Somew!::: surp Ya 
the results show quite clearly tbat the degre’ 
of time lapse between exposure to the ta 
and the beginning of the search through the 
test series had no meaningful effect on Per 
formance. The finding is surprising, since it 
was our expectation that the longer delay 
would lead to lower recognition performance 
Clearly, the vast literature on human vers 
memory indicates such an effect in ti 
and recognizing material. Apparently, me * 
for faces in this type of task does not a 
teriorate simply with the passage of time, 
least not over a period of one week. mode 

Given the nature of the delay by ! 75 
interaction with the H-M and FA- CR a 
it is difficult to draw any firm conclusi : 
about the factors underlying these effects. i 
the analysis based upon the FA-CR scores 


be that 
day conditi 
confusion 


elatively short 4-minute 
to a differential amount 
> two mode conditions. 
ion would be reflected in 
t is anything but clear as 
rential confusion should 


presentation mode varia- 
consistent. The fact that 
‘er on the recognition task 
ter the film may seem 
It might be assumed, for 
iynamic presentation in the 
information which in turn 
ognition performance, The 
ults are just the reverse, 
ın that the two-dimensional, 
s are simply more like the 
static slides in the search 
cently resulting in a better 
ther possible explanation is 
wolved distracting “drama” 
cet faces. A third possibility, 
the two targets in the slides 
ı some way more recognizable 
vets in the movie condition. 
is of the targets, however, 
' any hints as to what such 
mnerences v 
The result. of Experiment 1 indicate that 
he target-po-ition effect reported by Laugh- 
ery et al. (1971) was not due primarily to the 
Mifferential passage of time. Instead, it seems 
fasonable to speculate that the more im- 
ant factor is the interference caused by 
intervening faces. 


EXPERIMENT 2 

The purpose of the similarity studies was 
fo examine interference by varying the simi- 
larity between the target person and the 
decoys that appeared in the series before the 
target person. A major problem in carrying 
‘Out an experiment of this type concerns the 
‘definition of similarity. Three separate def- 
ions of similarity were employed. The first 
Was based upon results of the time delay 
study reported previously. For each of the 
149 decoys a mean (across subjects) FA-CR 
Te was available that reflected the extent 
to which each decoy was mistaken for each 
set. Hence, these results could be used 
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to deñne the similarity between a particular 
decoy and target. The second definition was 
based upon physical characteristics of the 
people whose pictures are in the slide library. 


The third definition was based upon a direct 


similarity rating given by the subjects. 


Experiment 2a 


ment 2a was a similarity study em- 
ploying the definition based upon FA-CR 
scores from the time delay study. 


Method 
All phases of Experiment 2 drew from the 


Experiment 1, 
essentially 
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Two separate analyses were carried out on 


the data using the H-M and FA-CR scores. | 
H-M analysis, the effect of mode was 


In the 
highly significant (F = 19.03, df = 1/84, ? 
< 001), with superior performance again in 
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the four casual slides condition. The mean 
H-M scores for the casual photograph and 
film modes were 4.3 and 2.7. Neither the 
main effect of similarity nor the Mode X 
similarity interaction was significant. The 
means for the high- and low-similarity condi- 
tions were 3.3 and 3.7, respectively. 

The analysis of FA-CR scores revealed a 
significant similarity effect (F = 5.47, df= 
1/84, p < 025), with means of 1.4 and 1.6 
for the low- and high-similarity conditions, 
respectively. This finding is not particularly 


interesting since the two similarity conditions 


were defined by differences in the FA-CR 
scores obtained in the previous experiment. 
The result does, however, serve as a useful 
check on the success of manipulating the simi- 
larity variable. 


Experiment 2b 


Tn the second similarity study the defini- 
tion of similarity was based upon nine physi- 
cal characteristics of the people whose pic- 
tures were in the film library. The physical 
characteristics of the people in the slides had 
been recorded at the time the pictures were 
taken (hair color, hair length, age, build, 
eye color, glasses, moustache, beard, and 
length and shape of sideburns). For 315 of 
the white males in the film library, the num- 
ber of characteristics each pair has in com- 
mon was counted. A similarity matrix for 
these 315 pictures was prepared and used to 
select high- and low-similarity decoys. The 
values in the cells of the matrices could 
change from zero to nine. One point should 
be noted about this definition of similarity. 
The measurements involved simple observa- 
tions or estimates on the part of the people 
taking the pictures; consequently, this simi- 
larity definition should only be regarded as a 
rough estimate. N : 


Method 


One hundred and one subjects participated in this 
portion of Experiment 2. The apparatus, materials, 
and procedure were exactly the same as Experiment 
2a with one exception. This exception was the mode 
of presenting the target in the original exposure; all 
targets were presented by showing four casual photo- 
graphs. $ 

Given the similarity matrix, an effort was made to 
select 52 high- and 52 low-similarity decoys for a 


LAUGHERY, FESSLER, LENOROVITZ, AND YoBLI‘ 


number of targets. Two 
highly similar if they ha 
istics in common, wh 
as four or less characteris 
seem inconsistent that low 
fined as two or less chara 
problem was that too 
characteristics. Since the 
and minimize similarity in 
ditions, respectively, it We 
attempt to equate the low 
selecting the high group fro 
common characteristics. 
high and low similarity, it 
find 52 high- and 52 low-si 
given target person; there 
having both 52 similar an 
Consequently, in this study 
were used in each of the 
resulting in a total of eight 
subjects. The number of subi 
from 10 to 16. 


were considers 
r nine character 
rity was defined 
common. It may 
rity was not de. 
in common, The 
W i | so few common 
ob was to maximize 
h and low cos- 
d to forego any 
zh conditions by 
with six to nine 
e definitions of 
1 impossible te 
decoys for any 
were no targets 
ssimilar decoys. 
different targets 
arity conditions, 
ms or groups of 
r group ranged 


Results 


it on the data 
scores, Scores 
different tat 
mditions, The 
ind high-simi- 


Two analyses were cari 
using the H-M and FA 
were collapsed across thi 
gets in each of the similo 
mean H-M scores for the 


larity conditions were 5.08 : 4.42, respect 
ively. A £ test showed thi fference to A 
significant (f = 2.05, df = 109, p < 025). 
In the analysis based upon FA-CR scorts; 
the mean for the low-similarity condition was 


it was 1.44. 
cant. 


1.40, and for the high conc 
This difference was not sign! 


Experiment 2c 


The definition of similarity in this study 
was based upon a direct rating of similarity 
between faces obtained from a large number 
of subjects. ‘led 

A matrix of similarity ratings was compile 
for 222 faces in our picture library by 023 
taining similarity ratings for pals. 15 
matrix of ratings served as a basis for de = 
ing the relative similarity of any two me $ 
bers. A minimum of nine subject ratings W 
available for each cell (pair). 


Method 


2 solved 

One hundred and fifty-two subjects were involve 
in Experiment 2c. Except for a few changes 

task, apparatus, materials, and procedure in the 
basically the same as the earlier studies. As 
second similarity study, the initial exposure 
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target always l of seeing four casual slides. 
However, one ce in procedure involved the 
position of th in the search series. In this 
experiment th t position was manipulated as 
a between-sul iriable and appeared either in 
Position 41 or 

Given the ty matrix derived from the 
ratings, the s juences for the high- and low- 
similarity cor were formed as follows. Each 
of the 222 s considered a potential target. 
For each o pictures the other 221 were 
ordered on t of their similarity to that pic- 


ture. Then picture the mean similarity for 
the 100 mos pictures was calculated. The 
three slides highest mean similarity for 
these 100 m ilar pairs were selected as the 
high-similari the appropriate 100 pictures 
were used : oys in the Position 101 condi- 
tion. In the way the three slides having the 
lowest mear ty for the 100 least similar pairs 
were selectec ts for the low-similarity condi- 
tions. For | tion 41 condition a set of 40 
decoys was from the 100 in such a way as 
to result in xœ mean similarity as the overall 
set of 100 six targets were selected: three 
that were to their decoys and three that 
were dissim their decoys. 

The desig 1 2X2 factorial with similarity 
(high and nd target position (41 and 101) 
both manip: s between-subjects variables. Each 
subject wa n two trials. To avoid obvious 
confounding irget was different in the second 
trial. In adc the levels of both the similarity 
and the posi:ion variables were changed. In other 
Words, if zoup viewed a high-similarity se- 
quence wit! target in Position 41 for the firs 
trial, they viewed low-similarity sequence with a 
different target in Position 101 for the second. 
Results 


Since the trials variable was confounded in 
the design with the other variables, and since 
it is always possible that some sort of dif- 
ferential transfer effect may exist, the data 
from the two trials was analyzed separately. 

S a rather gross check on a possible trial 
fect, however, 4 tests were run on the two 
‘mals for the H-M and FA-CR scores col- 
lapsed across all other variables, While there 
Was virtually no difference between trials on 
the H-M scores, the difference was significant 
for the FA-CR scores (t= 3.67, df = 302, 
P< 001. The FA-CR scores were lower on 
the second trial. A possible explanation for 
a 5 is a criterion shift with respect 
Tri e decoys. The practice acquired during 
r allows the subjects to become more 
amiliar with the task, perhaps learning that 
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the target is 
or that the target is not to be presented with 
small changes in appearance. This familiarity 
with the task could lead to greater confidence 
in rejecting decoys and, thus, a lower FA-CR 
score. 

The mean H-M scores for each of the four 
experimental conditions in Trials 1 and 2 
were computed. An analysis of variance 
showed the main effect of similarity to be 
highly significant (F = 27.13, df = 1/148, $ 
< 0001) on both trials, with better recogni- 
tion of the target in the low-similarity condi- 
tion. With respect to the position variable, 
Position 41 led to slightly better performance 
on Trial 1 and significantly better perform- 
ance (F = 6.76, df=1/148, p< .01) on 
Trial 2. The Position X Similarity interac- 
tion was significant in Trial 2 (F = 5.79, df 
= 1/148, p < .02) but not in Trial 1. The 
data underlying the interaction indicated that 
the difference between Position 41 and 101 
was greater in the high-similarity conditions. 

An analyses of variance was carried out on 
the FA-CR scores for each of the trials. The 
main effect of similarity was significant (F = 
12.51, df = 1/148, p < .001), in Trial 1, with 
better performance in the low-similarity con- 
ditions. The mean FA-CR scores for the high- 

and low-similarity conditions were 1.2 and 1.4, 
respectively. All other effects were not signifi- 
cant. 


Discussion 


The results of the three similarity studies 
exhibit a clear and consistent pattern. In Ex- 
periments 2b and 2c the lower similarity de- 
coys led to a significantly higher hit rate on 
the target picture. While the results of Ex- 
periment 2a did not produce a significant 
similarity effect on the H-M score, the means 
were in the expected direction. An appropriate 
conclusion can be drawn: the greater the simi- 
larity between a target and the decoys preced- 
ing it, the lower the probability that the 
target will be correctly identified. 

The results of the FA-CR analyses are 
somewhat confusing. The prediction is that 
higher similarity should lead to higher FA-CR 
scores. The effect was significant in Experi- 
ment 2a and Trial 1 of Experiment 2c, but 


not likely to appear frequently 
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pot in Experiment 2b or Trial 2 of Experi- 
ment 2c. In the latter case the explanation 
may be due to the criterion shift mentioned 
earlier, which may have produced a perform- 
ance level that washed out the difference (a 
ceiling effect). No explanation for the lack of 
a significant FA-CR effect in Experiment 2b 
comes to mind, although it should be noted 
that the difference between the means, while 
quite modest, is in the expected direction. 

The significant Position X Similarity inter- 
action on Trial 2 of Experiment 2c is most 
interesting and consistent with the arguments 
presented in the present context. If each 
similar decoy is contributing more interfer- 
ence than each dissimilar decoy, one would 
expect a larger performance decrement due 
to position when the similarity level is high. 
This is precisely what happened. 


GENERAL DISCUSSION 


= Returning to the issue of interference ver- 

sus decay posed in the introductory section 
of this article, these studies clearly indicate 
that the important variable in facial recogni- 
_ tion is the number and similarity of the faces 
preceding the target picture in the search 
series and not the amount of time elapsed. 
Indeed, the results of the third similarity 
study, 2c, show that the similarity between 
the decoy and target faces is more important 
than the number of decoy faces. 

The results have definite implications for 
the design of criminal identification systems 
involving a linear search through a series of 
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pictures. Reducing the the search set 
by eliminating alternati at differ irom 
the target on the basis in descriptos, 
will result in a more eneous, higher 
similarity set. This hig! nilarity could 
produce a level of interfe hat more than 
ofisets the desirable eff: iewer altema- 
tives. 

One possible directiot identification 
systems might take is to way from the 
linear search procedure example, an 
interactive search might veloped. Such 
a procedure would invol\ wing the wit- 
ness a picture from the he witness rè- 
acting by noting differenc: cen the target 
and the person in the | and a new 
alternative selected on th of these dif- 
ferences would be shown witness. The 
process would continue in manner until 
an identification is mad 1 termination 
criterion is reached. This of interactive 
approach based upon a “z in” or (tres. 
like” search procedure i: lerably more 
powerful than the linea hn. The data 
from the experiments rep: 1 this artide 


indicate that the consider: 
procedures is clearly warra 
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model which suggests that 
irs (the employee’s environ- 
pectations of individuals con- 
ior. Role theory posits that 
expected of an individual by 
nization are inconsistent, he 
role conflict and will experi- 
„b e dissatisfied, and perform less 
`: the expectations did not con- 

o predicts that role ambiguity 
lack of information about the 
employee does not know what 
hat authority he possesses, or 
evaluated, he may hesitate to 
ıd will have to rely on trial and 
in meeting the expectations 


ble tasks. If 
duties are 


n. 
H etica] relationships regarding role 
conflict argued in the preceding paragraph are 
Supported by a body of empirical literature. 
et al. (1964) found that high levels of 
Re Conflict were related to (a) low levels of 
b satisfaction, (b) low confidence in the orga- 
mization, and (c) a high degree of job-related 
fo Sion, Gross, Mason, and McEachern (1958) 
und a significantly negative correlation between 
red role conflict and three measures of job 
“itisfaction. Tosi (1971) found that role con- 
ct was positively related to job threat and 
anxiety and significantly related in a negative 
rection to satisfaction with the job. 
oy ambiguity has also been examined in the 
Ee fashion. Just as with role conflict, Kahn 
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ONSHIP OF ROLE CONFLICT AND ROLE AMBIGUITY 
TO JOB INVOLVEMENT MEASURES 


W. CLAY HAMNER ! M HENRY L. TOSI 
Graduate School of Business, Michigan State University 


‘s of role conflict, role ambiguity, and various job involvement vari- 
ere obtained for 61 high-level managers. Role conflict was negatively 
to the amount of reported influence and positively related to the 
t of perceived threat and anxiety; while role ambiguity was negatively 
ted with job satisfaction and influence, and positively related to job 
ind anxiety, It is suggested that organizational level be taken into ac- 
when studying the relationship of role stress factors with job involve- 


et al. (1964) found that high levels of role 
ambiguity were related to (a) low levels of job 
satisfaction, (b) low confidence in the organiza- 
tion, and (c) a high degree of job-related ten- 
sion. Cohen (1959) concluded that ambiguity of 
the situation and inconsistencies of directions 


raised the anxiety of subordinates, caused a 


less favorable attitude toward supervision, and 
lowered productivity. 

When Rizzo, House, and Lirtzman (1970) ex- 
amined role conflict and ambiguity they found 
that both tended to correlate (a) weakly, but 
positively with anxiety and propensity to leave 
the organization and (b) negatively with influ- 
ence in the organization. Hence, it appears that 
role ambiguity and role conflict result in un- 
desirable consequences for organization members 
and likely, therefore, for the organization itself. 

Some contradictory findings regarding the role 
conflict and ambiguity variables have been re- 
ported. While Rizzo et al. (1970) found role 
ambiguity correlated significantly with low satis- 
faction, their study revealed little support for a 
significant relationship between job satisfaction 
and role conflict. This finding contrasted with 


Tosi’s (1971) study which found just the op- 
no relationship between conflict and 


posite: 
satisfaction. 


In the light of these conflicting findings, the 


specific purpose of this research was to examine 
the relationship of role conflict and role ambi- 
guity to various job involvement variables. It is 
hypothesized that, as in the Kahn et al. model, 
high levels of role conflict and role 


(c) high levels of perceived threat and anxiety. 


Te A de ee, ely 


ambiguity 
are related to (a) low levels of job satis- 
faction, (b) low levels of job participation, and 
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ek 


This study also examined the relationship of 
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role conflict and role ambiguity to propensity 
do leave the organization. The relationships 
among the work involvement variables were also 
investigated. 


METHOD 
Subjects 
The data were collected from 61 high-level 
managers participating in an executive develop- 
ment program from many industries in a large 
midwest urban region. 


Instruments 


A 10-item instrument developed by Rizzo et 

al. (1970) was used to measure subjects’ per- 
ceptions of role conflict and role ambiguity. 
Rizzo et al. reported an estimate of internal 
consistency of about .80 for role ambiguity and 
‘82 for role conflict. The Role Conflict scale 
emphasized the incompatibilities or incongruities 
arising out of perceptions or performance of role 
_ requirements, while the Role Ambiguity scale 
_ stressed the clarity of behavioral requirements. 
A set of subscales was developed to measure 
the work involvement variables: The Job Satis- 
faction subscale used in this study was developed 
by Vroom (1963). The subjects responded to a 
S point scale with degrees ranging from “very 
much“ to “very little” on the three-item scale. 
Vroom reported test-retest reliabilties of . 75. 

A propensity to leave measure developed by 
Rizzo et al. (1970) was used to ascertain the 
plans for and conditions under which the respon- 
= dent would leave the organization, each item 
7 ranging from low to high propensity to leave. 
Participation was measured by the four-item 
scale used by Vroom (1963). While no reliabil- 
ities were computed in this study, Vroom re- 
ported test-retest reliabilities of 63. The items 
measured the amount of influence the executive 
a had cn the decisions made in the organization. 
The job threat and anxiety measure is an 
a priori scale developed by Tosi (1971). It was 
designed to measure the managers concern 
about his job as it may be affected by condi- 

tions in the future over which he may have 
little or no control. 
Ihe variables analyzed were all derived from a 
single combined instrument of approximately 
70 items. 


RESULTS 


Correlation coefficients were derived by testing 
r the significance of the relationship between vari- 
$ measures of job involvement and role stress 
riables. Role conflict was not significantly 


Snort Notes 


related to either job satis! 
the propensity to leave the org 
There was, however, a trend 
relationship to the amount 
(participation) (r = —.19, $ < 
positively related to the amo 
threat and anxiety (r = 27, % 

Role ambiguity, on the oth: 
tively correlated with job satis! 
p <.05) and was positively c 
threat and anxiety (r= 33. 
role conflict, role ambiguity w 
correlated with the propensity 
nization (r = —.07). 

Job satisfaction tended to rel 
the perceived threat and the 
by the job (r=—.19, p <.0! 
tively correlated with the amou 
fluence perceived by the en 
b .01). While job  satisfacti 
nificantly related to the propen 
organization (r = —.12), the ex 
the least influence were the on 
a greater propensity to leave 
(r =—.23, p < .05). 


DISCUSSION 


The results are consistent wi 
of other correlational studies 
support for the Kahn et al. moc 
conflict showed the predicted 
participation, perceived threat 
while role ambiguity also corre 
satisfaction, perceived threat anc 
dicted. Job satisfaction, surprisingly, 
relate with either role conflict or 
to leave the organization. Perhaj 
finding was due to the fact that the 
perceived role conflict as a “given” in an 
tive job, and because this conflict was © 
it did not produce dissatisfaction and a 
to leave the organization. i 

The executives who had the least influ 
the organization were the ones who expi 
the greatest desire to leave. This finding 
not seem unusual in that the role of an €x 
generally carries with it decision-making 
If this influence is not present, it ch 
expected that the individual would ! 
greater urge to move than he would W. 
has a great deal of influence in the O 
decision-making process. i 

Since it has been determined that T 
is multidimensional, it is not surprisi 
components of role conflict and role 
would act in different ways under differ 


Gens. This s: pported the findings of Rizzo 
eal. (1970 louse and Rizzo (1972) which 
showed role rity was related to job satis- 
faction, whi!: conflict was not. ‘The findings 
of this study „ted with the findings of Tosi 
(1971) and nd Tosi (1970) who reported 
that role cor is related to job satisfaction, 
while role a: y was not. 

By taking ser look at these studies, it 
appears tho results are not inconsistent, 
Instead, onc make the argument that the 
component s, which is related to the level 
of job satis is a function of organiza- 
tional level iake this hypothesis based on 
the nature amples studied. Both the loan 
officers (To ) and the teachers (Tosi & 
Tosi, 1970) the lowest level of their re- 
spective org ions. In fact, they are the ones 
performing primary operating functions. 
Given that now how to perform their jobs 
and that th bs are well defined for them, 
role ambigu probably not a major problem 
for these | order participants. They are 
much more ly to face conflicting role de- 
mands fro hers, and therefore, for these 
members ro aflict could be expected to pre- 
sent the g> cst problems and the greatest 
negative res ons, 

For indivi- in higher level positions, espe- 
cially those »anagerial roles, as is clearly the 
tase for the objects in this study and in the 
Rizzo et al. 0 and House and Rizzo (1972) 
samples, their primary problems may arise from 
lack of clarity in their job. Since it has been 
argued by many that the managerial job is one 
of solving unsiructured problems, or introducing 


More certainty into lower organization levels, it 
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could be reasonably argued that the effects 
higher ambiguity levels are more port 
determinants of satisfaction than are 
role pressures. 

In summary, we conclude that future investi- 
gators should take into account the organisa- 
tional level when studying the relationship of ro 
stress factors with job involvement measures, 
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STRESS AND PERFORMANCE DURING AND AFTER 
OFFICER CANDIDATE SCHOOL 


J. RICHARD JENNINGS, R. M. ROSE, ano LEO KREUZ 


Walter Reed Army Institute of 


Military performance during and after Officer Candidate School (OCS) w 
related to interview and questionnaire measures of stress and personality. 


company of candidates was studied early 
and after six months on duty as officers. 


and change in coping style related to class standing in OCS but not to subs 

quent military performance. Personality and maturational variables we 

related to performance after OCS. Motivational distortion by candidates a: 

initial impression formation by superior officers appeared to be factors 
standing 


determining OCS class 


Officer Candidate School (OCS) attempts to 
place an individual under physical and psycho- 
logical stress akin to that found in combat (U.S. 
Department of the Army, 1969). Candidates who 
successfully complete training under these condi- 
tions are commissioned second lieutenants. 

The current study examined the candidate’s 

reactions to the stress of OCS and his personal 
coping resources, Three psychological measure- 
ment approaches were used: (a) a psychiatric 
interview assessing “here and now” feelings and 
functional impairments during OCS; (b) a semi- 
projective test designed to measure reactions to 
the specific stressors of OCS and also to measure 
the individual’s coping repertory vis-à-vis these 
stressors; (c) a set of standard paper-and-pencil 
tests of personality traits. 
__ The psychological measures were related first 
to leadership rank (i.e., class standing) during 
an early, highly stressful period; second, to final 
OCS class standing; and third, to an assessment 
of the candidate’s performance after approxi- 
mately six months on his first permanent duty 
assignment. 
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in training, just prior to graduatio 
Psychiatric interview ratings of str 


METHOD 
Subjects and Design 


Subjects consisted of an entir: 
of 69 men at Fort Benning, Geor 
did not complete the course 
dropped from the sample. 

The first assessment was per! 
high-stress period (Week 3 with: 
program). Collection of test s 
data, and biological sampling (st 
& Jennings, 1972) took place 
within this third week. An iden 
period of data collection was conc 
minimal stress period (Week 22) 
ment was made approximately six ™ 
the candidates reported to their h. 
military assignment. 


Performance Measures 


Leadership rank (class standing within of 
was the basic measure of the candidates’ å 
to function efficiently under stress. The 
are assigned through a combination of scores 
academic military science subjects, physical tra 
ing scores and, most importantly, ratings by th 
immediate military superiors (tactical 
and their fellow candidates. A ranking of 
candidates is performed approximately €v 
four weeks. Ranks are assigned within a pla 
until the final ranking period, when a comp 
rank is assigned. 

In addition to these ranks, the physical traii 
ing scores of the men were included as à 86 
rate variable, and an index of performance 
quirements during Week 3 of OCS was Cre 
Termed “chain of command” points, this 
reflected the student command positions whicl 
candidate held during Week 3. Thus, for exa 
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a student pla ider received 4 points and a 
yad leader ts. 

Performanc: tive duty was assessed from 
a structured t ne interview with the former 
candidate’s ir e supervisor. The supervisor 
assigned a s nging from 1 to 7 for per- 
formance wi h of five areas. The arcas 
were (a) pur y in completion of work, (6) 
completeness ` work, (c) quality of the 
work, (d) sp learning new skills, and (e) 
budgeting an nizing time, 

Psychiatric [ w 

The three i igators as a group interviewed 
each candid r approximately 20 minutes 
during both and low-stress measurement 
periods. The t of the interview was struc- 
tured so tha candidate received the same 
questions wi milar wording; however, the 
questions we juently reordered, and specific 
additional qi s added. After each interview, 
each investig independently scored the indi- 
vidual at 1 n each of five categories: (a) 
performance functional impairment, (b) 
anxiety, em: response, arousal, (c) threat 
to self-esteem. 7) impulsivity, and (e) global 
stress, In ad 1, a report of sexual activity 
and sexual iy tion was taken. 

The avers strelation between the global 
stress ratings the three interviewers was .80 
during the his: ess period. This figure dropped 
somewhat dur the less stressful period due to 


the uniformly :ow ratings assigned to the candi- 
dates. Averag 


ratings were used for the data 
analysis, 


Stress Reaction Scale 


A semiprojective test of reactions to seven 
Specific stress situations encountered in OCS was 
developed according to the techniques of Gold- 
ried and D’Zurilla (1969). A candidate was 
asked to put himself into a situation, numerically 
ae his emotional reaction, describe what he 
ie do about the situation, and predict the 
come of the situation. The following is an 
item from the scale: 


` After a few days of academic classes in OCS, 
yor tactical officer heard that your class was 
5 in attentiveness and military bearing. As 
= aed class spent an hour low crawling 
to running, The next day you try especially hard 
1 a military bearing and address the 
R ctor as you are supposed to. Many of your 
0 however, do not seem to modify their 
50 room manner at all. You are concerned that 
u will be punished as a class again. 


<=. y 


80 


The answers were scored by two judges for cop: 
ing failure, solution inadequacy, and unfavor- 
ability of the predicted outcomes, Scores from 
1 to § were assigned; higher scores indicated 
greater stress, Interjudge correlations were 77 
for coping failure, .75 for solution inadequacy, 
76 for unfavorability. Averaged ratings wers 
used for the analyses, ‘ 

The 7-item stress reaction scale was admin- 
istered during the high-stress period at Fort 
Benning. A shortened 3-item form, based on the 
candidate’s individual choice of most relevant 
situations, was administered during the low-stress 
period, The tests were administered in a group 
setting, and candidates wrote out their answers, 
In addition to the basic scores for each category 
averaged over items, difference scores were cre- 
ated by subtracting low-stress-period item scores 
from the relevant high-stress item scores. 


Paper-and-pencil tests 

During the early, stressful period, the Mar- 
lowe-Crowne Social Desirability Scale, the Buss- 
Dirkey Test of Aggression, and IPAT (Institute 
for Personality and Ability Testing) Anxiety 
scale were administered in a single testing session. 
During the less stressful period the 16 PF (Per- 
sonality Factor) test, Form A, 1962 edition, was 
administered. After this testing session, the candi- 
dates rated each week of the OCS program for © 
the level of stress. a 


RESULTS AND Discussion 


Diference between the High- and Low-Stress 
Periods 

Mean differences between high- and low-stress 
periods for variables spanning this interval 
showed consistently the stressfulness of Week 3 
of OCS as compared to Week 22.* The stress 
reaction scale results indicate a decrease in the 
failure to cope score (f = 4.69, p <.001), but a 
no change in emotional reactions or assessments 
of the outcome of situations. 7 

The overall group results for the 16 PF scales 
may be relevant as a description of the men at 
the end of OCS. In comparison to the general 
population norms, the men scored significantly 
higher than the norms on five scales (ts > 2.37, 
df = 1183, p <.05). The men’s scores were sub- 2 
stantially higher than were the norms for Intelli- 
gence (Scale B), Dominance (Scale E), and A 
Surgency (Scale F). (Surgency is defined as Be 
enthusiastic, heedless, happy-go-lucky.) The men 
scored slightly higher than the norms for Ego 


For more complete tables, readers may write 
the first author. 
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TABLE 1 


Cors tate Berweex OCS (OFTICER CANDIDATE SCHOOL) PexrormMance AND S? ARLABLES 
{O DII Il e 
Performance scores 
yas Platoon rank: | Final company Physica Followup 
Week 4 rank: Week 23 training vcrformance 
Psychiatric interview* 
Functional impairment s —.26* — 1 
= Anxiety —.16 —.15 —.10 
Threat —.28* —.14 —.10 
Global stress —.37% —.23 —16 
Functional impairment: Week 3-Week 22 i —.17 0¹ 
Stress Reaction Scale“ 
Emotion 02 0 =. AS 
Coping Failure —.12 05 —.09 07 
Solution uacy —.03 05 —.12 ⁰ 
Unfavorability —.07 04 —.02 -03 
Coping: Week 3-Week 22 2⁵ 20˙ .22 19 
16 PF scales? 
_ Shy vs. Socially Bold (Scale H) -23 27* 15 00. 
Practical vs. Imaginative (Scale M) 21 =.09 02 31 
Forchright vs. Shrewd (Scale N) —.25 —.26˙ —.14 —33˙ 
Motivation Distortion: low vs. high .22 SAS 13 -.01 
Puberty and sex related scores 
13 16 —.13 ae 
—.05 —.02 —.12 28" 
Performance 
— 
— -70** 5 -09 
Company rank = z: .46** | 09 
Follow-up performance — — 19 = 
_ Chain of command: Week 3 —,29* 2 —.49% 04 
j — 


formance variables were found for all the psychiatric interview change Mw ap 


training score was partialed out. N = 58; N = 57 forf 


s from the total 4 X 49 matrix, 15% of independently assessed variables were ‘significant. 


Strength (Scale C) and Self- sufficiency or Non- 
_ group-dependent (Scale O2). 


Variables Relating to Performance during and 
aſter OCS 


Table 1 presents correlations of variables that 
related significantly to at least one of the per- 
formance measures. A number of other variables 


are — to give a balanced picture of the 
t3; 


The best predictors of rank during OCS vae 
the variables from the psychiatric interview 
Performance after OCS, however, was not . 
to these variables. All interview data show 
similar correlations and probably reflected a ges 
eral assessment of amount of disturbance the 
candidate expressed. As expected, during 


115 
5A correlation matrix containing all of the va" 
bles may be obtained by writing the first author. 


7 
sf expressed stress was associated with poor 
ng. The psychiatric interview did not 
particularly debilitating forms of psy- 
fal stress. Holmen, Katter, Jones, and 
on (1956) also reported general ratings 
i OCS performance but were un- 
identify specific predictors. 
Coping failure score of the stress reaction 
showed a negative relation to OCS rank, 
only after the men had an opportunity to 
m how to cope within the system. That is, 
ping failure score from Week 3 did not 
performance, but the difference score 
dates who shifted their answers toward 
ve, aggressive coping style attained better 
E Standings than those who did not shift. 
the Job Analysis and Interest Measure 
), Petersen (1971) and Petersen and 
pitt (1968) found similar shifts of behavioral 
€ within OCS. Petersen also found that be- 
ral style was related to success in OCS. 
Scales from the 16 PF test related to 
nce. In terms of the scale descriptions 
16 PF test, the venturesome, socially bold 
ranked better at Week 23 than did the 
reserved candidate; and the forthright, 
tral candidate did better than the shrewd, 

i candidate both at Week 23 and after 
ths on duty. Imaginative and inner-directed 

es were rated high by supervisors after 
hs on duty, although this scale was not 
to OCS class standing, The fourth rela- 
P involves motivational distortion, a lie 
e scale is keyed on shifts in response 
when the test is given in a job applica- 
g versus a neutral setting. The candi- 
$ showing motivation distortion tended to 
better ranks than the other candidates; that 
lidates who were concerned with “looking 
Were rated as doing better. 
fairly marked relationships between later 
of puberty and masturbation and higher 
Nance ratings after OCS were unantici- 
The result may indicate the value placed 
S Officer status by the late-maturing indi- 
al (cf. Jones, 1957). 

n OCS, rank at Week 4 was highly cor- 
to company rank (Week 23), and both 
a substantial relation to the physical 
score. The correlation between rank at 
and final rank reflect the relative fixing 
top and bottom third of the ranks by 
The correlations of the chain of com- 
Oints indicate a selection factor in the 
ent of men on the chain of command. 
ig relatively poorly in rank and physical 
Performance were more likely than others 
din stressful student leadership roles 
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during Week 3. In short, the performance corres < 
lations within OCS suggest te some extent that 
men are labeled early in OCS as “leaders” or 
“duds” and then treated accordingly, 

The OCS performance measures did not corre- 
late with the rating of subsequent military per- 
formance, Given that the goal of OCS is to pro- 
duce combat leaders, two facts are relevant to 
the absence of correlation between OCS and 
later performance. First, the ratings of permanent 
duty performance emphasized efficiency on the 
job rather than more abstract leadership qualis < 
ties. Second, 94% of the men were serving in 
technical and administrative leadership positions 
rather than field or combat positions. A study 
performed during the Korean conflict (Katter & 
Holmen, 1954), however, found correlations (s 
to .20) similar to the current results between 
OCS ranks and combat performance ratings, 

In summary, candidates who succeeded at OCS 
tended to be forthright, socially bold individuals 
who adopted active coping styles. They expressed 
relatively few problems due to stress, but may 
have adopted some alien attitudes in order to 
succeed, Their success may in part be due to 
making a good impression on officers during the 
first few days of OCS. 


REFERENCES 


Goldfried, M. R., X D’Zurilla, T. J. A bebavior- 
analytic model for assessing competence. In C. D. 
Spielberger (Ed.), Current topics in clinical and 
community psychology. New York: Academic 
Press, 1969. 

Holmen, M. G., Katter, R. V., Jones, A. M., & 
Richardson, I. F. An assessment program jor OCS 
applicants. (Tech. Rep. 26) Alexandria, Va.: 
Human Resources Research Organization, 1956. 
(NTIS No. AD-91 213). 

Jones, M. C. The later career of boys who were 
carly- or late-maturing. Child Development, 1957, 
28, 113-128. 

Katter, R. V., & Holmen, M. G. Infantry OCS 
evaluations and combat performance. (Tech. Rep. 
8) Alexandria, Va.: Human Resources Research 
Organization, 1954. (NTIS No. AD-39 552). 

Kreuz, L., Rose, R. M., & Jennings, J. R. Suppres- 
sion of plasma testosterone levels and psycho- 
logical stress. Archives of General Psychiatry, 1972, 
26, 479-482. 

Petersen, P. B. An investigation of the effect of 
training. Unpublished doctoral dissertation, George 
Washington University, 1971. d 

Petersen, P. B., & Lippitt, G. L. Comparison of 
bebavior styles between entering and graduating 
students in Officer Candidate pace Journal of 
Applied Psychology, 1968, 52, 66-70. 

us REER of the Army. The OCS story. 
Washington, D.C.: Department of the Army, 
Headquarters, October 1969. (Pamphlet) 


(Received June 25, 1973) 


TL HONS N 


A PERFORMANCE CORRELATE OF PERSONAL CON 
BELIEFS IN AN ORGANIZATIONAL CONTEXT 


W. J. HEISLER * 
University of Notre Dame 


The internal-external control beliefs of 175 employees of a government 
were compared to an index of personal effectiveness composed of five : 
indicative of success within a governmental occupational context: nur 
promotions, salary increases, and awards received; current salary; and 
differential. Employees with a greater belief in external control (Rott 
reported significantly lower personal effectiveness than those with a 

- belief in external control. When organizational rewards were perceis 

| agency members as skill-determined, the relationship between environn 

N control beliefs and personal effectiveness increased, while the relationshi; 
nearly zero when agency rewards were seen as chance-related. 


Over the past decade, a significant amount of 
research has examined what several investigators 
consider a major variable in behavior theory: 
the generalized beliefs that individuals have about 
their ability to influence or control their environ- 
ment.“ Although a majority of the studies in this 
area have focused upon the effects of such beliefs 

upon individuals’ efforts to acquire information 
and engage in social influence activities (Davis & 
Phares, 1967; Gore & Rotter, 1963; Seeman & 
Evans, 1962), more recently a number of inves- 
tigators have been concerned with the relationship 
between internal-external control beliefs and 
what might be termed “personal effectiveness” 
(Lao, 1970; Phares, 1965). 

While these latter studies have tended to sug- 
gest a modest, but significant, relationship be- 
_ tween personal effectiveness and environmental 

control beliefs, much of the evidence has been 
obtained using samples of college students in 
either academic or relatively simple, short-term 
Social influence situations. It was the purpose of 

this study to ascertain whether a relationship 

between environmental control beliefs and per- 
sonal effectiveness could be substantiated within 
a more complex organizational context. Were 
such associative properties to exist, it was hy- 
pothesized that organizational members with 
greater beliefs in their ability to control their 
environment would exhibit significantly greater 

personal effectiveness or success (as indicated by 
the receipt of greater organizational rewards) 
than would organizational members with lesser 
_ beliefs in their environmental control abilities. 


- 
l 
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Requests for reprints should be sent to William 
J. Heisler, Department of Management, College of 
Business Administration, University of Notre Dame, 
Notre Dame, Indiana 46556. 

*An excellent summary of this research can be 
found in Rotter, Seeman, and Liverant (1962) and 

Rotter (1966). 


METHOD 
The study was carried out in e office of & 
United States governmental « which pre 


vides technical assistance to v 
individuals in land and water 


5 groups and 
rce manage 


ment. Since agency personnel wi ographically 
dispersed throughout the sta restionnaites 
with measures of the various les in this 
study were mailed to all emp t their job 
address with a provision for mail retur 
to the researcher. Returns we eived from 
196 personnel, 75% of those sı d, although 
only 175 were substantively co enough fot 
data analysis. Because of the na f the occu 
pational field, nearly all respond were male 
with an age range of 20 to 60, « mean age of 
41.2, and a standard deviation 1 5. Appro% 
mately 30% of the respondents heid managerial 


positions; the remaining 70% were technical 
specialists and staff personnel. Of the respondents, 
10% reported having attended graduate course’; 
60% reported holding a four-year college degree) 
the remainder had at least a high school educi 
tion. 

The Internal-External (I-E) scale developed 
by Rotter (1966) was used as a measure of es 
individual’s generalized expectancies or beliefs a 
to whether events are contingent upon his owi 
behavior. For the respondents in the study, score 
ranged from 0 (the consistent belief that ind 
viduals can influence their environment) to * 
(the consistent belief that the locus of reward 5 
external to the individual, i.e., determined by 
luck, chance, powerful others, etc.) with a mean 
of 7.6 and a standard deviation of 4.3.° 


3 While this mean is somewhat more internal 10 
a number of the general norms reported by 5 
(1966), it is consistent with Gemmill and H 
(1972) previous results for managerial personne 
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versonal effectiveness was a 
dex incorporating a number 
onsidered to be global indi- 
nee effectiveness or success 
ntal occupational context: 
ons, within-grade salary in- 
received; current salary; and 
¢ difference between the em- 
ral schedule (GS) grade and 
z GS grade and level with the 
for each item were recoded 
le into five equal categories 
; of one to five. A total score 
t was then obtained by sum- 
is in the index, Scores on the 
m 5 (low personal effective- 
personal effectiveness) with a 
a standard deviation of 4.5. 
tency of the measure was 69. 


RESULTS 


ectiveness index was found to 
rrelated with the personal con- 
ncy members (r = —.25, p< 

hat those members who ex- 
relief in their ability to exert 

z environment reported experi- 
rsonal effectiveness. The per- 
index, which is cumulative in 
found to be a function of the 
(r= 41, p<.001), education 

5), and length of service with 
58, p < 001). Therefore, third- 
correlation coefficients were calcu- 
te statistically the effects of these 
ting degree of association be- 
ə environmental control 


R = —.22), but remained significant at 
me .01 level. 
Since prior research also indicated that a per- 
N's knowledge of the chance or skill nature of 
Specific task situation which he faces appears 
® moderate the effects of his more general 
Sonal control beliefs (Davis & Phares, 1967; 
otter & Mulry, 1965), a measure of employee 
Mtteptions of the chance or skill nature of the 
reward structure was also incorporated in 
Questionnaire. More specifically, members 
asked to indicate which of several factors 
ability and effort, knowing the right peo- 
» being in the right place at the right time, 
they saw as being most important in in- 
‘cing promotion and salary decisions within 
gency. Responses were classified on a priori 


Sunds into the dichotomous categories of skill 
chance. 
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As hypothesized, the difference in personal 
effectiveness between internals and externals 
was greater under skill than under chance per 
ceptions, with internals experiencing greater 


personal effectiveness. For those agency members 


who saw organizational rewards as skill-deter- 
mined (n= 93), the association between en- 
vironmental control beliefs and ratings of per 
sonal effectiveness was significant at the 001 
level (r = ), The association decreased to 
r = —.27 when the effects of age, education, and 
time with the agency were controlled for, but 
remained significant at the 01 level. However, 
for those employees who saw agency rewards as 
chance-related ("= 80), the relationship be- 
tween environmental control beliefs and 
effectiveness was nearly zero (r= 01), with a 
third-order partial r = —.08. 


Summary 


The results of this study support the hypothe- - 


sis that organization members with a greater 
external control orientation experience signifi- 
cantly less personal effectiveness (as indicated by 
the number of promotions, salary increases, 
awards, etc., that they receive) than do members 
with a lesser external orientation. This finding 
appears consistent with the results of a study of 
Gemmill and Heisler (1972) in which managers 
with more external control beliefs reported ex- 
periencing significantly greater job strain and less 
job satisfaction than did managers with a lesser 
external orientation. To the extent that external- 
ity is associated with lower levels of organiza- 
tional reward, one might expect the holder of 
such beliefs concomitantly to feel less satisfied 
with his job (particularly with his job progress) 
and more bothered by various elements in the 
job situation (eg., feeling unable to influence 
superiors’ actions and decisions, not knowing 
how a superior evaluates his performance, not 


knowing what opportunities exist for promotion, 


etc.) 

ue results further support prior conclusions 
concerning situational saliency in the effects of 
personal control beliefs. When promotions, salary 
increases, and awards were perceived by agency 
members as skill-determined, the differences in 
personal effectiveness between internals and ex- 
ternals increased in significance. However, when 
organizational rewards were perceived as chance- 
related, the internal-external control dimension 
lost its salience as a predictor of personal effect- 
iveness. 

Although the findings reported in this study 
are statistically significant at or below the .01 
level and remain so even when subjected to 
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partial correlational analysis, the strength of the 
associations is somewhat low by a number of 
contemporary standards, Nevertheless, when 
viewed as resultants of a field study within a 
complex organizational context, their associative 
properties remain of substantial interest and im- 
Port. 
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Locke (197 
lung: needed 


be commended for offering 
vement over previous sys- 
occupational satisfiers and 
icular importance is the dis- 
nt categories and event cate- 
o be commended for offering 
cation despite acknowledged 
sampling procedures. The 
me to his own judgment re- 
nce he wishes to place in the 
hese sampling problems. 
ible to reader judgment are 
‘atistically nonsignificant com- 
‘rs in his Tables 2 and 3. The 
ved for “good” and “bad” 
ar as to require no statistical 
nodological problems of analy- 
the remaining three compari- 
and all those comparisons in 


Wents are so 
st. However, 
S cast doubt 
Ons in these 


misuse of chi-square cited by 
rke (1949) is violation of the re- 
ement that cach tally in a chi-square table 
dependent of the others. In Locke’s study, 
tion of this independence requirement oc- 
d in two ways, In Study 1, Locke elicited 
iptions of four events, two “good” and two 
from each of his subjects. Locke then 
led the chi-square statistic to fourfold con- 
ney tables in which events were counted 
r than persons. Independence, in the con- 
t of this analysis, requires that each descrip- 
l of an event be elicited from a different 
wiect so that there is no basis for correlations 
Mong the incidents. In Study 1, collecting two 
S of data from each subject (Table 2) and 
iting them as if they represented different 
Miects led to the type of nonindependence of 
* Requests for reprints should be sent to Robert 
Hadley, Coordinator, Rehabilitation Counseling 
ee California State University, 5151 State 
‘ity Drive, Los Angeles, California 90032. 
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le published by Locke in the August 1973 issue of the Journal of 
Psychology offered an improved classification system for occupational 

and dissatisfiers, distinguishing between agent and event categories. 
r, some methodological problems appear to limit the conclusions that 
drawn from Locke’s study, An alternative classification system, taking 
ount the multiple determination of some events, is recommended. 


observations that inflated the Ns, and probably 
biased the chi-squares upward. Additionally, in 
the analyses of Tables 4 and 5, inflation of Ns 
occurs in both studies. 

The second violation of the requirement of in- 
dependence occurred in the analysis of both 
Study 1 and Study 2 (in which each subject 
described one good and one bad event) involv- 
ing the comparison of good and bad events (Ta- 
bles 2 and 3). The analysis failed to consider 
possible correlation between these two event 
categories; if such correlation is positive, the 
values of chi-square would be biased downward. 
However, examination of the percentages in Table 
3 reveals them to be so similar that the conclu- 
sions were probably not affected by any down- 
ward bias. While the inflated Ws in Table 2 biased 
the chi-squares upward, and the possible corre- 
lation between the two event categories may 
have biased the chi-squares downward, the mag- 
nitude of these possible biases is unknown, and 
consequently so is the extent to which they may 
compensate each other. 

McNemar’s (1969, p. 261) test of correlated 
proportions could appropriately be applied to 
compare the first good and bad incidents elicited 
from each subject. This procedure would require 
discarding some of the data of Study 1, but 
would use all the data of Study 2. An advantage 
of the McNemar test is that it capitalizes on 
possible correlation between good and bad events, 
a valuable feature of the repeated-measures de- 
sign. An alternative for Study 1, which would use 
all the data, would be to classify each subject 
according to whether the number of motivator- 
determined good events he described (2, 1, or 0) 
was greater than, equal to, or less than his num- 
ber of motivator-determined bad events. The sign 
test (Siegel, 1956) could then be applied. This 
same procedure could again be followed for the 
task-nontask analysis. 

In addition to the problem of inflated Ns, Ta- 
ble 4 reveals a second item on Lewis and Burke’s 
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(1949) list of common infractions, that is, failure 
to consider frequencies of nonoccurrence. It 
appears that the sums of the white-collar and 
blue-collar columns furnished the basis for the 
expected frequencies against which the observed 
frequencies were compared. The chi-square of 
18.91, for example, appears to be based on com- 
paring the observed frequencies of 90 and 32, 
respectively, to expected frequencies of 66 and 
56, without considering nontask frequencies. Be- 
cause of the problem of inflated Ns, this re- 
viewer did not undertake the relatively straight- 
forward task of recalculating the chi-squares of 
Table 4 with frequencies of nonoccurrence in- 
cluded. 

My own preference would be for a system of 
analysis which takes account of the multiple de- 
termination of some events. For example, if a 
worker reports being given more work and addi- 
tional responsibility without the promotion which 
normally goes with the responsibility, or without 
the pay raise which normally goes with the 
increased work, four of Locke's categories are 
involved (amount of work, promotion, responsi- 
bility, and money). To classify the event into 
one of these categories is to throw away some 
possibly valuable data. I propose assigning a 
constant number of points (e.g., 5 or 10) to each 
event and instructing the judge to allocate these 
points to the various categories according to the 
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t of achievement motivation 
achievement) has both theo- 
cal importance, In a recent 
970) reviewed efforts to mea- 
motivation and presented the 
) of an empirically developed 
cned to measure n Ach of col- 
1e Netherlands, 

esented a taxonomy of achieve- 
concepts. Ten categories of n 
d by prior research, served as 
developing the questionnaire. 
e as follows: aspiration level, 
or, upward mobility, persis- 


the categoric 

risk-taking be vio 

tence, task te; sion, time perception, time per- 
Spective, partic: re 0 
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retical and 
article, Hern 
sure achieve 
results (and 
questionnaire 
lege students 

Hermans a! 
ment motiva 
Ach, all su; 
& framework for 


choi iti jor, 
er be we recognition behavior, 
eg further cipirical study of the question- 
nine caiczories were examined (risk-taking 

A ied was excluded), and the concept of a 
questionnaire measure was extended to 

tealm of business management. Specifically, 
purposes of this study were (a) to examine 
ent motivation questionnaire using 
jects employed in the real world of business; 

) to examine the taxonomy of the achieve- 
= motivation construct used by Hermans by 
` tcorrelating the category scores; and (c) to 
ompare the n Ach motivation levels of mana- 
a Working in large corporations with those of 
der ovners of small enterprises and under- 
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AC VEMENT MOTIVATION AMONG MANAGERS OF SMALL 
BUSINESSES, CORPORATION MANAGERS, 
AND BUSINESS STUDENTS 
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tionnaire measure of achievement motivation tested on college students 

Netherlands was administered to business college students, small 
s managers, and corporation managers. In addition, category scores 
leveloped that reflect the more specific components of the n Ach (need 
hievement) construct. Item analyses and Kuder-Richardson 20 estimates 
dequate. Intercorrelations among the category 
for the student sample, which suggests the n Ach construct may be Jess 
ntiated among older managers. Managers differed significantly from 
css students on several components of n Ach. 
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scores were significantly 


graduate students majoring in business admin- 
istration. 
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Metitop 


Questionnaire and scoring procedures. The n 
Ach questionnaire consists of 29 items with four 
to six alternatives. Nine category scores were 
obtained by summing over those items compris- 
ing each category according to Hermans“ item- — 
taxonomy designations. The total score con- 
sisted of all 29 items, Two demographic items 
measured age and years of employment with 
present organization. 

Samples and sampling procedure. A question- 
naire and cover letter were mailed to 120 busi- 
nessmen selected from the local Small Business- 
men Association files with the only restriction 
that the organization employ 50 or fewer people. 
Forty-one usable (349%) questionnaires were 
returned. Personal visits to the central personnel 
offices of large nationally known corporations 
were made to collect the corporation manager 
data. Five major corporations participated, and 
80 questionnaires were distributed among lower- A 
middle managers with 50 or fewer subordinates, 
Thirty-six usable forms (45%) were obtained. 
The undergraduate sample consisted of business 
students in an upper-level, required management 
game course, The entire class (45 males) ccm- | 
pleted usable questionnaires. All questionnaires 
were completed anonymously. £ 

RESULTS 

Item analyses. The item-total score correla- 

tions (Pearson) were computed for each sample.* 


2Detailed tables of item-total score correlations 
by samples and comparisons of these correlatoins 
with the item-cluster correlations reported by Her- 
mans can be obtained from the author. 
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Only one item-total score correlation was nega- 
tive, item 20 in the student sample analysis (r = 
~24), and no category-related pattern of non- 
significant correlations emerged. The item corre- 
lations with either work experience or age were 
within chance expectations; none correlated con- 
sistently across samples. 

Category intercorrelations. A cell by cell com- 
parison using the f to s transformation revealed 
that the intercorrelation matrices for the small 
business manager and corporation manager sam- 
ples did not differ significantly at the Os level 
or beyond; both manager matrices did differ sig- 
nificantly from the business student matrix. Based 
on these results (and the f. test results), the 
small business manager and corporation manager 
samples were ¢ 

The intercorrelations were distinctly higher 
„ 32, signs disregarded) for the combined 
manager than for the business student 
sample (P = .17, signs disregarded). None of the 
categories nor total score correlated significantly 

with years of work experience or age. 

Reliabilities. The total score Kuder-Richardson 
20 reliability of .67 for the business student 
sample is considerably below the Kuder-Rich- 
ardson 20 reliabilities of .85 and 88 reported by 
Hermans, However, the Kuder-Richardson 20 
reliabilities of .79 and 82 for the small business 
managers and corporation managers, respectively, 
are comparable. 

The Kuder-Richardson 20 reliabilities of the 
category scores ranged from .01 to .69. In gen- 
eral, the reliabilities were highest for the corpo- 
ration manager sample. 

Comparison of means by sample. The business 
student sample had the lowest mean scores among 
te samples which suggest less persistence (stu- 
dent versus small business manager ¢ = 2.43. 

S os: student versus corporation manager 
£=3.03, p<.01; two-tailed tests); different 
_ partner-choice tendencies (student versus small 
business manager £= 1.91, p < .07; student ver- 
dus corporation manager f = 2.53, p < .05; two- 
tailed tests); lower-task tension or less tendency 
to resume a task when interrupted (student 
versus corporation manager t= 2.53, p .05; 
two-tailed test) and less seeking of recognition 
by performing well than do managers in corpo- 
rations (student versus corporation manager £ = 
2.21, Pos; two-tailed test). Similarly, the 
total score n Ach mean was lowest for the stu- 
dient sample (student versus corporation mana- 
ger t= 2.64, p < 01; two-tailed test). None of 
the sample means differed significantly between 
the small business managers and corporation 
managers, although the corporation managers had 
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Achievement. 

In terms of applicatio: 
category scores may incre- 
questionnaire. Such scores, 
should increase the under 


nas 
tify aspects of n Ach for w ch different 
of subjects may have similar scores and/or 
terns of scores. Use of the total score 
mask such similarities and differences. 
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about only one applicant—male or female. The 
procedure can best be conceptualized as an in- 
complete 2 N 2 N 4 repeated-measures design 
embedded in the personnel selection exercise. 
Each subject responded to each of the experi- 
mental conditions, one condition for each of the 
four jobs. 


Dependent Variables 


Subjects indicated acceptance or rejection of 
each applicant and evaluated the applicant on 
the following scales: (a) overall rating, (b) po- 
tential for technical aspects of the job, (c) po- 
tential for long service to the organization, and 
(d) potential for fitting in well in the operation. 
Evaluations were marked on 6-point scales rang- 
ing from (1) extremely unfavorable to (6) ex- 
tremely favorable. 


RESULTS AND DISCUSSION 


The major focus of the investigation was the 
influence of sex stereotypes and job demands on 
personnel decisions. As specific job requirements 
were irrelevant, we pooled the data from the four 
jobs. 

Females were selected for managerial positions 
significantly less often than males (59% accept- 
ance for females compared to 71% acceptance 
for males; x?= 14.51, p<.01). In addition, 
females were rated significantly lower than were 
males on all other dependent variables. Overall 
ratings for female applicants were lower than 
for males (F = 14.02, p<.01). Females were 
also rated lower than were males on “technical 
potential” (F=7.97, p<.01), “potential for 
long service to the organization” (F = 33.81, p < 
01), and “potential for fitting in well (F= 
17.26, p < .01). These findings provide strong 

support for our first hypothesis that female 
applicants for managerial positions are evaluated 
as less acceptable than are male applicants with 
identical qualifications. Furthermore, it appears 
that the general tendency to discriminate against 
female applicants for managerial positions is 
based on perceptions and expectations regarding 


a female employee’s in il competence 
and dependability as we! echnical A 

The second hypothesis d the differs: 
tial influence of various juirements as 
evaluations of male and applicants fer 
managerial positions. The nt statistic i 
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£ the dependent 
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Requirement interaction was icant for over. 
all ratings (F = 6.27, p < ings of poten 
tial for long service (F = p <.05), and 
ratings of potential for ñ a well in the 
organization (F= 5.76, $ The findings 
of this study support the se nypothesis that 
discrimination against won applicants for 
managerial positions is mot e when the j 
requirements are demandin challenging. 
These results provide fu evidence of 
subtle, but pervasive influen sex role stere 
types in situations where t! e of sex is 
made salient for the decisio ker. Addi 
research is required, however, tO establish tht 
beyond our 


generality of these findings 
of southern business students. 


REFERENCES 
Rosen, B., & Jerdee, T. H. The influence of sex-role 
stereotypes on the evaluation of male and 
supervisory behavior. Journal of Applied P. 


ogy, 1973, 57, 44-48. E- 

Rosen, B., & Jerdee, T. H. Infiuence of sex 1 i 
stereotypes on personnel decisions. Journal of AF 
plied Psychology, 1974, 59, 9-14. 


(Received June 14, 1973) 


ution has been given in recent 
ruction of performance rating 
ocedures developed by Smith 
) for preparing scales of 
tions. These procedures involve 
s. In the first step, behavioral 
ical incidents are generated 
which illustrate job perform- 
in the second, or retranslation, 
incidents are reassorted into 
ensions, and incidents are re- 
ey are reliably placed in the 
hich they were generated. The 
ach remaining incident accord- 
of performance it represents. 
le values equal to their mean 
sion representing the “good- 
vior. Items with large variances 


hess” of the | 
fe not retai: 
Most accou 
dealt wi: 
tales in an 


(Fogli, Hulin 


of the use of these procedures 
he development of performance 
reasing variety of work settings 
Blood, 1971; Harari & Zedeck, 
s: Mass 5; Zedeck & Baker, 1972). 
Campbell, Dunnette, Arvey, and Hellervik 
(1973) have compared the behaviorally anchored 
ales with another type of rating, and Borman 
972) has explored the generalizability of such 
es to raters other than those in the organiza- 
Monal position of persons assisting in scale con- 
Auction. Behavioral expectation scaling tech- 
ques can be used to accomplish other organi- 
“ational objectives in addition to the construc- 
of performance rating instruments. For ex- 
ample, Landy and Guion (1970) have used the 
Procedure to construct motivation scales. 
To cite another example, the first step of the 
E edure has been suggested as an information 
athering method to be used in the development 
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í FROM BEHAVIORAL EXPECTATION SCALE PROCEDURES 
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ioral expectation scale procedures have been used to construct perform- 
valuations in a variety of work settings. With slight modifications the 
ures can also be used to provide information to organizations for 
ing the domain of evaluated performance, developing training programs, 
sessing agreement on and communication of organizational policy. 


of a job analysis (Blum & Naylor, 1968; 
& McCormick, 1965). The critical incident t 


tion scales. ; 
Extending the Domain of Evaluated Performance 


Behavioral expectation procedures can be used 
to broaden the domain of performance evalua- 
tion, Borman (1972) has shown that when per- 
formance evaluation scales are developed from 
more than one perspective such as by organiza- 
tional superiors and by peers, it is possible to 
get different behavioral dimensions from each 
perspective. His evidence indicates that not only 
do the separate perspectives contribute different 
performance dimensions to the evaluation 
schema, but also persons from each perspective § 
are best able to rate performance on the scales 
developed from their perspectives. For example, 
peers may be better than superiors at developing 
and using a scale which measures cooperation, 
while superiors might be better than peers at 
developing and using a scale which measures 
quality of work. By using the scaling procedures 
from as many organizational perspectives as 
possible (e.g., superiors, peers, subordinates, cli- 
ents) we may considerably broaden the domain 
of behavior which we can assess accurately and 
on which we can provide feedback to organiza- 
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ion members. A retail clerk might conceivably 
be rated on different aspects of his performance 
by his department manager, by other clerks who 
work with him, and by customers he has served. 

Whether different organizational perspectives 
provide different scales may depend on the nature 
of the job and the visibility of the job holder to 
persons in the various organizational perspec- 
tives. Zedeck (1973) obtained performance di- 
mensions that were highly similar even though 
they were developed at two separate levels of an 
organization. 


Developing Training Programs 


The behavioral items which are collected in a 
behavioral expectations scaling procedure could 
provide the basis for a training program. This 
suggestion is also made by Campbell et al. 
(1973). The skills to be trained are specified in 
terms of the actual job behaviors rather than 
simply the name of a skill domain. As for a job 
description, a training program can be developed 
from the entire set of behaviors collected and 
agreed upon, not just the behaviors listed in the 
final performance evaluation scales, that is, items 
eliminated by the second step can be used in the 
training development. The behavioral expectation 
jtems also allow trainers to specify unwanted 
behaviors that trainees should learn to avoid. A 
great deal of actual job performance description 
is contained in the behavioral examples which 
the procedures provide. This can be used to make 
training directly relevant to the job performance 
expected of the trainee. Trainees in a program 
based on these materials would learn expected 
behaviors, and they would learn how their per- 
formance is to be evaluated. 

For example, Fogli et al. (1971) found an item 
for grocery checker performance which read 
“You can expect this checker to know the vari- 
ous sizes of cans—No, 303, No. 2, No. 24.” If 


this knowledge is expected of checkers, they 


should be trained to make the necessary dis- 


criminations. Also, the evaluation for this part 
of a grocery checker’s training program is obvi- 
ous from the behavioral item. 

Assessing Agreement on Organizational Policy 


The behavioral expectation scaling procedures 


provide an analysis of the clarity of organiza- 


tional policy within the level on which the scales 
are developed. If materials for the scales come 
from managers, the item variances used in step 
three of scale construction indicate the agree- 
ment among managers on the appropriateness of 
certain behaviors. For scale development, items 
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with large variances are d 
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development. Items with! 
areas where organizational! 
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behavior to be good peri 
consider it to be poor per 
important behavior, some o 
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should be made to detern her it should 
be encouraged or discourag 1, the organ 
zation’s policy regarding t! vior can be 
transmitted to the person! perform of 
evaluate the behavior. Large ices in Step d 
cue the organization to the tial problems 
of disagreement. 
Assessing the Accuracy of C cation of 
Organizational Policy 

In the behavioral expec scaling pro 
cedures members of the ot tion rate the 


nether the be 
ieffective job 


job behavior items accordin; 
havior represents effective 


performance. The scale valu n item is then 
established as the mean rati em means can 
be collected for the same from members 
at two different organizati ls to indicate 
how well policy has been co icated betwee 
levels. For instance, if sca‘: developed by 
managers for use on produ Workers, 

the behavioral incidents be rated for 
effectiveness (as in Step 3 both managers 
and production workers. II zers agree that 
a particular behavior is hig desirable and 


production workers either dont agree or they 
agree that it is undesirable, ihnen there is an 
obvious need to communicate the production 
workers how this behavior (on which they m 

be evaluated) is viewed by managers; or, 

is a need to enlighten the managers as to. 
this behavior should not be considered positively: 
This type of difference has been shown to ext 

in organizations by Zedeck (1973)- 


CONCLUSION 


as a perform: 


Perhaps because of its promise 
1 tentials of the 


ance scaling technique, several po 


behavioral expectations procedures 11 1 
overlooked. They can provide us with mal 0 
information in other areas of organizatio; 


out 
eration. Applied research should be carried 


to test these suggested additional uses. 
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During the past several decades, a considerable 
of literature has reported on weighted 
studies (Dornon, 1970; Dun- 
nette, Kirchner, Erickson, & Banas, 1960; Eng- 
land, 1971; Kavanagh & York, 1972). This arti- 
cle reports the results of a replication study of 
such an application blank designed to reduce the 

rate of clerical employees. Addition- 
_ ally, the study demonstrates how utility analysis 
can be used to provide indices of cost savings 
from introduction of the application blank 


Cxoss Val mrry Stupy 


| Two samples of intermediate level clerical 
employees who were hired on a permanent full- 
time basis were obtained. The sample on which 
‘item weights were developed consisted of 75 
long-term (16 months and over, mean = 21.22) 
and 75 short-term (terminated within 6 months, 
mean = 3.64) employees hired during August 
1969, to February 1972. The cross-validation 
sample consisted of 40 long-term (16 months 
and over, mean = 20.73), 20 intermediate-term 
{terminated within 7 to 15 months, mean = 
10.05), and 34 short-term (terminated within 6 
months, mean = 2.68) employees hired during 
March 1971, to March 1973. 
This study was conducted following the pro- 
cedure described by England (1971). Eleven 
_ scorable items were found to significantly differ- 
entiate between long- and short-term employees. 
After consulting with our Affirmative Action 
Department, two items (age and height) were 
dropped from the scoring system in order to 
reduce the possibility of adverse impact on 
minorities or protected groups. Using the remain- 
ing nine derived weights, all subjects in the two 
samples were scored, and a point-biserial corre- 


d Requests for reprints should be sent to Raymond 
Lee, Psychologist, Personnel, Hennepin County, 608 
4 Fourth Avenue South, Minneapolis, Minnesota 55415. 


A UTILITY ANALYSIS OF A WEIGHTED APPLICATI 
DESIGNED TO PREDICT TURNOVER FOR CLERICAL 


RAYMOND LEE* axo JEROME M. BOOTH 
Hennepin County Personnel, Minneapolis, Minnesota 


The validity of a weighted application blank was demonstrated in | 
turnover for clerical employees. Cross-validated results showed relati 
shrinkage of the validity coefficient. The magnitude of correlation 
O = 49%) indicated that weighted application blank scores could be 
a basis of selection. A utility analysis indicated that the maximum | 
savings could amount to about $250,000 during a 25-month period 


516 


ANK 


lation coefficient? (r,,) be 
application blank scores and t 
and Tilton’s (1937) overla 
were calculated for each sa 
results were 7 = .59 (n= ! 
an O, = 46% for the weight 
=.56 (n= 74, p < .001) with 
the cross-validation sample. 
Obviously, the magnitude 
coefficients held up very well 
system developed for the weis 
applied to the cross-validation 
intermediate-term group was 
cross-validation sample, a2 
product-moment correlation c 
tained, r = .56 (n= 94, p < 0 
was evident that the weighted 
scores predicted the criterion of 
be used in making selection dec 
the tenure criterion is specifical! 
the Equal Employment Opportunity C 
Guidelines (1970) as an acceptable ure 
job performance, the utility in dollars or bu 
necessity of the weighted application bk 
to be estimated before the blank’s 
justified. 


2Some researchers have criticized a high 
biserial correlation coefficient in weighted 
tion blank studies as being an overestimate | 
relationship because of the extreme diche 
the tenure criterion. The inclusion of an int 
term group in the cross-validation sample 
a highly significant product-moment corre! 
efficient between the tenure criterion 
weighted application blank scores (7 = -56; 
$< .001). 

The overlap coefficients demonstrate 800 
dictive efficiency of the weighted application 
Dunnette (1966) states, ‘Values between 
75% denote fairly good separation between © 
groups and usually denote a practically usel 
tionship,’ 


Suoer Norm 
curry ANALYSES thre and other obtaised valsts, the 
defined “utility” as the cots amoxiated with tersever were derived 
partion oi ccessful applicants — sost of 1 4 
aad “cost proportion of successful appli- 22 4— 
— reje hile some researchers, notably BEF person; (6) cout of 
Crabach eser (1965), state that the $593.94 (wages based on average 
— cats of good quality does not monthly salary rate of $781.50 for 1972) 
decrease | it of the institution, Dunnetie €t ol e 4 ye “ems — 
(1966) ary | there are many instances in tining patie ye f 
Buy v c rejection of successful execu- 38° monthly — Ar lr 
tives will costly. For a relatively low- * 1 8 
— job, most psychologists agree thal. 20% 10% of wage for fret to áith 
it is not vaful to consider the cost of pan er y): (9) y a 
rejecting rm clerical employees. There- — l ‘otal 
fore, the | of this section was to evaluate In — a 3 
the utility ng a weighted application blank a Lips number of permanent 
to the ex sclection procedure considering, TOL for the Intermediate clerical 
the cost inating unsuccessful applicants = — 400. The selection 
without c ng the cost of rejecting a po- found ta ba 18% (abeah 1,200 applicants 
tential lons employee, fi ot openings during month 
The co erminating is considered to be 9 IE — of 
the total c hiring, training, and orienting a explain come Jongg. eam, 20% int 
new cleric ployee to the job. A question. and 30% short term. 
naire distr to clerical supervisors provided re With 400 positions to in and a 
informatio: ssary to this analysis of termi- 50%, we could assume at least 200 
ation cos: response to a question relating emplopees were needed to replace the 


to the aver ogth of time required for a new mediate- and short-term employees te ing 
employee come fully functioning at an period, Based aasumption 
an lev roficieney or to à 3 — thie 3 ach 
norma srvision was $ moni re- number 
Sponse to ior question relating to average . group, the total 
percentage me spent in supervision of a tired 
Rew cleric. aployee was as ee first ie i 
month, 4% -cond month, 16%; month, Lore 7 
9%; fourth = th, 6%; fifth month, 5%. Using costs, and the average cost per long ag 


TABLE 1 


NUMBER OF APPLICANTS SCREENED AND Hiren AND RELATED Cost Estimates POR 
DIFFERENT CUTTING SCORES ON THE WEIGHTED APPLICATION BLANK 
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= — 
| Cumulative %s A 
| scoring higher 7 = — 
| ratio 8 
| Long Inter- Short 
| term | mediate | term 
1 24 | $85,376 
(Existing | 
Procedures) 
| 5 98 | 95 74 200 2 pines 
1 95 0 | .50 a 3 
72 30 24 5 
u 2 200 26 386,757 
n ae ‘07 [387,836 
10 W 200 105 355.687 
I 


Swort 


n blank. 


| The data in Table 1 were computed by a 
vised method described by Dunnette (1966) 
and England (1971). The computations shown 
below, for a score of 12, illustrate the method 
~ used: 
It can be seen from Table 1 that 42% 
long-term, 15% intermediate-term, and 
6% short-term applicants would be se- 
lected at a score of 12 and above. 
. Using a cutting score of 12, the total 
number of applicants which would have 
to be screened = (Desired long-term em- 
ployees)/(Base rate for long-term em- 
pl-yees) X (% long-term employees 
hired) = 200/(.50) (.42) = 952. 
. The number of potential intermediate- 
term employees = (Number screened) X 
(Base rate for intermediate employees) 
= (952) (.20) = 190, of whom (190) X 
(% intermediate-term employees hired) 
= (190) (.15) = 28 would be hired. 
4. The number of potential short-term em- 
ployees = (952) (Base rate for short- 
term employees) = (952) (.30) = 286, 
of whom (286) X (% short-term em- 
ployees hired) = (286) (.06) = 17 would 
be hired. 
. Hence, the total hired = 200 + 28 + 17 
= 245. 
The selection ratio= 245/952 = .26. 
. The estimated total cost = (Number 
hired) X (Training cost) + (Number 
screened) X (Recruiting cost) = (245) 
($1,378.44) + (952) ($20) = $356,758. 
„The cost per long-term employee = 
$356,758/200 = $1,783. 


As is shown in Table 1, the maximum utility 
of the weighted application blank is at a cutting 
score of 14. Using a cutting score of 14, the total 


Notes 

existing selection procedur <pproximately 

$585,376 (as shown in the of Table 1). 

Thus, the estimated maxi ilar savings 

expected from the use of t hted. applica: 

tion blank is $249,946 for onth period 

Of course, variations in su; i demand ia 

the labor market may requi rd or dows- 

ward adjustment of the cutt e which will 
decrease the amount of dolle: gs. However, 
it should be noted that the estimates of 
the weighted application blan: large enough 
to be quite significant from cal point of 
view. These figures show tha in the case 
of a relatively low-level job odest hiring 
and training costs, the dollar s that can be 
expected from the introduc of a valid 
weighted application blank are large. 
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that experi: 
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especially tr 
relating em 
Biersner & 
found that 
team trainin 
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duties of military divers in- 

underwater structures (hulls, 

id so forth), ship salvage, and 

Previous research has shown 

divers have several distinctive 

demographic characteristics, 
with authority and difficulty in 

liy to others (Biersner, 1973; 
ron, 1970). Other research has 

essful underwater demolition 
cess is predictable from physi- 
rceived health measures (Biers- 
Ryman, & Rahe, 1972), as well 
pay grade (Gunderson, Rahe, & 
‘owever, little research has been 

to pred: successful diving performance 
among those undergoing scuba training. It is not 
wn, for ex:mple, whether measures of social 
and emotional adjustment can be used to predict 
Such performance. 

The present research is an attempt to predict 
Performance in a hazardous military scuba train- 
Mg course from attitude measures, perceptions 
of physical and emotional health, and demo- 
graphic information. 

— 

This study, Report 73-54, was supported by the 
Nan au of Med’cine and Surgery, Department of the 
580 under Research Work Unit MFS1. 524. 002-5011 
ee The opinions expressed are those of the 
ah and are not to be construed as necessarily 
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‘Partment of the Navy. 

9 5 at the Naval Education and Training Sup- 
Ommand, Pensacola, Florida. 
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EDICTION OF SCUBA TRAINING PERFORMANCE ' 


ROBERT J, BIERSNER * axo DAVID H. RYMAN” 


United States Navy Medical Neuropsychiatric Research Unit, 
San Diego, California 


attempt to predict successful diving performance among thoi under- 
scuba training, 296 trainees of the Navy School for Divers Second Chast 
bed to complete a demographic 
(titude survey. Multiple correlations of 4S (p< .001) in a validation 
of 148 subjects and % (p<.001) in a cross-validation sample of 

| size were obtained between these variables and a pas-fail performance 
ion. Scales dealing with mental health and training apprebemion were 
nost significant predictors. Successful performance was abo associated 
bigher education, having fewer older brothers, and amount of parental 


„ a health inventory, and 


Merinop 

Subjects s 

Subjects were 296 trainees attending the Navy 
School for Divers Second Class at San Diego, 
California, The trainees were divided into valida- 
tion and cross-validation samples of 148 subjects 
erch. Subjects were from 13 sequential diving 
classes which lasted five weeks each. The trainees: 
came from all branches of the military service, 
with the greatest proportion from the Navy — 
(63%). The trainees had a modal pay grade 
(rank) of E-3 (seaman, Army private) and mean 
age of 22.3 years. The average education was 12,2 — 
years. | 
Training Program 

Scuba training consisted of two weeks of in- 
struction in diving medicine and physics and 
three weeks of practical experience involving the 
use and maintenance of diving equipment. The i 
final week consisted of ocean dives to a depth of 5 
60 feet. 4 
Criterion Measure f 

The criterion was success or failure in — 
(pass-fail), as obtained from the official records 
after each class finished the course. f 


Predictor Measures 

A demographic questionnaire, a health inven- 
tory, and an attitude survey were administered 
at the start of training. The demographic ques- 
tionnaire asked for the following: marital status, 
pay grade, branch of service, education, size 0. 
hometown, number of older brothers and sisters, 
number of arrests and truancies before the age 


of 18, and frequency of parental criticism and 
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TABLE 1 


VALIDITIES AND BETA WEIGHTS OF PREDICTOR ITEMS 
AND SCALES FOR Two SAMPLES OF 
SCUBA TRAINEES 


] 
2 | r Sample 1 | r Sample 2 Beta 
Item or scale | (a = 148) | (n= 148) | weight» 
— — 
Education 42228 500 
No. Gider Brothers 47 —.15 —.25 
Criticism and Maternal 
employment .16* 19 
Visual AS —.43 
General Mental | —.20* —.49 
Concern | —.26** —.15 
Leadership | | * 13 
| 
For Sample 1, R = .45*** for Sample 2, R = .30***, 


Standardized regression coefficients of the multiple 
regression equation derived from the validation sample, 
*p < .05. 
* <.01. 
s+ p < 001. 


praise. In addition, the following information was 
obtained about the parents of each subject: mari- 
tal status, education, occupation, and number of 
hours employed. The health inventory, a con- 
densed version of the Cornell Medical Index, 
contained 100 items about physical and emo- 
tional health. Subjects were given four choices 
in answering this questionnaire: (a) never, (b) 
rarely, (c) occasionally, and (d) often (scored 
1 to 4, respectively). The attitude survey con- 
sisted of 25 items about training motivation, 
leadership, and training apprehension, Subjects 
were given six choices, ranging from disagree 
strongly to agree strongly. 


Analysis 


A multiple dicriminant analysis was initially 
performed on the validation sample but failed 
to distinguish among the three failure groups 
(voluntary, medical, and miscellaneous drops). 
This indicated that common factors were associ- 
ated with failure in these groups. Regression 
coefficients were therefore computed (comparing 
the combined failure groups with passes) for the 
validation sample using demographic items, and 
health and attitude scales that significantly (p < 
.05) and independently correlated with the pass- 
fail criterion. Regression weights computed on 
the validation sample were then applied to pre- 
diction of the pass-fail criterion in the second, 
cross-validation sample. 


RESULTS 
Criterion Measure 


Oi the diving students 69% passed, while 12% 
failed because of minor physical injuries and 
illnesses (medical drops), and 12% voluntarily 
withdrew (voluntary drops). The remaining 
failed because of major health problems, disci- 
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plinary trouble, poor swin 
demic difficulties (miscellar 


ibility or aca- 
drops). 


Predictor Measures 


Four factors were subse: 
the health inventory. These 
item reliabilities of each we: 
(72), General Somatic ( 
(.91), and Anger (.75), 


identified for 
and the inter- 
ollows: Visual 
neral Mental 
factors were 


found among the 25 items or ttitude survey. 
These factors and the inte reliabilities of 
each * were as follows: Lead 82), Concern 

A multiple correlation oi hree predictor 
measures showed three demo: c items (edu- 
cation, number of older bro and a combi- 
nation item consisting of 1 employment 


titude scales 
health scales 


and parental criticism), 
(Leadership and Concern), an 


(Visual and General Mental) > significantly 
and independently associate: the pass-fail 
criterion in the first sample rbove the .05 
level. It was found that the it of parental 
criticism had a low but sig it correlation 
with pass-fail, and that th sociation was 
significantly moderated by mai i employment. 

Table 1 lists the correlatio ficients, beta 
weights and significance levels nese predictor 
measures. On the validation a multiple 


obtained with 
easures. Table 
measures were 
pass-fail cri- 
ie, resulting in 
< 901). Using 
the predic- 
the validation 
in the cross 


regression of 45 (p < .001) 
the pass-fail criterion using ths 

1 also shows that some of the 
significantly associated with 
terion in the cross-validation sa 
a multiple regression of .39 
these regression equations, 70 
tions of pass—fail were correct 1 
sample, while 69% were correc! 
validation sample, 


DISCUSSION 


The results showed that scuba training Per 
formance can be successfully predicted using 
multiple regression analysis based on deme 
graphic items and health and attitude scales. 5 

Education and birth order have been ee 
ated with performance under hazardous 929 
conditions (Radloff & Helmreich, 1968). E 
results of the present study confirmed the “on 
ciation of education with performance. tie 
higher the education level of the trp 
more likely that he would be successful. ccm 
Radloff and Helmreich (1968) found 1 | 
older-born did not make as many actual d. 


4These were inter-item reliabilities whic 
computed with a Spearman—Brown correction. 


(76), and Confidence (81). 
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er-born, the results for these 
dicated that the older-born 
ntly more often than did the 
se results indicated that fac- 
i success in training may not 
ted to success under actual div- 


nd that the training perform- 
subjects who were criticized 
rs was more likely to be suc- 
ther was employed. This sug- 
from the mother or the ways 
jated criticism were important 
ızardous situations many years 
ently not the number of docu- 
such as arrests and truancies 
c between success and failure, 
ch these problems were medi- 
especially the mother. 


the attitude scales indicated that 


jects who thought they were 
o passed the course more often. 
attitude results substantiated 
Stouffer et al. (1949) which 


diers who are not apprehensive 
wed had the highest ratings of 
eness. Scuba trainees who were 
vucern factor, which consisted 
about possible physical injury 
serformed better during training. 


This is further supported by the results of the 
health inventory which found that those scuba 
trainees who reported fewer visual problems and 
emotional symptoms at the start of training 
were more likely to succeed. Gunderson, Rahe, 
and Arthur (1972) also found that health mea- 
sures could be used to predict pass—fail during 
hazardous underwater demolition team training. 
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PREDICTING ALUMNI FINANCIAL DONATIO 


WARREN S. BLUMENFELD! anb PATRICIA L. SARTA/ 


Georgia State University 


The purpose of the study was to predict alumni financial donation - 
donation from demographic information available at graduation, Re: f 
59 donors and 59 nondonors were item analyzed. Weights were assi: 1 
cross-validated using the records of 50 held-out donors and 50 held-( 

donors. A biserial correlation of .37 (p< .01) was obtained. 


University alumni are probably the most 
common target of professional fund-raising ac- 
tivity. However, relatively little research has 
been accomplished predicting financial donation 
(e.g., Morris, 1970). Such research would have 
practical application for fund-raising organiza- 
tions. 

There is an extensive body of literature in 
applied psychology regarding the successful use 
of weighted personal data, such as that on appli- 
cation blanks, in a large variety of occupational 
groups (for a recent review, see England, 1971). 
Most of these studies attempted to predict cri- 
teria such as turnover, accidents, supervisory 
ratings, and/or some index of production. The 
use of weighted personal data has also been 
applied successfully toward forecasting finan- 
cial credit risk (e.g., Buel, 1968). As yet, how- 
ever, no attempt seems to have been made to 
apply weighted biographical data to the problem 
of predicting financial donation. 

The purpose of this study was to develop and 
cross-validate a psychometric scoring procedure 
(based on demographic information available at 
graduation) to predict alumni financial donation 
or nondonation. 


METHOD 
Subjects 


Ihe subjects were 218 students who either 
graduated from or last attended Georgia State 
University in 1963, 1964, or 1965, Of these 
alumni, 109 were donors to the 1968-1969 
Alumni Annual Fund. A control group of 109 
nondonors was developed by selecting the name 
following the alumni donor in the questionnaire 
file (year of graduation or last attendance held 
constant). The overall sample was composed of 
sixty-eight 1963 alumni (31%), fifty 1964 alumni 
(23%), and one hundred 1965 alumni (46%). 

1 Requests for reprints should be sent to Warren 
S. Blumenfeld, Department of Management, School 
of Business Administration, Georgia State University, 
Atlanta, Georgia 30303. 


Independent Variables 


iilable at time 
a variety of 
ristics investi- 
he spouse at- 
luation or at 
tended within 
oniversity, type 
associate de- 
degree, and/or 
financial as- 
n in athletics, 
Social, hon- 
from another 
ister’s degree, 
undergraduate 
de point aver- 
other institu- 


Demographic characteris 
of graduation were collecte. 
existing sources. The 22 ch 
gated were sex, whether or 
tended the university, age 
date of last attendance, scù 
the university, degree from 
of degree obtained (certifica 
gree, bachelor’s degree, mast 
doctoral degree), whether o: 
sistance was received, partici 
organization membership (rei: 
orary, and/or professional), de 
institution (bachelor’s degree 
and/or doctoral degree), maj‘ 
grade point average, graduate 
age, and hours transferred fro 
tion to the university. 


Data Analysis 


An item analysis was conducted with the 
demographic information of nor records 
and 59 nondonor records. As weights a 
described by England (1971) were developed 
The weights were then cross-validated with the 
records of hold-out groups of 50 donors and 80 
nondonors. 


RESULTS 


Item analysis indicated that 7 of the 22 chat 
acteristics appeared to differentiate between te 
donors and nondonors. Donating alumni Wet 
most likely to be (a) male, (b) business schon 
students, (c) Georgia State University gradu 
ates, (d) holders of a master’s degree 1 
Georgia State University, (e) economics 1 
(f) with high or low undergraduate grade we 
average, and (g) with moderate to high gradua 
grade point average, Item weights were a 
and a biserial correlation of 37 (P< 01) ¥ 
obtained in the cross-validation. 
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ISCUSSION contribute financially is a feasible and institu- 
tionally meaningful objective for alumni rela- 
tions management (and other fund-raising-ori- 
ented agencies) to pursue. 


The data ii d that it is possible to fore- 
cast, at a be n chance level, the financial 


donation beh f one set of alumni based on 
available den ic data. A behavioral pro- 
file can be d d (and cross-validated) that Se ee 
can pre-identi se individuals most favorably RERERENCES 
predisposed À contribution The agency Buel, W. D. Evaluating mortgage loan risk. Journal 
attempting t it funds might segment the of Applied Psychology, 1968, 52, 399-402. 
market in te (a) types of appeals to be England, G. W. Development and use of weighted 
made to dif ily disposed individuals and application blanks. (Bulletin 55) Minneapolis: 
(b) amount ention given during subse- University of Minnesota, Industrial Relations Cen- 
quent fund-r activities. Other applications ter, 1971. 
of the abilit forecast donation propensity Morris, D. Anatomy of the big donor. Alma Mater, 
may also exis 1970, 37(5), 23-27. 

In conclusi ve data reported suggest that 
the predictio who will and who will not (Received June 6, 1973) 
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EFFECTS OF SPECTRAL DIFFERENCES 
ILLUMINATION ON FATIGUE 


JAMES B. MAAS, JILL K. JAYSON,? ann DOUGLAS A. SER 


Center for Improvement of Undergraduate Education, Cornell ity 


An experiment was conducted to determine the effects of differe 
environmental illumination on fatigue after a period of studying. 
differences in the subjects’ self-reported states were identified. Ho. 
tive measures revealed less perceptual fatigue and better visual 
lighting which closely approximated the spectral quality of natur 


than under traditional cool-white lighting. 


The present investigation was an initial at- 
tempt to explore the relationship between spec- 
tral differences in environmental illumination 
and general and perceptual fatigue in humans 
performing mental activity. In this case, subjects 
studied in a room under lighting conditions which 
were varied over a period of evenings. Because 
of the dearth of prior research on which to 
base assumptions, and as a conscious step to 
reduce experimenter expectancy effects, no spe- 
cific hypotheses were advanced as to the relative 
advantages of the different light sources tested. 


METHOD 
Subjects 


Twelve male and 29 female undergraduate 
students took part in the experiment. Subjects 
were chosen randomly from a larger pool of 
introductory psychology students who had volun- 
teered to participate in research. 


Procedure 


A university classroom was converted to a 
study room and equipped with two independent 
fluorescent light sources installed in the same 
fixtures. The first, Vita-Lite“ (V),® was a full- 
spectrum light closely approximating the spectral 
quality of natural sunlight. The second was a 
standard cool-white fluorescent light (CW), com- 
monly used in institutional settings. The physical 
structures of the lamps are identical; only upon 
direct comparison are they seen as differing in 
the color or the light they emit. An analysis of 
energy emitted by the two lights at each fre- 
quency is presented in Figure 1. 


Requests for reprints should be sent to James B. 
Maas, Center for Improvement of Undergraduate 
Education, Cornell University, Rand Hall, Ithaca, 
New York 14850. 

2 Now at Georgetown Medical School. 


Developed by the Duro-Test Corporation, North 
Bergen. New Jersey. 


The lamps were ins 
translucent covers and < 
located directly behind ti 
mination of the lights wa 
at 82 footcandles. 

The experiment made u 
on each subject thereby c 
ity due to individual diff: 
studied in the room four 
consecutive days, Subjects 
each of the experimental 
for ordering and position 
anced design was used that 


behind standard 
led from a room 
ssroom. The illu- 
ized and equated 


repeated measures 
lling for variabil- 
es. Each subject 
s a day, on fout 
t two days under 
tions. To control 
ts, a counterbal- 
rnated the light 


ing conditions from CW. CW to vong 
CW-YV across subjects. i 1 
All subjects, and the people involved in acti 


e to the experi- 


data collection, were kept h 
t nature of tht 


mental conditions and the a 
dependent variables being ix gated, Subjec® 
were instructed to bring to session enous 
course homework to keep them occupied for 7 
hours, and they were asked not to leave 
seminar room. d 
The tests for fatigue were administered to = 
subject during the first and last 15 minutes 
each four-hour session. e a 
In attempting to measure fatigue, this Gee i 
ment considered two separate phenomena: 11 
jective fatigue (feelings of weariness, ennu, 15 
tiredness, obtainable only through 5 1 
and objective fatigue (fatigue that can be 0 i 
tained by objective measures of decremen ost- 
performance on various tasks). Pre- and 59 
treatment measures included a subjective 4 | 
tionnaire and three objective tests. edo 

The subjective questionnaire was compos wis 
three sections. The first section of dae eg id 
designed to ascertain the subjects’ overal 17 7 
ness and fitness to study; this section ques 
somewhat in the prestudy and poststudy ty 
tionnaires to avoid a response set to the P stion⸗ 
lar items. The second section of the que 
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RELATIVE IRRADIANCE 


TT 


— — 
12 EF ee i 


Wavelength in nanometers. 


‘ve spectral irradiance distributions of the fluorescent test lamps compared with natural 


at S500 K (as specified by the International Commission of Illumination). 


FIGURE 1. Re 
pane was a checklist for measuring subjective 
atigue, developed by the United States Air 


Force (Pearso 


: & Byers, 1956). The checklist is 
composed of 


statements concerning feelings 


of fatigue. s were asked to check whether 
they felt “better than,” “the same as,” or “worse 
than” the de d feeling. The final portion of 


e contained 21 bipolar items 
5 ags of fatigue and general reac- 
tions to the atmosphere of the room, such as 
Warm-cold, tense-relaxed, dark-light, and pleas- 
ant-unpleasant. A format similar to the semantic 

ferentia] was used—a five-point scale with the 
endpoints defined by the pair of words making 
up each item, 

The objective measures used were based on 
ma tasks, since vision is the sense most 
as by study and environmental illumina- 
5 8 he Critical Flicker Fusion test developed 

imonson (Simonson & Enzer, 1941) was 


Used to assess both visual and central nervous 
System fatigue.“ 


== 


iR 5 19 5 j : 
or a general review of critical flicker fusion 


Se Simons 2 F 
Saito 6487755 and Brozek (1952), and Kogi and 


A Lafayette Model 12025 Critical Flicker 
Fusion Instrument with viewing chamber was set 
at 90% illumination/cycle and Jamp intensity on 
bright. The critical flicker fusion for each sub- 
ject was determined by the staircase method of 
limits, with the same starting point and darkened 
surround in the viewing chamber across both 
conditions. 

Visual acuity was measured with a standard 
Snellan Eye Chart. Subjects were seated 15 feet 
from the chart and instructed to read as many 
lines as they could with one eye, then the other. 
The number of the line on which they made two 
or more errors was recorded. No feedback was 
given, to control for practice effects. 

A pursuit rotor test was used to measure 
hand-eye coordination. The apparatus consisted 
of a Lafayette Model 30013 Photoelectric Ro- 
tary Pursuit Instrument connected to a repeat- 
cycle timer and a clock/counter. Subjects had te 
track, with a stylus, a light traveling counter- 
clockwise around a triangular path at 6⁰ 
revolutions per minute. The test required five 
trials of 60 seconds each, with a 10-second res 


between trials. 
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RESULTS 


The first step in analysis involved examining 
each individual’s change in performance over 
each four-hour study session. This was done by 
computing difference scores (from pretest to 
posttest) for all the variables of the fatigue tests. 
To preserve the control for order and position 
effects, the scores from the two sessions under 
the same lighting conditions were averaged to- 
gether. This yielded two difference scores (CW 
and V) per variable for every subject. Two 
mean scores for each variable across all subjects 
were then computed to see if there were signifi- 
cant differences in performance measured under 
each of the lighting conditions. 

Only one of the subjective variables (lively— 
lethargic) showed a statistically significant dif- 
ference between lighting conditions. This differ- 
ence demonstrated a tendency for subjects to 
become less lively or more lethargic after four 
hours under the cool-white light, while there ap- 
peared to be no change under the Vita-Lite con- 
dition. But the fact that it is the only difference, 
and is not qualitatively very different from some 
of the other subjective variables, suggests that 
this result was quite likely due to chance, 

On the Photoelectric Rotary Pursuit test there 
Were no significant differences between the two 
Conditions. However, the other two objective 
measures, flicker fusion and visual acuity, showed 
significant differences. The presumption behind 
critical flicker fusion is that a neural system that 
can respond to high rates of intermittency with 
an intermittency of its own is functionally better 
than one that responds by continuity, Therefore, 
the higher the critical flicker fusion, or in the 
case of this experiment, the smaller the decrease 
over time, the better. The decrease in the criti- 
cal flicker fusion was significantly less under 
the Vita-Lite condition than under cool-white 
(b < .05). 

In measuring visual acuity, a comparison is 
made between what a subject can see at 20 feet 
and what a “normal person” can see at other dis- 
tances. A high test score, for example, 100, indi- 
cates poor visual acuity, since the subject sees 
clearly from only 20 feet what others see from 
100 feet, Difference scores for this test were 
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computed by subtracti etest scores from 
posttest scores, and the ive mean (680 
obtained for the Vita-L ition suggests that 
subjects could see more after four hour 
of study under the Vita. hile their perform. 
ance under cool-white ied the same. In 
other words, the effect Vita-Lite condi. 
tion was actually to enh isual acuity over 
time rather than merel ninimize fatigue 
This may have been du ly to adaptation, 
with the visual process ig to the condi- 
tions after a few hours ly. If so, the re- 
sults suggest that it was to adapt to the 
Vita-Lite or, alternative! the adaptation 
to the cool-white light w ated by fatigue. 
There were no initial diffe in visual acuity 
between the two conditi n the pretests, 
thus discounting the possi! that the adapta- 
tion to the Vita-Lite was r to poor pretest 
performance in the unusual ng environment. 


Discuss: 
Tt is clear that this lin te 
should be expanded; other spectra s 
5 tasks should be 


be experimentally manipula 
changed, and individual d:i 
considered. The results of 
the promise of further rese 
environment and human beh 


nces should be 
study point to 
on the visual 


OF. 
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ECT OF AMBIENT ILLUMINATION ON NOISE LEVEL 
OF GROUPS 


MARK SANDERS,! JERRY GUSTANSKI, axo MIKE LAWTON 
California State University, Northridge 


ımbient illumination level was varied in a natural setting. Unobtrusive 
ires of the noise produced by groups of people gathering in the area 
led a significant (p < .001) reduction in noise under the low-illumination 
ition, Suggestions for future research are presented. 


Isaac, in ries of studies (Alexander & 
Isaac, 1965 & DeVito, 1958; Isaac & 
Reed, 1961 cid that ambient illumination 
was an imp determinant of motor activity 
in animals. cific effect of increased illumi- 
nation appe o be dependent on whether the 
animal is d or nocturnal. Illumination in- 
creased the ty of the diurnal rhesus monkey 
(Alexander € ic, 1965; Isaac & DeVito, 1958) 
and decrea e activity of the nocturnal cat 
(Isaac & R: 1961). 

In the pr study, it was proposed that the 
noise level)? ed by people (a possible mea- 
sure of aroi could be varied by altering the 
level of amis illumination. 

METHOD 

The experi was carried out unobtrusively 
in the halls » classroom building at California 
State Unive: , Northridge. Before the dis- 
missal of classes, students, waiting for the next 


Class, regular gather in the halls. They typically 
Sit against the walls in pairs or small groups and 
talk, One exparimenter sat along the way ap- 
proximately half way down the 10 & 135 foot 
hallway with a General Radio 1551-C sound pres- 
Sure meter concealed in a briefcase such that 
only he could see the instrument. The meter was 
set on the “C” weighting with the damper cir- 
cuit engaged to eliminate the effects of peak 
Sounds. A second experimenter unobtrusively 
counted the number of people in the hallway. 
„Prior to creation of the illumination condi- 
tions, and before students began to gather, back- 
Stound noise levels were obtained. Only two 
illumination conditions could be created due to 
electrical wiring constraints. The high-illumina- 
tion condition (the natural environmental condi- 
tion) consisted of having all 20 large overhead 
uorescent light panels (48 inches X 24 inches) 
— 


8 * Requests for reprints should be sent to Mark 8. 
anders, Department of Psychology, California State 
niversity, Northridge, California 91324. 


in the hallway lit. This produced from 15 to 25 
footcandles of illumination measured at waist 
height by a Weston Illumination Meter, Model 
756 with Viscor Filter. The low-illumination con- 
dition was created by turning off two thirds of 
the light panels in an on-off-off pattern, This 
provided an irregular pattern of illumination, At 


waist height, the illumination under the „off“ 


panels was 1 footcandle, under the “on” panels 
the illumination was 25 footcandles. Halfway 
between the on and off panels the illumination 
measured 8 footcandles indicating a rapid drop- 
off in illumination as distance from the on panels 
increased, 

The lights were turned off for the low-illumina- 
tion condition 45 minutes before the observation 
began, and the light switch was taped with a note 
marked “loose wire.” This was done to prevent 
students or faculty from turning on the lights 
during the experimental condition. 

Under the high- and low-illumination condi- 
tions 10 observations were made. Time of day, 
day of week, and location were equated in the 
two conditions. 


RESULTS 


Even though a damper circuit was used on the 
sound pressure meter, readings still varied over 
a 2 to 5 decibels (C scale) range. The midpoint 
was therefore used as the data point. Decibel 
readings in both the background and-illumination 
conditions were converted to sound pressure 
ratios. Background sound pressure ratio was then 


subtracted from the sound pressure ratio re- 
corded under the appropriate illumination condi- 


tion, and the difference was converted back to 
decibels (Peterson & Gross, 1967). 1 
The mean noise level in the low-illumination 


condition was 50.3 decibel (C scale) with an 2 
average of 12 people present. The mean noise 


level in the high-illumination condition was 61.1 
decibel (C scale) with an average of 12 people 


present. An analysis of covariance was performed 


on the data, using the number of people present 
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as the covariate. The difference in mean noise 
levels was significant at the. 001 level (F = 33.1, 
j dj = 1/17). 
; Discussion 
d 
1 


The results indicate that ambient illumination 
can effect the noise generated by people in the 
environment. Due to the unobtrusive nature of 
4 the experiment, several important variables were 

not controlled. First, the duration of exposure to 
the altered environment (i.e., low illumination) 
could not be controlled. Some students entered 
the darkened environment immediately before 
measurements were made, while others had been 
; in the hallway for up to 30 minutes. 

Further research should investigate the effects 
of the duration of exposure. It is possible that 
the reduced noise level in the low-illumination 
condition may be a transitory phenomenon. Pos- 
sibly, with prolonged exposure, noise levels in 
high- and low-illumination environments may 
become more nearly equal. Nevertheless, even if 
the effect is temporary, there may still be some 
practical utility in reducing ambient illumination 
in areas, such as hallways, where it is desirable 
to reduce the noise generated by people in the 
area. 

In the present study, the low-illumination con- 
dition resulted in a heterogeneous illumination 
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Cullen, Ha 
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nulators and devices emphasize 
he subject on visual or bodily 
cful to know (a) if there are 
ces in the degree to which sub- 
vices depend on such cues; and 
ins of simulator use affect these 


sch and Witkin (1948) provides 
casuring the visual field depen- 
ce of individuals. Some encour- 
: characteristic may be relevant 
mance comes from work by 
and Kidera (1969) who found 

generally were less field depen- 

engineers, and by Benfari and 
no found that field independent 

a tracking task better, and that 
pilots were unduly influenced by 
the frame around the visual scene. 

The objectives of the present study were to 
obtain preliminary data on the field dependency 
of pilots and personnel, at the Langley Research 
Center, Hampton, Virginia, used as simulator 
subjects. Specifically, the distribution and degree 
of field dependence were compared for male, 
female, and pilot groups, and for two task condi- 
tions frequently examined in simulators, namely, 
Supine versus erect position of the subject. An 
additional aim was the examination of alternate 
Measures of field dependency. 


METHOD 


Subjects 


A total of 37 subjects, both male and female, 
tanging in ages from 22 years to 52 years, was 
—— 


"Requests for reprints should be sent to Jacob H. 
tenstein, Aerospace Dynamics Branch, Flight 
ics and Control Division, NASA Langley Re- 

æarch Center, Hampton, Virginia 23665. 
*Now at the U.S. Veterans Administration, Vet- 
E Administration Hospital, Perry Point, Mary- 
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VISUAL DEPENDENCY IN THE ERECT AND 
SUPINE POSITIONS 
JACOB H. LICHTENSTEIN * axo RAYFORD T. SAUCER * 
NASA Langley Research Center, Hampton, Virginia 


‘eld dependency-independency of 37 subjects was examined under supine 


ubjects exhibited greater field dependency in the supine position than in 

rect position, The results suggest the need to consider three separate 

ires of field dependency (slope, breakpoint, and variability). Posible 

cations for the use of subjects in simulators involving visual and body 
discussed. 


tested. The group consisted of 8 females and 
males; 9 of the group were male pilots. 


in two positions: (a) erect, where both gravity 


for aid in alining the rod. In the erect condi- 


LETSE 

iets 

FE 
FE 


23 
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driven and could be at any desired angle 
between —45 degrees and +45 degrees by remote 
control, with the resulting angles read from 


rem t 
the rod-and-frame apparatus were mounted at 
opposite ends of a platform. This permitted 


5 
i 
$ 
5 
1 
G 
F 
i 
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The subject was placed in cither the erect or 
supine position. He was allowed about three 
e ere acclimated to the dark, during 
which time he was informed that the rod and 
frame would be set at various angles to the 
vertical (sometimes at the same angle and some- 
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SLOPE SLOPE 
Ficure 1. Histograms of the slope distributions for the test grou 
the rod and frame were being set for each new on the subject; the break; on the 
pair of test angles. hand, indicate how far this i ce will 
The range of angles used for these tests was This suggests that a criteri r desi 
£40 degrees for the rod and +45 degrees for the person’s dependency on thi ıl field 
frame. The same 55 different rod-and-frame include both the slope and oint 
angle combinations were presented to each sub- tion and perhaps the standard ation A 
ject, although a different order of presentation of response at a single angle rticularly 
was used in each case, Auditory cues for setting is beyond the breakpoint v could les 
the angles were minimized, These tests usually an entirely different conclusic: om the d 
ran between 45 and 60 minutes. of another angle only a coupi: degrees 
The slope measure develope the slo 


RESULTS AND DISCUSSION ê 


An inspection of all of the graphical presenta- 
tions of subject response showed that the scores, 
rather than dividing into two groups, field inde- 
endent and field dependent, were distributed 
icross the range from those weakly influenced by 
he visual field to those strongly influenced. It 
was also found that for most subjects, through 
he central angle region, the frame exerted a 
early linear effect on the subjects’ responses 
ip to a breakpoint between 15 and 25 degrees. 

Theoretically, a completely field dependent 
erson’s response in locating the rod is identical 
with the angle of the frame (slope = 1), whereas 
a completely field independent person’s rod re- 
sponse is the same no matter what angle the 
irame was (slope = 0). The slope, therefore, is 
in indicator of the influence that the field exerts 


®The full set of data obtained from the tests for 
il] subjects in both the erect and supine positions is 
resented in a report by Lichtenstein and Saucer 
(1972), which is available from National Technical 
Information Service, Springfield, Virginia 22151, as 
NASA TN D-6883 at $3.00. 


the best fit straight line thr j 7 
the response of the rod against the tilt off 
frame, obtained by an unweighicd | 
procedure. 1 

Figure 1 presents histograms of the ; 
data obtained for subjects in both the erecta 
supine positions for male, female, and 
subjects. Figure 2 presents histograms of 
breakpoints for these same groups. i 

The data showed that the scores for the € 
test sample are disproportionally disi 
toward field independency. This is reflec 
the median slope values falling at .25 
and .35 (supine), rather than .50. 

In the erect position, the slopes for th 
females tested were generally among the 5 
of the population. Although it was © 
surmised that the pilots would be less 
dependent than the general group, the slope 
tribution for the nine pilots was similar 
for all the male subjects. k 

The comparison of subjects in the supine 
tion (with the consequent reduced gravity | 
with subjects in the erect position showed 
for the male subjects, pilots included, the 
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Ficure 2. H 


position incr 
The increas: 
001 level 
025 level ic 
the group in 
hand, for 


dependence on the visual field. 
lope was significant beyond the 
> entire group and beyond the 
`e pilots. The average slope for 
ed from .21 to .34. On the other 
sale subjects changes in body 


position shoes no effect (slopes of .48 and .49, 
Tespectively) 

The metho? of evaluating field dependency 
for the lim angular displacement by using 


only the slope data was compared with the gen- 
erally accepted method for determining field 
dependency (Asch et al., 1948). The relative 
_ fanking of the subjects from field dependent to 
independent, using the two methods showed little 

ference for subjects at the extremes of the 
field dependence-independence distribution. The 
rank difference coefficient of correlation was 
computed for the upper and lower third of the 
Tanking, and the average value was r = .81. How- 
‘ver, for the subjects between the extremes, the 
correlation was low (r= .20). 

Th the supine position the relative ranking 

tween the two methods exhibited the same 
Pattern, better agreement at the extremes and 

Poorer in the center but at a much lower level of 


rr eee eee Se eee 


Snort Norris 


SMI 


correlation. In the present context the correla 
tion coefficients for the extremes and center 
were r= .59 and .03, respectively. 

This pattern of agreement between the two 
methods is understandable because for the field 
dependent subjects the high slope results in 
large angles and for the field independent sub- 
jects both the slope and angles are low. How- 
ever, for the subjects in the center of the dis- 
tribution both the slope and breakpoint are im- 
portant, and using only one type of criterion 
can be misleading. In view of these data, it 
appears that a method for determining the field 
dependency or independency should include the 
slope, breakpoint, and variation data. 


Possible application of the field dependency 


information to simulator operation can be di- 
vid:d into two areas. If the simulator has 4 
rather limited motion, say +15 degrees or less, 
the essential field dependency parameter that is 
applicable would appear to be the slope, The 
breakpoints, usually occurring at larger angles, 
probably have only a minor influence, If on the 
cther hand, the simulator has large angular dis- 


placements greater than +15 degrees, then bot 


g 


| 
7 


* 


5 


the slope and breakpoint should be considered. 
The question of whether field dependent or 


field independent people actually perform dif- 
ferently on simulators or vehicles involving 
visual and bodily cues still needs to be deter- 
mined. The present study at least suggests some 
factors which may need to be taken into account. 


REFERENCES 


Asch, S. E., & Wirxw, H. A. Studies in space 
orientation. I. Perception of the upright with dis- 
placed visual fields. Journal of Experimental Psy- 
chology, 1948, 38, 325-337. 


Benrari, R., & Vitaxt, P. The relationship between 


vertical orientation in the rod and frame test and 


vertical orientation in a compensatory tracking 
task. Bethpage, N.Y.: Gruman Aircraft Engineering N 


Corporation, Research Department Memorandum 
RM-260J, 1965. 


Curren, J. F., Harrer, C. R., & Kiera, C. J. 


Perceptual style differences between airline pilots 


and engineers. Aerospace Medicine, 1969, 40 (4), 


407-408. 

LICHTENSTEIN, J. H., & Saucer, R. T. Experimental 
investigation of the visual field dependency in the 
erect and supine positions. (NASA TN D-6883) 
Springfield, Va.: 
Service, 1972. 


(Received May 1, 1973) 


National Technical Information 


4 
4 


N Mh OBER 


barrier 


upon reading 
were exposed to 


ual difficulty is an important considera- 
the translation of science and environ- 
mental information for public consumption. It is 
especial in view of the importance of an 
informed citizenry in a technological society. Fac- 
tors which increase difficulty can be expected to 
decrease comprehension and to weaken the al- 
ready tenuous citizen role in support and deci- 
on making in this area. 
Funkhouser and Maccoby (1971) suggested 
positive but indirect relationships between read- 
ability, enjoyment, and information gain in a 
study of science articles, Relationships between 
message content and degree of comprehension 
have been studied in terms of many factors, and 
there is little doubt that difficulty inhibits com- 
prehension (Weiss, 1969). 

The examination of numerical data as a factor 
nfluencing textual difficulty, however, has been 
neglected. The few studies that have been done 
are correlational, and they show extremely tenta- 
tive results. Donohue, Tichenor, and Olien (1970) 
found indications of possible curvilinearity in the 
relationship between numerical units and under- 
nding of responses to news stories. Thus, nu- 
cal data may enhance understanding up to a 
point beyond which understanding declines. 

_ It is suggested that numerical data should in- 

crease difficulty by raising the level of precision 
by which textual substantives are modified. It 
imply becomes a case of having to decode more 
3 information in a given message. For example, 
here is less uncertainty and therefore more in- 
formation in the sentence, “There are five to six 
quadrillion tons of air,” than there is in the sen- 
tence, “There are millions upon millions of tons 
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EFFECTS OF NUMERICAL DATA UPON READIN 
AND RECALL OF SCIENTIFIC WRITIN( 
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Traditional “readibility” studies investigating textual difficulty as 3 
in the communication of scien 
exclusively upon the word unit. This study 
time and recall. Half of the subjects, 100 uni 

text containing numerical references, and 
half were exposed to identical text with descriptive modifiers subs 
the numbers. Numerical data significantly increased reading time but 
in the predicted direction, fell short of significance in decreasing rec 


tific information have focu 


examined the effects o 


of air,” though they are 
of words and almost so i 
1970; Bross, Shapiro, & A 
1946; Klare, 1963). 

Reading material which 
mation should be harder t 
take longer to decode. T 
difficulty, reading time : 
should show significant difi 
cal data present and abser 
examined the effects of nur 
ing time and recall in a po; 
cle. Specifically, it was hy 
numerical data present (o 
increase, and (b) recall si 
compared to equivalent tex 
data. Unlike previous studi: 
measure of difficulty, readab 
was controlled, This allowed ony 
to vary in the article. 


METHOD 


Two versions of an article 
were prepared from a publi 
1970). They were identical 
carried 12 numerical references an a 
carried descriptive modifiers where — 
curred in the first. All numerical references Wig 
in familiar units, and no complex mathemattes 
were involved. * 

For example, the version eae numerical 
data reported, “Man is sensitively À 
upon that part of the air composed of one 
and cannot survive much more than a few m 
utes without it.” The version with numer 
data reported, “Man is sensitively me) p 
upon that one-fifth of the air compo q 87 
gen, and cannot survive much more than | 
minutes without it.” 

Otherwise, they were identical in text, iene 
(200 words), number of sentences (10), E 
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posttest-only design in 
ed a brief demographic 
rticle, and completed a 
quis on the facts pre- 
pondents were 100 uni- 
dem male, all members 


nications classes at the 


Stevens Point. Testing 
c academic session of 


ved all portions of the 
and, upon signal, pro» 

end at his own pace, 
he began and finished 
sckets contained identi- 
he article. Half of the 
article with numerical 
eceived the article with- 


LTS 


nd computer analyzed 
rder in which units oc- 
been varied systemati- 
be nonsignificant, Sex 
significant. 


the subject area of the 
conservation, ws measured in the demo- 


1 a S-point (0-4) scale 


had been developed and validated earlier 


ical data were found to have a signifi- 
in the predicted direction, upon arti- 

ding time, with recall and interest con- 
d (5.71 < F <7.15). However, the effect 
recall, though in the predicted direction, 
to reach significance, with time and interest 
0 (50 < F < 3.55). In fact, interest 
reached significance (F = 3.87), and nu- 
data almost went to zero with article- 
time uncontrolled. Interest showed no 
nt effect upon reading time (Table 1). 


Discussion 


e results indicated that numerical data 
reading time, the increase ap 

among individuals of lower rather than 
recall (low: 53.4-66.6 seconds versus high: 
seconds—see Table 1). If we can view 
t hoc basis those who exhibited higher 
dividuals with greater potential for 
sion, it would appear that numerical 
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marily for the lower potential individuals. 
appears to coincide rather well with the 
stricted use of numerical data in science 
in the mass media, which aim at the “lowest 


ized publications as, 
American, which aim at a higher 


with article reading time controlled statistically 
rather than experimentally (respondents’ reading 
time was not limited in this study) results failed 
to reach significance. Control of time of exposure” 
on an experimental basis might be desirable for 
more positive results, Interest in the subject 
area also might respond favorably as an inde- 
pendent variable under these conditions, since it 
would seem that heightened interest might well 
enhance recall. 
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ERRATUM 


In the article by Michael R. Cooper, Charles E. Boltwood, «: 
Wherry, Sr. entitled “A Factor Analysis of Air Passenger Reactions 
and Airport Security Measures as Related to Personal Characteristics 
tives to Flying,” which appeared in the June 1974 issue (Volume 59) o 
of Applied Psychology, Footnote 2 on page 365 appeared incorrect! 
read: The authors wish to thank Victoria Fein Bernstein for her inve: 
ance during the preliminary phases of the study, Thomas M. Ostrom |: i 
ments on the construction of the questionnaire, and Milton D. Hakel, i 
Wood, and the late S. Rains Wallace for their constructive criticisms 
drafts of this report. 


eo 
1-936 


s were returned. 


-search texts contain discussion 
c research related to variables 
nse rates in mailed question- 
r, 1970; Parten, 1950; Simon, 
issions generally include sugges- 
vg the rate of return by use of 


A numbe 
of and sur 
influencing 
maires (cg 
| 1969). Suci 
tions for in 


follow-up i s, inducements, timing of ques- 
tionnaire 2 proper questionnaire format 
and appear and type of return envelope 

| provided, t but a few. 
The pres tudy focuses on differences in 
Tesponse rat a function of having received 
(a) a form us a semipersonal transmittal 


| letter and (“ 
naire, It w 
cantly great 
semipersonai 


colored versus a white question- 
vpothesized that (a) a signifi- 
ercentage of those receiving a 
insmittal letter would respond 


those receiving a form letter and (b) among 
these groups who received the same type of 
transmittal lever, a significantly greater per- 


Sentage of those receiving the colored question- 
naire would respond than would those receiving 
white questionnaire. 


METHOD 
Subjects 


The subjects were 2,123 members of a na- 
ional professional organization, This sample rep- 
Fesented 90% of the total membership of the 
ganization and 100% of the membership re- 
Siding within the continental United States, as 

termined by the organization’s most recent 
membership roster, The organization is com- 
Shee primarily of individuals holding advanced 
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F TRANSMITTAL LETTER AND QUESTIONNAIRE COLOR 
AS TWO VARIABLES INFLUENCING RESPONSE 
RATES IN A MAIL SURVEY 


MICHAEL T. MATTESON * 
University of Houston 


fects on response rates of using a form versus a semiperional trans- 
letter and a white versus a colored questionnaire were investigated 
onnaires were mailed to 2,123 members of a national profesional orgs- 
on. A significantly greater number (p < 001) of returns were received 
the groups receiving the semipersonal letter. With respect to color of 
tionnaire, no significant differences in rate of return were found among 
roups receiving the semipersonal letter, while among the groups receiving 
orm letter a significantly greater number (P < os) of colored question- 


academic degrees, approximately 959% of whom 
hold teaching, research, and/or administrative 
positions in colleges and universities. 


Procedures 


A two-page (legal size) open-ended question- 
naire was developed that requested the respon- 
dent to list title, author, and publishing informa- 
tion for up to cight examples of what in his 
opinion represented significant contributions to 
the literature in his feld. The time needed to 
complete the questionnaire could vary from ap- 
proximately 10 minutes to an hour or more, de- 
pending on how many examples were listed and 
how long the respondent pondered his choices. 
The questionnaires were printed by an offset 
process on good quality paper. Half of the ques- 
tionnaires were printed on white paper, the other 
half on pink. Pink was chosen because, of the 
colors available, it was thought to have the 
highest intensity (brightness). 

The transmittal letter was on letterhead sta- 
tionery, also printed offset. It contained a brief 
discussion of the questionnaire, the uses to which 
the collected data might be put, and a promise 
to send a summary of the results to those re- 
spondents requesting it. Half of the letters were 
printed with a “Dear Colleague” salutation 
and a facsimile signature at the closing. The 
remaining letters had the addressee’s address 
individually typed at the top of the letter, con- 


tained an individually typed personalized saluta- 
tion and were individually signed by the investi- 
gator. All other details in the two sets of letters 


were identical. 

The subjects were randomly assigned to four 
groups. Group 1A received the semipersonalized 
letter and a white questionnaire; Group 1B, the 


semipersonalized letter and pink questionnaire; 
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e TABLE 1 
Treten OF QUESNONNAIRES RETURNED 


en — — 


White | Colored 
Transmittal | Cuestion- | question- | Total 
3 letter naire naire 
Semiperwonal | 318 323 31.9 


243 22.0 


2 Group 2A, the form letter and white question- 
naire; Group 2B, the form letter and pink 
questionnaire. Group 2B consisted of 530 sub- 
_ jects with the remaining groups each containing 
531, A postage-paid business reply envelope was 

included in 3 mailing. Two slightly different 

— ae were used on the reply enve- 

lopes so that responses could be identified as 

coming from a Group 1 or 2 respondent. All 

23 questionnaires were mailed bulk rate. 


RESULTS AND Discussion 


_ Eighty-three of the mailings were returned by 
the post office as undeliverable. This reduced the 
sample size to 508 in Group 1A, 510 in 1B, 512 
in 2A, and 510 in 2B. Of the remaining 2,040 
mailings assumed to have reached their intended 
recipients, 550 were completed and returned for 
overall response rate of 27.0%. 
Table 1 summarizes the results. Among those 
having received the semipersonal letter, 31.9% 
(325/1,018) responded as compared to a 22.0% 
(225/1,022) response from the form letter 
This difference, significant beyond the 
-001 level (t = 5.06), supports the first hypothe- 
sis. Such a difference indicates practical signifi- 
‘cance as well in that the slightly greater ex- 
pense involved in “personalizing” one part of 
mailing appears to have led to an increase 
in returns of better than 44% (100/225) over 
As of the form letter group. 
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The second hypothesis, that 
ing received the same type of 
a higher percentage of those r 
ored questionnaire would resp 
those receiving the white questi 
ported only for the group re: 
letter (t= 1.77, p<.05). Ar 
those receiving the colored que 
an increase in returns of alm¢ 
over those having received th 


z those haw 
nittal letter 
ng the col- 
than would 
re, Was sup- 
g the form 
this group, 
uire showed 
o (23/101) 
te question 


naire. Thus the eye-catching efi f color may 
operate to slightly improve r s where tt 
sponse rates are low, such as v re case with 
the form letter group. The fact color made 
no apparent difference in th sponse 

among those receiving the se: “sonal letter 
suggests that the influence valu the person- 
alization, the effect of which i rhaps to in- 
crease the recipient’s personal i fication with 


the survey project, supersedes fect of color 
as a more generalized, and thu: ersonal, eye 
catching device, 

In summary, these data len: 
importance of personalizing th 
ter in a mail survey and sugge 


situations where a form letter t be 

with its resulting lower rate oí 1, a colored 

questionnaire may improve tl ponse rate 

significantly but not to the leve hed by the 

use of the more personalized form: .:. 
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Apps 
. 
EFF} 
he possible ts of noise in the environ- 
have be: source of increasing con- 
Industri chinery, urban construc- 
transpo ı vehicles, and warning 
tes all coni ote to this special kind of 
ronmental tion. Noise is certainly an 
t which cause emotional responses, 
logical c health problems, or loss 
p (see ¢ Broadbent, 1957; Kryter, 
ile the effie of noise on human per- 
ice have a studied, generalizations 
t the effects of noise on specific tasks 
hard to come by. Historically, there ap- 


ar to be two possible reasons for this situ- 
on. First, the research may have employed 


or categories of performance. Second, 
and insensitive measurement tech- 
5 have been used. 

8 ith respect to performance tasks, Alluisi 
267), Fleishman (1967), and Fleishman 


S Paper is based, in part, on research sup- 
td by the National Aeronautics and Space Ad- 
3 tion under NASA Grant NAS1-9854 with the 
ean Institutes for Research. Work under this 
i. Teported in full in Theologus, Wheaton, 
Now » Brahlek, and Fleishman (1973). 
with BRX, Inc., Washington, D.C. 

ests for reprints should be sent to Edwin 
4 New Moe American Institutes for Research, 
5 Mexico Avenue, N. W., Washington, D.C. 


OF INTERMITTENT, MODERATE INTENSITY 
{SE STRESS ON HUMAN PERFORMANCE * 


RGE C. THEOLOGUS* GEORGE R. WHEATON, axo 
EDWIN A. FLEISHMAN * 


imerican Institutes jor Research, Washington, D.C. 


evaluated the effects of prolonged exposure to two nole stressors 

ind patterned intermittent, 88 decibels) on the performance of 

s requiring different abilities (Reaction Time, Rate Control, and Time- 

The sensitivity of alternate metrics of performance degradation was 

uated within an analysis of covariance design. Sixty subjects were 

assigned to the various treatments, and a series of 2X (Noe 

I Noise Quality Trial Blocks) covariance analyses were carried 

€ effects of random 85-decibel noise on performance depended on the 

t task and performance measure. The Reaction Time task was affected, 

t Control task was not, and the Time-sharing task was affected only 

a utinued exposure to noise. Patterned noise had insignificant effects. 
i ons for predicting and generalizing stress effects are discussed. 


and Bartlett (1969) have pointed up the 
need for research using a wide range of 
standardized behavioral measures which tap 
a large number of different categories of 
human performance. Distinctions between 
mental and motor performance, for example, 
are not very meaningful and do not reflect 
our current knowledge about more specific 
categories of human skills (Fleishman, 1972). 
Generalizations about the effects of environ- 
mental stressors on the various tasks within 
such all inclusive categories of human per- 


formance are likely to be imprecise and diff. 


cult to achieve. 

Research on a variety of factors affecting 
human performance repeatedly reveals dif- 
ferential effects for different aspects of per- 


formance (Fleishman & Bartlett, 1969). Such 


studies have been concerned with the effects 
of stress (Gorham & Orr, 1957), diet (Bro- 
zek, Fleishman, Harris, Lassman, & Vidal, 
1955), high altitudes (Evans & Consolazio, 


1967; Carver & Winsmann, 1968), and drugs 
(Fleishman, 1967, 1972; Elkin, Fleishman, ~ 
Van Cott, Horowitz, & Freedle, 1965). For 


example, in a study (Fleishman, 1967) of 


the effects of the drug scopolamine, the effect 


of a specific dosage was a function of the 
particular ability (e.g., finger dexterity, multi- 


limb coordination, short-term memory) re- 


quired by the task. It was shown that the 
absolute performance decrement, the time to 
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teach this maximum effect, and the time to 
; from the effect, depended on the abil- 
measured by the task. It is important to 
pote that all of the aforementioned studies 
used tasks in which the abilities measured by 
the task had been empirically determined 
(Fleishman, 1967). 

Appropriate selection of tasks is necessary 
but not sufficient for thorough assessment of 
factors affecting performance. Thus, for a 
meaningful evaluation of noise effects, the 
performance metric must be considered in 
addition to the performance tasks employed. 
_ A review of the noise stressor research indi- 
cates that noise may haye its most potent 
effects on metrics other than mean perform- 
ance level, percent correct, etc. (Broadbent, 
1957). Adequate examination of noise ef- 
fects, therefore, requires a wider range of 
‘metrics capable of detecting other more sub- 
tle or complex effects. Chambers (1963), for 

example, has proposed a stress effects tax- 
onomy which may be adapted to reflect the 
possible categories of noise effects. For ex- 
ample, his 19 categories may be grouped into 
five more general classes of performance 
effects. These five major categories are as 
i : (a) changes in performance distri- 
bution over time, (4) decreases in perform- 


The present study is part of a larger effort 
(Theologus, Wheaton, Mirabella, Brahlek, & 
Fleishman, 1973) designed to develop a 
‘standardized performance battery of tasks for 
_ the assessment of the effects of environmental 
and other stressors on human performance. 
The rationale of this program is that a com- 
ination of (a) tasks, representing a range 
of empirically defined abilities and (b) met- 
Tics, differentially describing each effect cate- 
_ ory may provide the basis for a more sensi- 
tive and diagnostic methodology with which 


components should provide quantitative eval- 
uations of the disruptive effects of the stressor 
and of man’s vulnerability to it. The ap- 
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proach recognizes that a particular 
stressor may be studied wi ect to one or 
two modes of performance y often mod. 
ify other aspects as well. 

Specifically, in the stud ted, the ef- 
fects of two noise stresso: he perform- 
ance of three tasks, requi ferent abili. 
ties, selected from a proto tandardized 
assessment battery were eva The evalu- 
ation was designed to p. information — 
regarding (a) the sensitivit tasks in the 
battery to two kinds of n te intensity 
noise (randomly intermitte: patterned) 
and (b) the sensitivity of a ive metrics 
of performance degradatio: iese effects 
were evaluated over relativ. longed pe- 
riods of exposure to noise, ared to the | 
short exposure periods typi: tudied. 

METHOD 
Subjects 

The 60 subjects employed in tl y were male 
Students obtained from area univ: They were 
paid volunteers, averaging 21 yes ge. All sub- — 
jects were screened for hearing d 
Apparatus | 

In earlier work a device was dev i (Figure 1) > 
that allowed the assessment of r separate pet. 
ceptual-motor human abilities (Pa Reilly, Dik 


lon, Andrews, & Fleishman, 1965). Briefly, the sub- | 
ject’s console contains a cathode ray tube (CRT), 4 
pair of control sticks for one- or two hand tracking 
and a variety of meters and b hted push 
buttons. The experimenter’s console contains an aray 
of push buttons and rotary switches for program 


ming experimental tasks. It also includes meters and 
clocks, and counters for recording subjects TE | 
sponses. For the present research this equipment 
was placed in two rooms separated by a one 
mirror. The experimenter and his console were 
cated in one room while the subject and his 
were in the other. 
i 
] 


Abilities Measured 


Of the many tasks measurable on this apparsi 
three were selected for study. The first epar 
task was the Reaction Time task. In this task Sv? 
jects were required to keep the index finger of 75 
nonpreferred hand on a back-lighted push pes 
and to press the button as rapidly as possible Be 
ever the light behind the button came on. The ™ 
of stimulus presentation was eight per minute, 1 
stimulus onsets spaced at approximately equa pil- 
tervals. This task reflected the Reaction Time 5 5 
ity factor, which is defined as the speed with be f 
an individual can respond to a stimulus whe 


INTERMITTENT 


ptual-Motor Console 


Measures of response 


clock. 
Rate Control task which 
in 


re in electronic 


The second t 
Wired sub jc 
MS, point of rich 
Ritter of a CRT 


Se programmir track the dot of light, sub- 
kets manipulat 


ol stick mount 
went 


compensatory fash- 


moved away from the 


sine-wave, first-order, dy- 


Ficur: 

ceptual-n s. (Originally described in 
Fleishmar )pyrighted by NASA, 1965.) 
| their preferred hand, a 
a universal joint. Move- 
tick simultaneously affected 
the CRT through a pair of 
This task represented 
ability t W hich has been defined as 
justments 10 e ippropriately timed control 
t Rishman 95 8 si varying in rate ot movement 
ae 8). The measure provided was inte- 
tracking error recorded on galvanometers 

and y axes 


of the cont 
ad y axes o 
Bependent potent 


BE Rate Control 


eters. 


make 


ea ne task was employed. This 
5 a Mets ko perform the previous two 
rm the 5 heously. That is, subjects had to per- 
hand 3 action Time task with their nonpreferred 


3 . simultaneously tracking the point of 
eir preferred ha 5 
ponents 5 red hand. The use of these two 


Baminatis the Time-sharing task permitted 
T n of the effects of noise on the Reaction 
a 


Me and 
in a Rate Control factors both independently 
Combination. 0 


MODERATE 


(PEMCON), 


INTENSITY 


Norise Sreess 


developed to measure s p 
report by Parker, Reilly, Dillon, Andrews, & 
Independent Variables 
The manipulated variables were noise intensity 
and noise quality. The noise intensities were 0 
decibels (re: .0002 dynes/cm*) and 85 decibels. Noise 
quality was represented by randomly intermittent 
noise and patterned intermittent noise. The random 
noise consisted of 85 decibels of broadband white 


noise delivered in bursts of random duration, vary 
ing in nine one-second steps between one and nine 
seconds. The interpolated intervals of 
likewise random, varying in three one-second steps 
betw This noise averaged 
five seconds on and two seconds off. The patterned 
noise consisted of 85 decibels of broadband white 
noise presented in five-second bursts with interpo- 
lated two-second quiet intervals. 

These noise stressors were delivered to subjects 
through a set of Grason-Stadler earphones (Model 
TDH-39) from a Viking tape deck (Model 230). 
Intensity at the earphones was controlled by a 
Daven step attenuator. The 85-decibel intensity level 
employed in this study is well within the safe limits 
at all frequency levels (Kryter, Ward, Miller, & 
Eldredge, 1966). It has also been shown that the 
conditions chosen were in fact stressful in that mod- 
erate intensity, intermittent white noise has been 
shown to increase EMG, heart rate, skin conductance, 
and systolic blood pressure (Glass & Singer, 1972). 


silence were 


en one and three seconds. 
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TABLE ! 


Quiet 


The experimental design is shown in Table 1. In 
this design the 60 experimental subjects were ran- 
domly assigned to one of six groups (n = 10). Two 


each task, one group was 
assigned to the zero-decibel noise level and the other 
to the 85-decibel condition. While confounding the 
patterned random noise manipulation with sequence, 
permitted the most efficient use of the 
experimental subjects and provided for a reduction 
ol differences among subjects within noise intensity 


Procedure 


On each day of testing, four subjects reported to 
the laboratory at half-hour intervals beginning at 
8:30 am. There were three sessions per day, each 
ing approximately one-half hour. There was a 


subject was brought 
_ individually to his work station, seated at the con- 

lar task. During these trials, the subject was in- 
l structed to put on the earphones, although no noise 
_ stimulus was presented. In this session, 20 trials were 
7 administered, each trial lasting one minute, with one- 
minute rest pauses given at the end of each block 
of five trials. For the Reaction Time task, eight 

presentations of the stimulus light were administered 

within each one-minute trial, In the Rate Control 
task. tracking Performance was continuous over the 
one. minute trial. From the subject's point of view, 
performance within and between trials was unin- 


- During the second session, 
each experimental subject was exposed to the ran- 
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dom noise while performir 
During the third session, t r 
presented. Control subjects no nobe stime 


lation in either the second or sions, although 
they were required to wear t oncs. The tat- 
ing procedures for the second 1 sessions weer 
identical to those for the í on except that 
two one-minute practice tr administered 
prior to the 20 trials, and O trials were 
massed. 
Dependent Variables 

This experiment produced ti ing basic data. 
For each subject assigned to t tion Time and 
Time-sharing tasks, 160 respo: icies measured 
in milliseconds were recorded h experimental 
session. For each subject wł ormed on the 
Rate Control and Time-sharin 20 integrated 
error scores were recorded in erimental se 
sion for the x and y tracking hese basic data 
Were not subjected to analys er, they pro- 
vided a basis for derivation oí jent measures 
specifically tailored to provid: vation on the 
differential sensitivity of select: vrmance mea 
surement categories. ES. 

To provide more stable est of subjects 
performance, the data within ca on were col. 
lapsed into four blocks of five t: ain subsequent 
measures were derived from thes ked data, 

One of the primary reasons fo: research wat 
to determine whether the perfor: measurement f 
categories would exhibit different sensitivity t 
noise. Consideration of the basic i suggested 4 
variety of dependent measures w! could be — 
to represent each of the five major egoriss Which 
we imposed on Chamber's (1963) Q adation tat 
onomy. It was decided to limit initial concern % 


those rather commonly used to describe performanct 


on reaction time and tracking task: such as na 
used in this study. The categories 5°! cted and 
measures representing them were as follows: 


Category 1. Changes in performance distribution 
over time; assessed by (a) frequency of respons 
blocks and (b) mean duration of respons 
blocks. 

Category 2. Decreases in performance rate; a 
sessed by (a) mean response latency and ( 
reaction time. j 

Category 3. Changes in performance EEE 
assessed by (a) total mean integrated pen £ 
error, (b) x axis mean integrated error, (©) ? 
axis mean integrated error. 


The derivation of these seven measures from thë l 
obtained data was as follows. ie 

The first dependent measure used was raed 
sponse latency for each block of trials. This me, 
is conceptually independent of the usual rea i 
time measure. While it includes data whi ! 
normally be thought of as reaction time, 10 ect 
includes longer latencies reflecting time to ble 
the stimulus. Thus, some subjects may be capa? 


quickly, but may fail to perceive the 
it appears. (This distinction is espe- 
in the Time-sharing task.) To nor- 
distributions reciprocal transformation 
was employed 
Í appearance of longer than normal response 
in the data suggested that, in addition to 
in response latency, a lapse-in-performance 
was being measured. It was assumed 
longer responses were related to momen- 
in attention during which the subject 
detect presentation of the stimulus. This 
enon is referred to in the literature as re- 
blocking (Bills, 1937), Since no universally 
quantitative definition of a response 
€ i$ presently available, an operational defini- 
4 had proved useful in a study of long- 
monitoring (Baker & Theologus, 1972) 
d. Therefore, for the present study, a 
mse block was defined for each subject as any 
f which was two or more times longer than 
in of h's 20 fastest response latencies during 
zation session. Given a definition of a 
block for each subject, two measures were 
(a) The number of response blocks within 
of trials was counted to develop the 
me of frequency of response blocks. (b) The 
E duration of response blocks was calculated 
in each block of trials. 
mination of these three derived measures re- 
Mi that the measure of mean response latency 
d the total distribution of scores for each 
While the measures of frequency of response 
Es and mean duration of response blocks evalu- 
in the upper tail of the distribution. A 
ure (representing the subject’s reaction 
Was developed. Since all scores two or more 
than the mean of the subject's 20 fast- 
se latencies were defined as response 
‘Wwe defined all of the scores falling below 
enon as reaction times. Therefore, for each 
the mean of all scores less than a response 
‘Was calculated within each block of trials to 
te the measure of mean reaction time. 
error data obtained on the Rate Control and 
ng tasks were used to obtain three mea- 
Min each block of trials including total mean 
e error on both tracking axes, mean inte- 
error on the x axis, and mean integrated 
the y axis. 


nal ysis 


Primary analytic technique was analysis of 
Pe For each of the performance measures, 
nate employed to adjust the measures from 
experimental sessions was the similar mea- 
ted for each subject on the last 10 trials 

larization session. Fourteen 2 X 2 X 4 
niensity X Noise Quality X Blocks) designs 
Nzed, followed by single degree of freedom 
ted on the adjusted means derived 
Ses of covariance. 
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AT n a a ia 


Resvutts 


The more salient and interesting results 
are presented in Figures 2 through 8. In 
these figures performance curves are shown 


for different performance metrics and the 


three kinds of performance tasks which were 
studied. Performance is shown as a function 
of noise intensity (experimental = 85 deci- 
bels, control = 0 decibels), session (Session 
2 = random intermittent noise, Session 3 = 
patterned intermittent noise), and trial block 
within session. Effects due to the first two of 
these variables were of primary interest. 


Intensity Effects 


While in most of these figures the perform- 
ance of the random 85-decibel noise group is 


shown to be inferior to that of the control — 


group, the overall mean effects as tested in 


the complete analyses of covariance did not 


reach statistical significance. However, a priori 
comparisons between the experimental (85 
decibels) and control (0 decibels) groups at 
each block of trials, especially in the random 
intermittent noise sessions, did reveal a num- 
ber of significant effects. 

As shown in Figure 2, experimental sub- 
jects who were exposed to 85-decibel random 
intermittent noise while performing on the 
Reaction Time task had longer latencies than 
did control subjects at the first (F = 7.12, 
dj = 1/143, p < .01) and second (F = 6.90, 
df = 1/143, p < .01) blocks of trials. Simi- 
lar effects on performance of this task were 


JEAN RECIPROCAL 


RESPONSE LATENCY (seconds) 
w 
8 


ADJUSTED MI 


@ = -@ EXPERIMENTAL 
o@—® CONTROL 


SESSION 2 


SESSION 3 


Ficure 2. Adjusted transformed mean response 
latency at each block of trials in both sessions on 
the Reaction Time task. 


m y - a = 


=e EXPERIMENTAL 
2 CONTROL 


Ficus 3. Adjusted mean reaction time at each 
block of trials in both sessions on the Reaction Time 
task. 


found for the reaction time measure: the 
curves in Figure 3 indicate that mean reac- 
tion time of the experimental groups, per- 
forming under 85-decibel random intermit- 
tent noise, was slower than that of the con- 
trol group (0 decibels) at both the first (F 
= 8.06, dj = 1/143, p < .01) and second (F 
= 6.36, df = 1/143, p < .05) blocks of trials. 
In contrast to the latency results with the 
Reaction Time task the compensatory track- 
ing data from the Rate Control task showed 
no difference as a function of noise intensity. 
This outcome is clearly shown in Figure 4 
for total mean integrated tracking error. The 
x-axis and y-axis data, when considered inde- 
pendently, parallel these functions and sup- 
rt the absence of an effect. 
Finally, the data presented in Figures 5 and 
6 for the Time-sharing task parallel, to a 
degree, those presented for the Reaction Time 
task. While the results do not reach conven- 
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O Ficure 4. Adjusted mean integrated error on both 
the x and y axes at each block of trials in both 
Sessions on the Rate Control task. 
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—e CONTROL 


SESSION 2 


Figure 5. Adjusted transíc 
latency at each block of trial 
the Time-sharing task. 


tional levels of significance, 
initial exposure to noise ir 
sponse latency and reactioi 
the Time-sharing task, rel: 
ance under the no-noise con! 


suggest t 
s both 


Session Effects 


It should be recalled that 
sessions; the first familiariz: 
the task; the second presen 
intermittent noise; and (aft: 
break) the third session pres 
terned intermittent noise. Figi 
8 include the second and third -cssions, dur 
ing which the experimental group was ex 
posed to the noise conditions. The effects 
ported in this section thus may be viewed 
due either to the quality of noise (rand 
intermittent versus patterned intermittent 
noise) or to the first versus second exposure 
to noise. 3 

Significant interactions between noise in- 
tensity and session were obtained on the Re 
action Time task. As shown in Figures 2 
3, both response latency (F = 9.06, df = 1/ 
18, p <.007) and reaction time (F = 9-8% 
df = 1/18, p< .006) decreased across ses 
sions, with the experimental group clearly 
showing the greater improvement. 7 

For the Rate Control task neither the k 
teraction between noise intensity and session 
nor the main effects of session reached sta- 
tistical significance (Figure 4). 3 

Significant main effects due to sessions 
were obtained on the Time-sharing task for 
the measures of response latency (Figure 5); 
frequency of response blocks (Figure 7), an 


d OS P -rn vay 
I or INTERMITTENT, Moperate Intexsiry Noise Sreess S45 
Stal integrate (Figure 8). Mean re- 4 
ponse latenc 6.64, dj = 1/18, p< 10 
02) decrease: sessions as did the fre- 
quency of re blocks (F = 10.30, df = — 
1/18, p< 00 le total integrated track - sal 
ig error (/ 2, df=1/18, p< 03) as l 
improved. H while greater improve- H 20 3 
ment across sessions seemed to be 8 Ppi ; 
associated w experimental group (85 4 
decibels), ses y noise intensity interac- 5 ý 
tions were ni ñcant. * N 
Taken toge ese results with regard to 
session effect xe Time-sharing task indi- 
—.— im ent in the second session SUSSION 2 % 
regardless of er noise was present. Ficure 7. Adjuste y $ 
The data i res 2 EGNE also sug- blocks at each — K — j 
fest that unli ze results for the session dhe Time-sharing task. 
involving ran intermittent noise, where f 
ellects were d, the group under pat- may become manifest only in certain cate- 
termed noise performed worse than the gories of performance degradation. i 
quiet control The performance curves In the present experiment the use of tasks 
for the patt noise and quiet groups known to sample different abilities (selected 
overlap regar: of task or measure. This from a standardized assessment battery), 
appears to be ‘pendent of how disruptive coupled with measures from a stress-effects 
the random n: had been during the preced- taxonomy, permitted demonstration of differ- 
ing session, ential noise effects. For example, performance 
Be ec om Cee on 5 1 Time task n de- 
S ) gta ollowing exposure to randomly inter- 
The results provide support for the ap- mittent noise. By using a degradation tax- 
Proach taken assess stress effects. This onomy, however, it was possible to show that 
approach is precicated on the rationale that the degradation was highly specific. As evi- 


diferent types and levels of stressors will 
Yaniously affect different components of hu- 
man task performance and that these effects 
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Fig 7 
Block of go. Adjusted mean reaction time at each 
task, iab in both sessions on the Time-sharing 


denced by the response latency and reaction 
time data, initial exposure to noise slowed 
the speed at which subjects normally re- 


14.50. 
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Ficure 8. Adjusted mean integrated error on both 
the x and y axes at each block of trials in both 
sessions on the Time-sharing task. 
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sponded, but did not increase the number or 

duration of response blocks, In other words, 

a Category 2 degradation (Le., decreases in 

períormance rate) occurred, while a Category 

1 decrement (i.e, a change in performance 

distribution over time) was not observed. 

The potential value of using different 
tasks to assess stress effects is illustrated by 
the total integrated tracking error data which 
were analyzed for both the Rate Control and 
Time-sharing tasks, As already noted (Fig- 
ure 4) there clearly was no degradation in 
Category 3 (i.e, changes in performance 
accuracy) on the Rate Control task. This 
does not appear to be the case, however, when 
comparable data from the Time-sharing task 
are examined. Although the differences shown 
in Figure 8 do not reach conventional levels 
of significance, they do suggest that Category 
3 degradation is taking place, especially un- 
der the random intermittent noise conditions 
of Session 2. 

These same data, when considered jointly 
with Category 1 (Figure 7) and Category 2 
(Figures 5 and 6) data, strongly suggest a 
different time course for the emergence of 
effects on Time-sharing performance. The 

overall impression is that degradation is be- 
coming more pronounced with the passage of 
time. Had the duration of the initial exposure 
been increased, degradations in all three cate- 
gories might have occurred during Time- 
sharing performance. 

Taken together, the results indicate that 
the effect of 85-decibel random noise on hu- 
man performance depends on the type of 
task and performance measure used. Thus, 
the Reaction Time task was affected, but the 
task requiring Rate Control ability (timing 
of control action) was not. The Time-sharing 
task appeared affected but only after con- 
tinued exposure to this noise. 

The effects of patterned noise of the same 
85-decibel intensity produced insignificant ef- 
fects, regardless of task or measure. 

The combination of taxonomies of tasks 
based on human abilities and metrics of stress 
effects appears to provide a more precise way 
of predicting noise effects. The specificity of 
the current results with regard to both com- 
ponents of performance and degradation cate- 
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gories underscores the 
effects. Although further 
to confirm these results, 
ential sensitivity of meas 
cates the potential of the 
a wider range of stressors 
may permit developme 
method for diagnosing st 
deriving behavioral profil 
ety of stressors. 
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EFFECT OF NOISE ON INTELLECTUAL PERFO! .NCE* 
NEIL D. WEINSTEIN è 
University of California, Berkeley 
The effect of realistic noie levels on an intellectually challenging t 
examined. Thirty-three college students performed proofreading in 
with a background of intermittent teletype noise (70 decibels). As ; 
pole subjects did not differ significantly from quiet subjects in “ 
spelling errors, but were poorer at identifying grammatical errors ( f 
Contrary to expectations, recall of the content of the proofreading 
was unaffected. Detailed analysis revealed that subjects initially work: 
slowly (A< 01) and less steadily (p< 002) during noise bursts than 
intervening quiet periods, but more accurately (p< .01). The resu 
interpreted in terms of the feedback available to the subjects and the 
gics they adopt to cope with the distraction. 
Early studies of the effects of noise on Another possibility is that sks in these 
complex activities like learning, reading com- noise experiments are so str ward and 
ion, and intelligence test performance unidimensional that the best gies to use 
Were often frustrated by inconclusive results are self-evident. These stra‘ may leave 
{reviewed in Broadbent, 1957). Consequently, all measured aspects of the s perform- 
much recent research on noise has emphasized ance unaffected, but in son he must 
er tasks chosen for theoretical clarity and pay in order to maintain and acai- 
tended to tap more fundamental cognitive racy. 
cesses. Vigilance, response time, visual Support for this second nent comes 
rch speed, and the ability to solve simple from a group of studies wh’ ‘ound that 
rithmetic problems have been frequently noise caused performance eriorate if 
ted (Broadbent, 1971; Kryter, 1970). Still, subjects were required to car; out several 
Outcome of these studies, taken as a tasks simultaneously (Finkel; & Glass, 
le, has been no less confusing than the 1970; Hockey, 1970a, 1970 Woodhead, 


results of earlier research, and critics (Kryter, 
1970) continue to maintain that noise poses 
threat to any kind of nonauditory per- 
__ There are at least two possible explanations 

or the disappointing findings of recent ex- 
periments. One explanation is that noise af- 
fects only tasks which are intellectually de- 
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1966). Such studies have demonstrated that 
subjects sometimes maintain performance on 
one activity at the cost of poorer performance 
on others, but the use of rather exotic tasks 
and high noise levels (usually above 90 deci- 
bels) suggests that the effects of noise on 
more familiar activities are insignificant, at 
least in a laboratory setting. 

The experiment described in this study Was 
undertaken with two goals in mind: (a) © 
determine whether noise levels far lower than 
those generally employed would interfere 
with a familiar task which has some real-life 
significance; and (b) to understand how sub- 
jects cope with noise by a more thorough 
monitoring of performance than is usually 
attempted. : 

Proofreading was chosen as the expel 
mental task because it is intellectually de- 
manding, relatively familiar, and because it 
is sufficiently complex so that the optimum 
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is. It was predicted that 
pellings, which can be 
ng a single word, would 
noise, These were viewed 
en recognition, making 
nds on the subject's in- 
capabilities, In contrast 
ual errors, the investi- 
ontextual errors, a cate- 
nistakes like grammatical 
etected only by placing a 
ould be significantly af- 
uld presumably find it 
e individual words into a 
It was anticipated that 
use of its incidental char- 
med difficulty would be 
1015€. 
research by many others, 
ected to change overall 
Still, the mean speed 
: a number of interesting 
veriment was designed to 
to measure the length of 
h line. 


Spent on 
METHOD 
inis experiment were college stu- 


rductory psychology class who 
a course requirement, Random 


mment resulted 


18 subjects in the noise condi- 

Mand 15 subjects in the quiet condition. Subjects 

each group wer nearly equally divided between 
and females. 


freading and Comprehension Tasks 


Proofreading texts were prepared from non- 
= on sources. Selections were long enough so that 
3 subject could complete them in the time allotted. 
o Were introduced at irregular intervals; on 
S average, one line in three contained errors. As 
as three errors could appear in a single line, 
Most incorrect lines had only one error. Except 
the pattern of errors was re- 
All such errors were chosen to 
mutually exclusive categories, 
two different categories were 
ciud same line. Contextual errors 
onm, errors in grammar, missing words, and 
oa ct or inappropriate words (e.g., the word an 
alg S instead of the word and). Such errors 
el, ot be detected by examining each word sepa- 
ž ae only by placing the words in context. 
8 errors included misspellings and typo- 
errors. In each 50 lines of text there were 


“ap 
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seven lines containing contextual errors and chven 
lines with nencontextual errors. 

Each proofreading passage was typed of a . 
roll of paper in a single column approxi- 
mately 60 characters (S inches) wide. The paper 

Presented to the subject on a typewriter, 


rE, 


comprehension tests, one emphasizing recog- 
nition of specific topics and phrases, the other com- 
cerned with an of the concept and 
arguments presented in the selections, were prepared 


for each of the last two passages. 


Noise 


The noises heard by the subject were the, 
unamplified sounds of a teletype machine (Te! 
Corporation Model ASR) placed alongside 
cubicle in which the subject worked. The noise was 
produced in irregularly scheduled bursts lasting from 
24 to 15 seconds. These were arranged so that noise 

one- 


Fi 


ears, the A-weighted sound level was 66 decibels for 
typing and 70 decibels for the paper tape reader. 
In between noise bursts, one could hear the hum 
of the teletype motor, $5 decibels. Subjects in the 
quict condition worked with ambient background 
noise of approximately 36 decibels. (The C-weighted 
values of these same sounds were 71, 75, 68, and 55 
decibels, respectively.) 


Design and Procedure = 


The experiment was presented to the subject as a 
study of the relationship between selected aspects 
of intellectual performance and a number of person- 
ality traits that were to be tested in a later session. — 
Written instructions described the proofreading — 
task and gave examples of all types of proofreading 
errors. No distinction was made between contextual 
and noncontextual errors. Subjects were not informed 
that their comprehension would be tested later. 
Although each subject could work at his own rate, 
the experimenter stated that both speed and accu- 
racy were important and were given equal weight. 
The subject was then seated at the typewriter in 
the cubicle and was permitted to practice proof- 
reading on several sample lines. y 

The time the subject spent on a line and the 
number of errors he detected in it were recorded on 
a multichannel recorder. The output of a micro- 


Ae 
| 


f Mistakes on | Mistakes on | Response time 
noncontertual 


contextual | (second/line) 


1 
error linet ertor line 
(% wrong) | (% wrong) 

Means* 

28 28 +4 | 4.56 4.27 
36 35 s$ 65 | 4.68 | 4.00 
Variances* 

% 120 | 100 90 18 38 
160 | 300 240 430 | 1.16 6 
of lines on which mistakes were made, 


scores subjected to a linear covariance adjust- 
on passage one ar the covariate. 


5 


the noise condition were told: 
will hear some teletype noise in the back- 
N on working.“ Seven 
2 rk on each of the 
d and third passages, with a short rest in 
Thus, the quiet group worked in quiet on 
; the noise group worked in quiet 
and with noise on the last two. 
al presented in the same 


7 
i 
4 
zi 


ys 
of the Proofreading, the sub- 
the experimenter would like to 
the subject could recall of the 
subject then worked 
5 in a fixed order, 
ose dealing with infor- 
presented i the lines he had finished reading, 
experimental session ended with a brief, struc- 
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tured interview which dealt 
uation of his performance ar 
the noise. 


ResuLT 
Comparison of the Noise 


The proofreading spee 
the noise and quiet groups 
analysis of covariance, and 
to examine comprehension 
tions.“ Accuracy was mea 
the proportion of context 
noncontextual error lines 
rect. A line was consider: 
number of errors reporte 
differed by one or more fro: 
ber. The great majority 
errors of omission, Table 
means and variances for ac 
on each of the last two pas There were 
no significant differences | n the two 
groups on the first, contro! ve for any 
dependent variable. 

As anticipated, the effect 
contextual errors was not sta 
cant. The noise group did o 
mately 25% more errors 0. 
but the variation among indi 
ances within each group wa 
the power of the tests was < 
The effect of noise on context: 
agreed with the initial predici 5 
group was almost 50% less accurate, 4 dit. 
ference which is significant at the .001 level 
for both passages, and no improvement was 
apparent from the second passage to the 
third. Roughly 30% of the total variance m 
contextual error accuracy could be attributed 
to the experimental manipulation (see Fleiss; 
1969, for the formula used for this measure): 
A t test confirmed that the effect of noise cn 
contextual errors was significantly greatet 
than the effect on noncontextual errors (? 
05 and < .02 on the second and third pa 
sages, respectively). 

2 All tests reported in this study are two-tailed. 
Statements referring to the power of the sta 
tests reveal the probability that an F or ( V8" 
would prove significant beyond the .05 level if nos? 
actually produced a relative change in perfo re 
of 20%. The figure of 20% was chosen to illustra 
the ability of the experiment to detect a $ 
substantial change. 


iet Groupi 


accuracy of 
ompared by 
st was ied 
two condi. 
in terms of 
or lines of 
were incom 
rect if the 
the subject 
actual num- 
stakes were 
ents group 
and speed 


ise on nom 
illy signifi- 
k approxi- 
passages. 
perform. 
large that 
about .25. 
errors also 
The noise 


inges produced by noise 
omprehension tests, The 
answered an average of 
tions from the combined 
passage and 65.8% of 
he third passage, whereas 
dues for the noise group 
de internal reliabil- 
nsion tests averaged 80, 
he statistical comparisons 
ith selections. 
in Table 1 shows that 
nificant effects on mean 
in spite of the fact that 
tests was relatively high 
he last two passages). 
presented in Table 1 re- 
group was more variable 
The difference between 
ically significant for con- 
racy on the third passage 
esponse time on the second 
. indicating that individual 
nse to noise may exist even 
ny main effect. 


wing Periods of 


analysis of the noise group 
ding performance during 
ith proofreading during pe- 

quiet.“ Significance levels 
using £ tests for correlated 
ms (dj = 17). As presented in Table 2, 
icantly fewer mistakes were committed 

ise lines than on quiet lines during the 
Passage (p < .01), but this effect had 
Minished by the third passage and was no 
Ber statistically significant. 


MS of relatis 
re computed 


gh these differences violate the assumption 
hoscedasticity underlying the analysis of co- 
the results already discussed are sufficiently 
cut that a correction for unequal variances 
| not change any of the conclusions. 

A noise line was defined as one in which noise 
typing or from the paper tape reader was 
t least half of the time. A quiet line was 
Sas one in which no noise was present. The 
Won from the former category of lines with 
mall portion of noise was necessary to 
3 A subtle artifact which would otherwise ap- 
st the speeds on noise and quiet lines were 
ted (Fisher, 1972). 
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TABLE 2 


Praroamance of Nose Susyrets ov Nose axe 
Quer Lives oF Passons Two ao Tune 


== a 


Dependent variables 


| 2 3 2 3 2 3 
Lines | : * — 
Mistakes on | Response time Standard de- 
| erot linet | (second/line) | viation of re- 
| (% wrong) | | sponse time 
j | | (second /line) 
Quiet % | s2 | 387 | 3.01 | 115 | a7 
Noise | 46 | 48 jaar 382 * 138 


* Mean proportion of lines containing either comtestaal of 
Roncontest ual errors om which mi + were made. 

* Passage number. 

‘Group mean of the standard deviation of respone time 
calculated for cach subject. 


By recording the length of time spent on 
each line, it was possible to determine whether 
subjects were faster when noise was on or off 
and whether their response time was as stable 
in noise as in quiet. The values displayed in 
Table 2 show that during the second passage 
the mean response time and the standard 
deviation of the response time were consid- 
erably greater on lines worked with noise than 
on lines worked in quiet (significant at the 
01 and .002 levels, respectively). Similar, but 
smaller, effects appeared on the third passage, 
with both means and standard deviations 
approaching significance at the .1 level. In 
spite of these differences, when all lines are 
pooled, a noise subject exhibits slightly less 
variability than does an average quiet subject 
(significant only on the third passage, p < 
.05). The trend toward both equal speed and 
equal variability during noise and quiet, ap- 
parent in Table 2, was also significant (f < _ 
.05 and p<.10 for means and standard 
deviations, respectively). g 


rr 


Subject Interviews 
From interviews at the end of each session, 

it became clear that subjects did not believe 
that the noise had interfered significantly — 
with their proofreading. While practically all 
agreed that the noise was distracting at first, 5 
they felt that they had adapted to it. Aside 3 
from the frequent comment, “I tried to con- 3 


centrate more,” subjects were unaware of any 
alteration in their proofreading strategy. In 
general, changes in speed were mentioned 
tore often than changes in accuracy. 


Discussion 


_ Although the original hypotheses of this 
study concerning the effects of noise on proof- 
_ reading and speed were confirmed, the expec- 
tation that comprehension would show the 
greatest deterioration was proven incorrect. 
Apparently, subjects can combine individual 
words into a meaningful structure and still be 
unable to detect contextual errors. It seems 
that several different processes must be called 
Upon to account for the results. 

The interviews revealed that subjects were 
very poor sources of information concerning 
their proofreading accuracy, In the absence 
of any other feedback they may be forced to 
use their degree of understanding of the pas- 
Sage as the criterion by which to judge 
whether they are reading carefully enough to 
locate mistakes. Subjects may respond to noise 
by attempting to maintain their level of com- 
prehension, without realizing that they may be 
failing to test the sentences for proper syntax, 
or may be reading what they expect to find 
tten, unintentionally correcting the errors. 
There are several possible reasons why con- 
textual errors were affected more by noise 
8 were noncontextual errors. One explana- 


tion stresses the differences in the cognitive 
Processes required to detect such errors. Pre- 
sumably the pattern recognition skills in- 
volved in identifying noncontextual errors 
, are less sensitive to distraction than are the 


more complex processes involved in contextual 
error recognition. A competing explanation 
emphasizes subjectively perceived differences 
in the frequency of the two types of errors. 
i Because they appear to be considerably more 
frequent, subjects may concentrate their ef- 
fort on noncontextual errors. Though both 
these mechanisms are consistent with the 
Present findings, 
difficulty alone is 
be considered. 
lines, however, showed that for both types of 
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In contrast to the inacı of subjects 
reports concerning their p: ling succes, 
their comments about thei f work were 


fairly accurate. Both the ir 
the eventual adaptation t 
flect faithfully the effects 
bility presented in Table 


sruption and 
entioned re- 
l and varis- 

ability of 


individuals to judge how iey are ac 
tually working and their d resist any 
changes in this rate of wor! explain why 
effects of stress on overall are seldom 
observed. 

Several other experiment: isher, 1972; 
Woodhead, 1964) have no within-sub- 
ject differences in performa: iring inter- 
mittent periods of noise ar iet without 
any overall differences bet noise and 
quiet groups. These obser probably 
result from the greater sta power of 
the within-subject analyses. he present 
investigation, for example, t! elation be- 
tween the speed on noise a: et lines in 
the same passage was .98, n the f test 
for correlated means very | ful indeed. 

Are any of the theories noise or 
stress present in the litera helpful in 
explaining why one response ute rather 
than another was affected in h. experiment? 
One theoretical position holds th noise pro- 
duces an increase in arousal and that it 1$ 
overarousal which interferes with perform- 
ance (Broadbent, 1971). The appeal to a 
change in an undifferentiated bodily state, 


however, seems quite incapable of dealing 
with the relatively complicated pattern of 
heterogeneous effects observed in this expen- i 
ment. : 

More useful is Easterbrook’s (1959) theory 
of attentional narrowing, which differentiates 
between focal and incidental tasks (Lee 
Kahneman, 1973). Easterbrook suggests that 
stress narrows the range of cues which an 
organism can use, leading to poorer perform- 
ance on incidental activities. In the present 
context focal means primary, and incidenta 
means secondary, or less important, in terms 
of the subject’s priorities. Easterbrook a 
Kahneman believe that attentional narrowing 
is caused by physiological arousal due to emo- 
tion, stress, or drugs. It is arguable, hoe 
whether mild but distracting noises produc 


a large change in the 
ther possibility is that 
loyment of attention is 
process. The subject 
ite for his involuntary 
icting stimulus in order 
ince from deteriorating. 
of attention and effort 

s beliefs concerning the 
in overcoming the inter- 
variability of the noise 
investigation, for exam- 
in part to individual dif- 
id chosen to deal with the 


their effects th 
kvel of arous 
the change in t 
primarily a v 
attempts to co: 
attention to th 
te keep his pe 
The new distr 
i determined 


In experime: 
wust impaired 
be the one cor 


th multiple tasks, the one 
ress frequently appears to 
d least important by the 
ctively perceived differ- 
faces in imp e can be established ex- 
ctions (Hockey, 1970a; 
r implicitly via length of 
velty of a particular task, 
quent responses (Bahrick, 
1952), differences in feed- 
ct’s inherent cognitive style 
irus, 1958). 
The preceding discussion suggests that the 
ect of noise on the performance of complex 
‘asks cannot be understood without assigning 
an active role to the subject. The subject 
which strategy he will use on the basis 
Whatever information is available to him 
leg., Hockey, 1970b). Theories which treat 
de subject as a passive figure or hold that 
effects of stress result merely from a 
ge in the physiological state of the body 
te the large differences in performance 
Which can be produced by a small change of 
Tuctions or feedback. 
e interviews and the detailed study of 
noise group provide additional support 
uS point of view. It may be possible to 
plain the slower and more irregular speed 
uns noise bursts in terms of a passive 
Such as a brief failure of attention. 
ems difficult to explain the superior 
3 a on noise lines without assuming that 
i 0. are aware of the distraction and re- 
threatened change in their perform- 
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ance by taking greater care during noise than 
during quiet. 

Why has this investigation produced clear- 
cut evidence of noise interference when others 
have frequently yielded small or equivocal 
results? It appears that the effects observed 
in the present experiment are due to the task 
chosen and to the simultaneous monitoring of 
several different aspects of the subjects’ per- 
formance, rather than to some peculiar fea- — 
ture of the noise source or subject sample. a 
Because the task is complex and no feedback 
on accuracy is available, subjects may have É 
made a strategic mistake in responding to the 
noise. Such a mistake could be corrected — 
readily with appropriate feedback, It is also 
possible, however, that noise interferes more 
directly with the ability to proofread and that 
it would be difficult for subjects to improve 
their performance. The present study cannot 
establish which of these two alternatives is 
most correct. 

In spite of current interest in noise as one 
aspect of a deteriorating environment, the 
use of realistic noise levels and meaningful 
tasks is still uncommon. Without theories 
capable of making accurate generalizations — 
from one situation to another, the use of lab- 
oratory analogues must be recognized as a 
hazardous practice. Investigators interested 
in the practical applications of their research 
must attempt to design experiments which 
duplicate, as nearly as possible, conditions as 
they occur in real life. 
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HARASSED DECISION MAKER: 


PETER WRIGHT?! 


nt simplifying strategies people use in adapting to different information 
g environments were investigated. The hypotheses tested were that 
yperating under either time pressure or distraction would tend to 
tically place greater weight on negative evidence than would their 
parts under less strainful conditions. Six groups of subjects (210 male 
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Perhaps th 
face in evers 
parate pieces 
alternatives: 
portfolios, po! 
vidual equip; 
handling abili: 
sire to accurat 
mize his ultin 
his equally ure 
tive strains of 
Slovie (1972) 
Lichtenstein, 1 
verse ways judge 
chores. These rey 
while some struc 


st pervasive task people 
life is trying to use dis- 
formation to choose among 
sumer goods, investment 
1 candidates, etc. The indi- 
with limited information- 
ust try to balance his de- 
choose options which maxi- 
> consumption benefits and 
needs to reduce the cogni- 
` decision task. Reviews by 
his associates (Slovic & 
suggest some of the di- 
may simplify data-handling 
iews also demonstrate that 
iral properties of the avail- 
able information have been varied in judg- 
ment process research, the adaptations judges 


make under high information load have re- 
“elved surprisingly little empirical attention. 
A decision maker's need to simplify should 
come more urgent when he must operate 
under a heavy information load. Information 
oad is generally conceived as the amount of 
to be processed per unit of time. An 

fase in information load could therefore 
ag from either increasing the amount of 
1 1 which a person must cope or de- 
dean the time available for processing. 
— Sunt of data can itself be increased by 
Requests for reprints should be sent to Peter L. 
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ee now at Graduate School of Business, 
td University, Stanford, California 94305. 


iduates) were presented five pieces of information to assimilate in 
ling cars as purchase options. Three groups operated under varying 
ressure conditions, while three groups operated under varying levels of 
tion. Data usage models assuming disproportionately heavy weighting of 
e evidence provided best-fits to a significantly higher number of sub- 
the high time pressure and moderate distraction conditions. Subjects 
ended to fewer data dimensions in these conditions. 


either increasing the number of decision-rele- 
vant pieces of evidence or by increasing the 
total amount of information in the immediate 
environment such that the individual be- 
comes distracted. High information load is 
perhaps the rule rather than the exception 
for consumers shopping in noisy, crowded, in- 
formation-packed retail outlets or managers 
making decisions under the pressure of dead- 
lines. 

Faced with a decision task of challenging 
complexity, an individual may try to restruc- 
ture that task into a simpler one. He may use 
one of several simplifying strategies. For ex- 
ample, he can try to defer an impending deci- 
sion deadline, physically remove the source of 
the distraction, or move himself to a more 
peaceful locale. Even when a person’s ability 
to control time pressure or interference is 
limited, he may still restructure his task by 
restricting his attention to certain portions of 
the incoming data. He may exclude from 
consideration data about less relevant dimen- 
sions, even though he would consider those 
dimensions sufficiently important to input 
under less taxing conditions; or he may focus 
attention on data in certain regions of each 
dimension. For example, a multiattribute op- 
tion usually offers outcomes which potentially 
range from highly desirable to highly unde- 
sirable. Pieces of data inform the decision 
maker of the positive or negative implica- 
tions of choosing an option. A person may 
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limit his data intake by becoming especially 
attentive only to data about possible negative 
(or positive) outcomes. 

In simplifying as proposed, a decision 
maker accepts some distortion into his sub- 
jectively ideal judgment policy. If he ignores 
~ entire dimensions of evidence, he chooses in 
_ ignorance of what outcomes to expect on 
_ those dimensions. The dimensions he attends 

= to will consequently have relatively greater 
impact on his judgments than they normally 
would. If he focuses on negative evidence, he 
sacrifices awareness of the extent of positive 
outcomes to be expected, and vice versa. It 
3 would be surprising if these two strategies are 
not used concurrently, that is, the harassed 
4 decision maker limiting attention to negative 
(or positive) evidence on a reduced number 
of dimensions. 
_ Whether he is more comfortable ignoring 
E positive or negative evidence may depend on 
_ the payoff structure of the task. Both Web- 
ster (1964) and Canavan (1969) found a 
negative bias when the decision maker’s re- 
ward system heavily penalized his false posi- 
tives while ignoring his successes. In many 
decision tasks, no such well-defined, exter- 
nally imposed payoff structure exists. Even 
so, conditions surrounding the judgment may 
induce the simplifying judge, who feels he 
must sacrifice some of the available input, to 
ignore the positive evidence and insure he is 
aware of impending negative consequences. 
One such facilitating condition may be that 
the option(s) evaluated possess both positive 
and negative features (Abelson & Kanouse, 
1966). Kanouse and Hanson (1971) reviewed 
several streams of research suggesting that 
judgments about objects with good and bad 
attributes are more heavily influenced by 
negative data. Other research suggests a nega- 
tive bias might emerge where personal invest- 
ments (hence, personal losses) are involved 
and where the judgmental context implies 
final commitment to the chosen option (Ein- 
horn, 1971; Slovic, 1969). The present study 
examined one such judgment context, the 
consumer car-buying decision. 
At its extreme, disproportionately heavy 
Weighting of negative evidence amounts to 
using a conjunctive strategy (Coombs, 1964) 
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with multiple cutoffs sepa 
positive outcomes (and 


negative from 
negative from 


positive evidence). Disco a datum sg- 
gesting an option does no iss any cuted 
results in outright reject that option 
Einhorn (1971) has sugg but not dem- 
onstrated) that a conjun strategy is an 
attractive simplifying pr: relative to & 
linear compensatory strat While the ra- 
tionale offered is plausib ght’s (1974) 
results caution that exec: ı conjunctive 
strategy may not necess ve viewed as 


easy by the decision ma! 
person actually using a 
egy may temporarily adj: 


1 any case, 3 
satory strat- 
data treat- 


ment so that negative da accentuated 
without going to the extr: f a strictly 
honcompensatory conjunct ile. Unfortu- 
nately, when the judge observation 


makes errors in translating 
output judgments it is diffi 
these two cases. 


nput data to 
distinguish 


The hypotheses of this „ were that 
when persons make judgmen volving per 
sonal investment, a sense of itment, and 
possible negative outcomes «ii under time 
pressure or when they are distracted, dispro- 
portionately heavy weighting ©: negative evi- 
dence occurs. In contrast, similar judgments 
made under more leisurely conditions reflect 


no dominant evidence-usage pattern since 
individual utility functions vary. 


METHOD 


To test these hypotheses, mathematical models 
representing an unbiased, a negatively biased, and s 
Positively biased data usage scheme were formu- 
lated. All were variations of the general 2 
compensatory model (Slovic & Lichtenstein, 1971). 
The general model is: 


br 1, 2, aa e 


where J(X) is an overall (numerical) judgment of 
an option; X. is a (numerical) scale value for t a 
option of the ith dimension; and b: is the 5 
given the ith dimension as a whole. When a 
judgment is recorded on a response scale and 15 
stimuli are expressed as levels on descriptive SC% 15 
numbers are assigned to the scale points for 9 
this model. The judgment scheme variations ae 
trasted in the present context concerned only 
scale values on the right side of Equation 1. 
An unbiased model (where negative and 104 by 
evidence is equally weighted) was represente 


tive 


|, 


D: 
eint the num 7 to the seven levels along 
ead descriptive on (eg, “greatly below 
wormage” = 1; above average“ = 7). Two 
rate bias vere used. In one model 
(Equation 2), values X. equals 1,...,7 
were transforme og X, The effect was that 
Gherences betw scale point values increased 
penlincarly as riptive scales became in- 
ceatingly negat log 1=0; log 2=.301; 


bg $= 47, 
HX) = by 


778; log 7 = .845). 


log 17 


b. log (Xs), 180 (21 


cales used had an average 
lence might have been con- 
ying below average features. 
as model transformed only 
on of the stimulus scales so 
values were 0, 4, 6.5, 8, 9, 10, 
bias model described (Equa- 
ative Bias Model 1 and the 


Sace the desc 
midpoint, nega! 
wurd as evide 
The second ne 
the below avera 
that the effectis 
MN. The first no 
fen 2) is label 


cond Negative Model 2. 

Two positiv: models were also used. In 
Positive Bias 1 (Equation 3), differences 
between the sc lues increase nonlinearly as 
the scales beco: vasingly positive. 

NX) -b log N 

be log (a XM), [3] 
In Equation 3, arbitrary constant set above 
the highest scale (7 in this case) so the pre- 
dicted judgment ins finite. Positive Bias Model 
2 transformed on the above average section of 

Stimulus scale. the effective scale values were 

12,3, 4, 8.5, 8 

Procedure was to fit each of these models to 
the data of individual subjects making judgments 
Sader different tir pressure or distraction condi- 


more of the harassed sub- 
> ses were best matched by the 
‘ea we Bias Model i or the Negative Bias Model 
i this could be taken as support for the hypotheses. 


Judgment Task 


wier were given descriptions of 30 hypothetical 
a models. Information on five attributes of each 
Was given: (a) selling price, (b) ease of han- 
ag, (e) cost of maintenance, (d) styling, and (e) 
attri comfort. A pilot study had established these 
info utes as generally salient in evaluating cars. The 
mation was expressed as ratings on 7-point 
with endpoints labeled greaily below average 
bel 5 above average; the midpoint was la- 
el average, The descriptions were created so that 
attribute appeared an approximately equal 
tire Si of times and so that each car was a mix- 
imc and negative attributes. This was 
Yery well since the models would not discriminate 
fairly ho Where the evidence about an option is 
implieq by geneous. Subjects were told the norms 
cars EM the average Jabel referred to the class of 
g for less than $4,000. They were asked to 
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treat the available data as credible and as constitut- 
ing their own beliefs about the cars. 

Subjects judged each car according to the likeli- 
hood they would purchase such a car for personal 
use upon graduation from college, The context was 
thus one of final preference rather than preliminary 
screening, and specified an act rather than a general 
evaluation. In the time pressure study, judgments 
were recorded on a 4-point scale ranging from ex- 
tremely high probability to extremely low proba- 
bility. In the distraction study, the scale used to 


ee es Te eer ee 
* ` 


record judgments was a 7-point bipolar scale with 


endpoints labeled likely and unlikely, 


Subjects 


Subjects were 210 male undergraduates enrolled in 
a business curriculum and approaching graduation. 
Each was randomly assigned to one of three time 
pressure conditions (final cell size of 40) or one of 
three distraction conditions (final cell size of 30). 


Time Pressure Treatments 
Three variations in time pressure were created. In 


the high time pressure condition, subjects were told 
to make as accurate judgments as possible but were 


also told that subsequent tasks awaited them. They 
were asked to proceed as rapidly as possible without 
sacrificing accuracy. To increase their awareness of 
time pressure, an assistant recorded elapsed time in 
10-second intervals on a visible blackboard. Sub- 
jects were asked to record the elapsed time on their 
booklet when they finished. Subjects in the low time 
pressure condition were told their only task was to 
accurately judge the cars. Each was told he would 
have 40 seconds to consider the information avail- 
able and should use the entire period. Only when 
the end of each 40-second interval was signaled by 
the assistant could he record his judgment and 
proceed to the next car. The length of a 40-second 
interval was demonstrated to dramatize that it 
offered plenty of processing time. In the undefined 
time pressure condition, instructions were similar to 
the low time pressure condition, but no mandatory 
deliberation period was imposed. Subjects were told 
to proceed at whatever pace suited them. After 
completing the judgment task, subjects were asked, 
“How much time pressure did you feel while making 
your judgments?” They responded on a 5-point 
scale with endpoints labeled very much pressure and 
very little pressure. 


Distraction Treatments 


Distraction treatments were not crossed with time 
pressure treatments. Three levels of distraction were 
created. In all three, subjects were given an intro- 
duction similar to undefined time pressure condition 


and were also forewarned that some noise would 


accompany their task to simulate a natural decision 
environment. In the high distraction condition, a 


taped excerpt from a radio talk show (question and 
answer format, including commercials) was played 


at a moderately high volume through the remainder À 
of the task. In the moderate distraction condition, 


TABLE t 


Paesquency or Best-rrrs rom Data Usace Mounts 
IN Toe Pressure axo Deerescrion 


Teearwents 
— EEEE 
Time pressure 
Data i 
unge — --——— 
| High Low | Undefined 

Unbiased ERS EA 15 
Negative bias (1) | 25 11 13 
Positive bias (1) | 8 | 15 12 


the same tape was played at low volume. In the 
ow distraction condition, taped background music 
from a frequency modulation (FM) station was 
played. In all conditions subjects were assured their 


beled very distracting and not very distracting. In 
addition, they were asked to describe any methods 
they used to handle the distractions. 


RESULTS 


Mean time-per-judgment recorded by sub- 
jects in the high time pressure condition was 
12.2 seconds compared with the standard 40 
seconds in the low time pressure group. Exact 
timekeeping for subjects in the undefined 
time pressure condition was difficult, but ex- 
perimenters“ estimates showed an average 
total time of 10 minutes, or about 20 seconds 
per judgment. Time used is not the optimal 
measure of perceived time pressure, however. 
Mean ratings of perceived time pressure were 
4.67, 2.70, and 2.12 for the high time pres- 
_ Sure, low time pressure, and undefined time 
Pressure treatments, respectively. Analysis of 
variance revealed a significant difference be- 
tween the three groups (F = 8.05, df=2/ 
118, $ < 01). High time pressure subjects 
ſelt more time pressure than did either of the 
other groups, which did not differ significantly. 


Perver Waicur 


Subjects in the distrac: 
asked how distracting 
traneous noise accomp 
Means were 4.30, 3.19, 
distraction, moderate d 
distraction treatments, res 
dj = 2/88, p < .01). Neu 
showed each treatment 
from each of the others. 

For each subject, multi elations wete 
computed between his ac dgments and 
those predicted by the u: 1 model, the 
two negative bias models, the two posi 
tive bias models. For the four, the ap- 
propriate transformations made before 
the scale values were ente: > the regres- 
sion. Two values for 4 (8 ) were used 
in Equation 3 to see what ences would 
occur in the results. The vas minimal, 
and the reported results are = 8. Analy- 
sis at the level of the in ıl was most 
relevant to the question of < fying strate- 
gies. For each subject, the n yielding the 
highest multiple correlation noted, The 
frequency with which each el gave the 
highest multiple correlation wn in Table 
1 


\euls analysi 
significantly 


In computing these freque es, the negi- 
tive bias models were not con :asted against 
each other but were used «!icrnately. The 
same was true for the positive bias model 
Negative Bias Model 1 turned out to be vit- 
tually interchangeable with Negative Bias 
Model 2. Similarly, Positive Bias Model ! 
was interchangeable with Positive Bias Model 
2. Substitutions yielded only two reclassifici- 
tions. Consequently, the frequencies shown 1 
Table 1 are for Positive Bias Model 1 and 
Negative Bias Model 1. Similar analyses re 
frequencies of the highest multiple corre’? 
tions produced by all other model combina- 
tions gave similar results. 

The hypothesis was that operating undet 
pronounced time pressure or distracuion 1 
duces a general tendency among subjects. 
rely heavily on negative evidence. Examining 
first the time pressure effect, no y 
pattern in the weighting tactics of low " a 
pressure or undefined time pressure subjec 
is apparent. All three models provide * 
fits for about the same number of subjec 


ely two thirds of the sub- 

high time pressure were 

assuming heavy weight- 
An overall chi-square 
14.62 (df = 4, p< 01). 

time pressure subjects 
mples from the other two 
ri-square value of 13.83 
Subjects forced to assimi- 
under time pressure did 
vunterparts handling the 
nder less pressure. Thus, 
ubjects under pronounced 
ieavily on negative data 


n study, the emerging pat- 
vat different. Again, the 
dition (low distraction) 
ce that subjects displayed 
nal idiosyncracies in the 
data. Greater frequency 
i occur when a moderate 
noise surrounded the task. 
f the subjects in the moderate 
‘nt were best fit by Nega- 
Yet this pattern did not 
) distractions increased (high 
). Overall chi-square analysis gave 
of 8.62 (% = 4, p< 10). Compar- 
Moderate distraction group against 
psed low distraction and high dis- 
groups gave a chi-square value of 
=2, p < 05). This analysis offers 
lative support for the hypothesized de- 
ence on negative data when distractions 
Strain on attention. 
miting the nature and amount of the 


tion load, the number of sepa- 
dimensions subjects consulted in making 
judgments might reflect this. Conse- 
z the number of dimensions with sta- 
y significant (% < .05) regression co- 
ts was calculated for each subject. This 
an estimate of how many factors had 

tematically used by the processor to 
the final judgment. The maximum was 
pan number of significant dimensions 
bject was 1.50, 2.33, and 2.08 for the 
Pressure, low time pressure, and 
time pressure conditions, respec- 
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TABLE 2 
Mean Mutrivtx Comeetarios ror Toe Pansoss 


AXO DisTRACTION Tetarwaxrs 


Time pressure 


Data usage 
model 7 
High Low Undefined 
Unbiased s6 70 6 os 
Negative bias (1) 62 n 70 Af 
Positive bias (1) s 0 72 5 
s7 70 70 
Distraction 
High Moderate Low 
Unbiased i SO 63 72 6l 
Negative bias (1) 48 6g 70 | 42 
Positive bias (1) v 62 73 | ál 
w HS Ji 
tively. One-way analysis of variance 
(ANOVA) indicated significant differences 


between the three conditions (F = 4.45, df = 
1/118, p<.05). Neuman-Keuls analysis 
showed the high time pressure group used 
significantly fewer dimensions than did the 
other two groups, which did not differ. In 
the distraction study, the mean number of 
dimensions per subject was .98, 1.63, and 
2.15 for the high distraction, moderate dis- 
traction, and low distraction groups, respec- 
tively. Similar analyses showed the high dis- 
traction subjects used fewer dimensions than 
did either the moderate distraction or low 
distraction subjects, and the moderate dis- 
traction group used fewer than did the low 
distraction group (F = 6.32, df = 1/88, p< 
05). 

Just how adequate were any of these mod- 
els in describing the strategies subjects used? 
Frequent best-fits for Negative Bias Model 1 
where predicted would be less meaningful if 
multiple correlations were low. Mean multiple 
correlations for the models are shown in Table 
2. After transforming the multiple correla- 
tions via Fisher’s z transformation, separate 
3 x3 ANOVAs were run for the time pres- 
sure and distraction studies. In the time pres- — 
sure study, the main effects of time pressure 
(F = 46.34, df=2/118, p<.001) and 
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model (F = 6.86, df = 2/236, p < .05) were 
significant, as was the interaction (F = 36.23, 
dj = 4/236, p < .001). The time pressure 
effect, due to the somewhat lower correlations 
in the high time pressure condition, was an- 
ticipated by the preceding analysis of number 
of significant dimensions per subject. The 
interaction, also expected from preceding in- 
dividual level analyses, was due to higher 
correlations for Negative Bias Model 1 in 
the high time pressure condition versus the 
comparably high correlations for all three 
models in the low time pressure and undefined 
time pressure conditions. 

The repeated-measures ANOVA for the dis- 
traction study yielded a significant main ef- 
fect for distraction (F = 67.62, df = 2/88, 
p < .001) and a significant interaction effect 
(F = 12.88, dj = 4/176, p < .001). The mod- 
el's effect was not significant (F = 2.32, df 
= 2/176, ns). The expected interaction was 
due to the relatively higher correlation for 
Negative Bias Model 1 in the moderate dis- 
traction condition versus comparable corre- 
lations in the other distraction conditions. 
The relatively poor-fits generally found in the 
high distraction condition suggest the manip- 
ulation may have disrupted the subjects’ 
processing so much they became erratic. 
Hence the negative bias found among mod- 
erate distraction subjects failed to hold for 
the overstrained high distraction subjects. 
Such an interpretation is partially supported 
by subjects’ posttask descriptions of how they 
coped with the distractions. High distraction 
subjects tended to report frustration and an 
awareness of inconsistency more often than 
did moderate distraction subjects. 

As one final test of how well a negative 
bias model did in explaining the adjustments 
of subjects in the high time pressure and 
moderate distraction conditions, the maxi- 
mum multiple correlation for subjects in 
these two conditions was compared to the 
median maximum multiple correlation for the 
two conditions. In the high time pressure con- 
dition, 17 of the 26 subjects best fit by Nega- 
tive Bias Model 1 fell above the median 
maximum multiple correlation for that condi- 
tion (.635). The corresponding figures for the 
unbiased and positive bias models were 1 of 
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5 and 3 of 9, respective!) the moderate 
„ 


distraction condition, 10 18 subjects 
best fit by Negative Bias I fell above 
the sample median (.67 orresponding 
figures for the unbiased ositive bias 


models were 3 of 5 and 3 respectively, 
Discuss 

The proportionately gre 
best-fits afforded by the ni 


frequency of 
e bias model 


in the high time pressure noderate dis- 
traction conditions appeat support the 
original hypotheses. A tenden for people to 
accentuate negative evide: hen the en- 


vironment discourages lei 
may be indicated. A comple: 
to use fewer attributes in 
stance is also indicated. 
sion maker is pictured as be 
alert to discrediting evidenc: 
dimensions. 


processing 
iry tendency 
ame circum- 
irassed deci- 
g extremely 
few salient 


The perspective taken in resent study 
on how people use evidence n evaluating 
alternatives echoes that of ard (1964) 
and Tversky (1972) in emp! usizing a per- 
son’s immediate “state of mir’ as the major 
determinant of the weights apr od. Repeated 
findings of task effects on a weighting 
policies and of intransitivity in choices seem 
to argue against traditional ideas of a stable 


ing scheme 


utility function dictating a we } 
on maker's 


which consistently molds a d 


evaluation policy across tions. m 
weight a person gives to some aspect of i 
decision problem is in part a function 0 


stable goals and in part a function of immedi- 
ately salient subgoals. The latter may domi- 
nate the former under certain circumstances, 
and vice versa. The questions of interest then 
become identifying the conditions that lead : 
judge to deviate from his optimal ati 
strategy (if such a notion is meaningful) a 
discovering whatever stability exists in the 
manner in which he deviates under those on 
ditions. Situational factors related to informe 
tion overload, two of which were examined in 
this study, seem a promising starting p. ve 
A limitation of fitting mathematical model i 
to input-output data is the remaining A 
biguity about the underlying processes. 5 
of the models used in this study was litera 12 
an additive, compensatory model. The ©? 


‘tion of the results must 
s of differential weighting 
e by compensatory data 
both negative bias mod- 
tudy may be viewed as 
ı nhoncompensatory con- 
For example, Einhorn 
d that if Equation 2 in- 
ansform of the response 
eft side of the equation, it 
conjunctive model. Since 
el 2 bends the scale values 
than does Negative Bias 
iso be seen as a reasonable 
: multiple cutoff strategy. 
> be willing to interpret 
iing more frequent use of 
‘egies under difficult con- 
searchers have, however, 
reating math models like 
igh approximations to a 
to warrant such an inter- 
imbaum, 1973; Goldberg, 
‘re rigorous tests, relying 
ive reports from subjects 
are necessary to sort out 
«tation. 
had had considerable ex- 
evidence about cars under 
they might have devel- 
' shortcuts idiosyncratic to 
interesting question for 
whether people who con- 
n noisy, pressurized environ- 
Ments continue to accentuate negative evi- 
Ence or develop other shortcuts. Such people 
mueht have had the benefit of feedback which 
He subjects in this study did not. Tt is not 
ar whether the types of feedback usually 
Nailable to consumers or managers would 
= them to the deviations they intro- 
the in trying to simplify. At a minimum 
155 pa heed feedback under ideal condi- 
a or insight into their preferred judg- 
Policy, plus feedback about the outcomes 


Mervative inter 
therefore be in 
of negative e\ 
processors. Hoy 
ds used in 

approximation 
junctive strat 
(1971) has pr 
corporates a | 
scale values on 
may be treate: 
Negative Bias 
éven more sha 
Model 1, it mi 
approximation 
Some may the 
the results as i 


‘cautioned agai 
these as close 

junctive m 
(eg 
1971), Clearly 
More on intros 

an this study 
ne precise inter 
Tf these subje 


N 


: of incorrect judgments 
'S simplifying adaptations, not in his 
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preferred judgment policy, It seems it is time 
that judgmental research became more con- 
cerned with the task conditions systematically 
affecting the use of different judgmental 
strategies rather than general model building 
without concern for the task environment. 
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EXPERT JUDGMENT: 
SOME NECESSARY CONDITIONS AND AN EXAMPL! 


HILLEL J. EINHORN * 


Graduate School of Business, University of Chicago 


Expertise is examined within situations where judges deal with multid 
sional information. Psychometric criteria were advocated as being indi 
of expert judgment: (a) Experts should tend to cluster variables in the 
way when identifying and organizing cues. (b) Expert judgment shou 
highly reliable (intrajudge reliability), show both convergent and discrin 
validity, and be relatively free of judgmental bias when measuring cues 
Experts should weight and combine information in similar ways. Thes 
teria were applied to data obtained when three medical pathologists 
the amount of histological signs in biopsy slides. The results showed 
these experts generally met the criteria, although they did not seem to \ 
information similarly. Some limitations of defining expertise in terms o 


7 


above criteria are discussed. 


In an age of increasing specialization, it is 
quite likely that greater numbers of people 
will have to rely on expert judgment. It is 
therefore an important matter to discuss 
some of the conditions that produce “exper- 
tise.” Is an expert simply one who is very 
skillful with training and knowledge in some 
specialized field or are there perhaps more 
objective criteria that can be used? The pur- 
pose of this study is to discuss and illustrate 
some necessary, if not sufficient, conditions 
for defining expertise within a given situation. 
The type of situation to be considered is one 
in which several judges have to deal with 
multidimensional information. This informa- 
tion has to be measured and combined into 
a decision or global judgment. This is a very 
general task, specific examples of which might 
include: (a) radiologists viewing an X ray 
and coming to a diagnosis; (b) businessmen 
making a sales forecast for a new product; 
(c) art critics viewing a painting and judging 
its quality, etc. 

The expert must identify information or 
cues from the multidimensional stimulus he 
encounters. These cues are diagnostic (i.e. 
contain information) about the final decision 
or judgment. The expert’s ability to identify 
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vf extracting 
if noise (€g. 
h cue is re- 


cues can be seen as a prob! 
weak signals from a backgro: 
Green & Swets, 1966). Whik 


lated to the final categorizat t also serves 
the function of leading the ert to other 
cues that will also be of im dance in the 
determination of the final cat--orization. For 


example, consider the cue of duct safety”. 
While this cue can be relate to how much 
sales we can expect, it is alo indicative of 
other cues in the product (ii: price may be 


hen one con- 
with many 
is not unrea- 
s up expecta- 
bles typically 


higher due to safety features). 
siders that the expert may 
cases in the course of time, 
sonable to assume that he bu 
tions regarding both what va F 
covary together and the strength of this 
covariation. : 

Consider that the expert must measure two 


cues that he considers to be highly related, 
for example, the degree of workmanship E 
the quality of material in a particular product. 
If he believes that these cues covary tog 1 
it is quite likely that his measurement of on 
cue will affect his measurement of the 99 
This is similar to what has been terme ia 
logical error (Guilford, 1954); that is, tto 
that logically go together are rated in sim 5 
ways. The relationship different experts if- 
pect between cues may vary due to ther an 
ferent experience and training. Furtherm a 
this is not an error but reflects each 1 
way of organizing information into clus 


E JUDGMENT: Some Necessary CONDITIONS AND AN EXAMPLE 


or dimension m a psychological point 


of view, this ing of information serves 
to reduce the nsionality of the informa- 
tion one has rocess; that is, instead of 
having to pro nany cues, one only has to 
deal with a sn number of general factors, 

While cues being identified and clus- 


> expert is measuring the 
The businessman may say 
tasty”, but how “tasty”? 


tered togethe 
amount of th. 
that a produ 


In many typi situations the expert will 
have to serve ı measuring instrument for 
cues that car be measured in more ob- 
jective ways ( Sawyer, 1966). 

There are t) issues that are important in 
discussing the isurement of cues: (a) in- 


trajudge reliz 
(Cronbach & 
mental bias. 

liability, it s} 
expert can rej 


y, (b) construct validity 
ehl, 1955), and (c) judg- 
1 regard to intrajudge re- 
be obvious that unless the 
e his measurements of the 


cues, there is le more that can be said in 
defense of his ertise. Therefore, high intra- 
judge reliabili: is a necessity for expertise. 
With regard to. struct validity, it should be 


the case that 

have status as 

tionally, we wou 
both convergen 
when they are 
considered met! 
method matrix 


cues being measured should 
planatory concepts. Opera- 
expect that the cues show 
and discriminant validity 
sidered as traits (judges are 
) in a multitrait-multi- 
; ampbell & Fiske, 1959). 
Finally, with regard to judgmental bias, we 
should expect th: expert judges will not show 
Such Well-known biases as leniency, halo, etc. 
(Guilford, 1954; Ghiselli & Brown, 1955). 
ill three of the above criteria should be con- 
Sidered in evaluating expertise. 

After cues are identified, measured, and 
Clustered, the important cognitive work of 
weighting and combining to form a global 
valuation follows. This part of the judgment 
= has received the greatest amount of at- 
a on (Slovic & Lichtenstein, 1971). The re- 

S of a large number of studies have shown 
poes are quite poor at weighting and 

ming information accurately, that is, in 


It has bee 
Objective mea; 
5 Siven, subj 

ey are 
of what 90 


n noted that in situations where an 
Sure of the relationship between cues 
ects find it very difficult to learn them 
different from their preconceived ideas 
together” (Chapman & Chapman, 1969). 
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accord with the objective validity of the in- 
formation, However, objective validity im- 
plies that the criteria used as the bases for 
comparing the global evaluation were them- 
selves relevant to the ultimate criteria in 
question (Thorndike, 1949), In many cases, 
availability of criteria, not relevance, was 
used to define the objective measure. It is 
therefore possible that global judgments have 
more validity than it has been possible to 
demonstrate thus far. 

Since questions of criterion relevance 
cannot be fully answered, and since some 
situations contain no criteria, is there 
some other way of evaluating global judg- 
ments? Let us take the same line of in- 
quiry that was used in the measurement of 
traits; that is, Is the global evaluation an 
operational concept in the sense of possessing 
convergent validity? While it would seem im- 
portant that experts agree on their evalua- 
tions (as a condition for expertise), the issue 
is considerably more complicated. The con- 
cept of agreement can itself be thought of in 
two ways: (a) agreement “in fact” and (5) 
agreement “in principle”. The former refers 
to actual agreement of evaluation, whereas 
the latter refers to agreement with respect to 
weighting and combining policy, that is, how 
the global evaluation is to be formed once 
the inputs are specified. Although one may 
have agreement in principle, there may be 
disagreement with respect to the coding of 
input (note that coding of input is also de- 
pendent on the perceived covariation of the 
cues). Therefore, agreement in fact will be a 
function of weighting and combining simi- 
larity and similarity in coding cues. On the 
other hand, it should not be assumed that 
agreement in principle is easy to achieve, This 
is due to the fact that learning probabilistic 
relationships in the natural environment may 
be quite difficult. This implies that there is 
great room for individual differences in the 
evaluation of the diagnosticity of cues and 
subsequently, large differences in the weights 
assigned to those cues. Furthermore, even 
when training is rigorous and the conditions 
for learning optimal, similar weighting policies 
may reflect training in a particular school of 
thought. It seems that policies would be very 
similar within but not between competing 
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schools (a practical application of this is 
Naylor & Wherry’s, 1965, technique for 
grouping judges on the basis of similarity of 
their weighting policies). 

Although the difficulty of achieving either 
type of agreement is great, it should be the 
case that there is at least a common core of 
knowledge which is germane to a particular 
area of inquiry. Furthermore, expertise con- 
sists, to some extent, of knowing that common 
core. Therefore, as a practical guide it is pro- 
posed that there be at least some agreement, 
both in fact and in principle, for the global 
evaluation.“ 

Wich the discussion of the factors involved 
in expert judgment, there is a need to illus- 
trate, in a concrete manner, how the various 
criteria discussed in this study can be used 
in evaluation of real experts. The purpose of 
the present study is to make explicit the pro- 
cesses assumed to be present in the preceding 
discussion. The details of the study to be 
described appear in Coppleson, Factor, Strum, 
Graff, and Rappaport (1970) and Einhorn 
(1972). 


METHOD 
Subjects 


The judges we considered were three medical 
pathologists. This specialty requires considerable 
training and is of special importance since the path- 
ologists’ report is often the ultimate criterion that is 
used for the diagnosis of various diseases. 


Procedure 


The three pathologists independently viewed 193 
biopsy slides taken from patients having Hodgkin’s 
disease, a cancer of the lymph system. The biopsies 
were taken when the patient first entered the hos- 
pital. The only information given to the pathologists 


was the biopsy slide. All the slides used in this study 


were taken from patients who had died and from 
whom complete data were available with respect to 
the ratings to be described. 

i For each slide, the pathologist had to give his 
judgment of the amount of nine histological char- 
acteristics that were chosen, a priori, to be impor- 
tant. The histological signs were as follows: benign 
histiocytes, malignant histiocytes, eosinophils, necrosis, 
plasma cells, neutrophils, lymphocytes, Sternberg- 
Reed distribution, and Sternberg-Reed cells, For each 
of these signs, with the exception of Sternberg-Reed 
distribution (which was measured on a 2-point 
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cases where experts are from competing 
we might expect high negative correlations. 
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scale), a 6-point scale was ch was de- 
veloped by the pathologists 

In addition to the judgment amounts of 
the histological characteristics, al judgment 
concerning the classification of t ise in terms 
of severity was also made. Thi nt Was mes- 
sured on a 9-point scale (highe: representing 
greater severity). The global j t was made 
after the judgments of the hist« gns. Finally, 
26 of the slides were repeated ach so that 
estimates of test-retest reliabil ability to 
reproduce the judgments) could ned 

Although the judges picked ou gns they ex- 
pected to measure, one would that experts 
with experience in this task wou e little dif- 
culty in recognition of attribute ıs the judg- 
ment of amount would be of n neern, How- 
ever, definitions of each sign wer: iven so that 
any differences the judges had i: egard would 
be revealed in the later analys special im- 
portance is the fact that the mul nsional stim- 
ulus was viewed intact, so that i tions, context 
effects, etc. could play whatever : iey normally 
did in the task. 

RESULTS 

The first question addresse as whether 
the ratings or cue measuremen e reliable; 
that is, Could the expert, viewing 
exactly the same slide on tuo ions, give 
the same response? Mean judge reli- 
ability for the signs ranged f: 19 to .93 
(over the three judges), whi the mean 
global judgment reliability was 05 (over the 


three judges). The mean reliability for the 
three judges (over signs) ranged from .64 to 
74. Judge 2 had a slightly higher reliability 
for the signs, but his reliability on the global 
judgment (.46) was considerably lower than 
that of the other judges (.69, .71). The aver- 
age reliability over both judges, signs and 
global judgments was .69. 

Convergent and discriminant validity were 
examined for evidence of construct validity. 
Convergent validity refers to the fact that 
different methods (judges) should converse 
in measuring the same trait (sign); the 
correlation between judges for the same sign 
should be greater than zero. Discriminant 
validity, on the other hand, is defined by 
three conditions: (a) The correlation be- 
tween the same traits (signs) for different 
judges should be higher than the correlation 
between different traits as measured by dif- 
ferent judges. (b) The correlations bein 
the same signs as measured by differen 


igher than the correlations 
signs as measured by the 
he pattern of correlations 
e in the submatrices of the 
thod matrix. This refers to 
oth sign and magnitude of 
in the heterotrait-hetero- 


the ordering 
the correla: 


multimethod (multisign- 

x is shown in Table 1. 
ıt convergent validity, 29 of 
us between judges’ measures 
ficant at p < 01. Further- 
‘e of these correlations was 
convergent validity, It 
that convergent validity, in 
a single judge is defined as 
ıme as interjudge reliability, 
e amount of agreement was 
erent for the various signs 
tice should be given to the 
atest amount of disagreement 
to the global judgment (.27). 
accord with earlier remarks 
judgment being a composite 
as the mean agreement for 
judges, over all signs, Judges 
cu with each other most (.64), 
$ ! and 3 next (.52), and Judges 

fand 3 agrecin least (.50). 

In analyzing discriminant validity of 540 
fomparisons, there were only 20 violations 
lor 4%) of the requirement that the correla- 
Hon between the same signs for different 
judges be higher than the correlation between 
l erent signs as measured by different judges. 
Additionally, there were only 32 violations 
the 540 comparisons (or 6%) of the re- 
Mirement of discriminant validity that corre- 
ations between the same signs as measured 
different judges be higher than the corre- 
lions between different signs as measured 


* 
4 These conditions can be expressed formally. Let, 
Et; rating on ith slide by jth judge for sth sign. 
Me. i j=j’, ss’. Convergent validity then 
d to ee jr, >0 (correlating over i). The two con- 
NS for discriminant validity are: 
E 
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ute value operators are used, since it is the magni- 


a than the sign of the correlation that is 
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by the same judge. Given that this require- 
ment is quite stringent and the percentage of 
violations small, it seems fair to conclude that 
this condition was also met. Examination of 
the submatrices further showed that the pat- 
tern of correlations was the same in the sub- 
matrices, indicating that the third condition 
for discriminant validity had been met. There- 
fore, it seems fair to conclude that this data 
showed both convergent and discriminant 
validity. 

Guilford (1954) presented an analysis of 
variance approach for dealing with judgmen- 
tal biases found in the use of rating scales, 
While the situation that this approach was 
originally developed for involved raters rating 
employees on various traits, the general idea 
is applicable to any situation where judges 
rate objects on a number of dimensions 
(Stanley, 1961, also discusses this basic de- 
sign). Although Guilford’s procedure is ap- 
plicable to the present data, there is a slight 
difference. In Guilford’s example (pp. 280- 
288), each rater rates each ratee on each 
attribute only once, In the present study, 
using the 26 repeat slides, we have a Judges X 
Slides X Signs design with two replications. 
The last two factors are repeated and there- 
fore an appropriate way to analyze this data 
is given by Winer (1962, pp. 319-337). This 
analysis of variance is presented in Table 2. 

Note first that the main effect for judges 
is not significant. Since this effect represents 
the judges’ overall tendency to rate the 
amount of the signs, this indicates a lack of 
a leniency error. (Leniency in this case refers 
to a tendency to overvalue or undervalue the 
slides in general). The lack of an effect in the 
present context is more in accord with what 
one would expect from expert judges. The two 
other main effects simply indicate that dif- 
ferent slides and different signs have varying 
means. What is of major concern are the 
interactions involving the judges factor. 
These involve the differential effect of judges 
on slides (A X B), on signs (A X C) and on 
Slides X Signs (A X B X C). These interac- 
tions should be nonsignificant; that is, differ- 
ential effects due to judges are not consistent 
with the fact that the shared core of exper- 
tise is influencing their measurement. Con- 
sider the Judges X Slides (A X B) interac- 
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1 of severity and then work at rating 
in accord with this general impres- 


(A xB XC) are 
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action indicates that the judges differentially 
rate the signs; that is, some judges see more 
or less of some signs than others. The three 
way interaction indicates that judges differ 
entially rate signs depending on what slide 
the sign appears in. This is a type of context 
effect that is specific to the judges. However 
it should be stressed that the significant inte 
actions are all quite small. For these particu- 

lar experts then, there seems to be only slight 
bias in their judgmental ratings. 

One final analysis is necessary to complete 
the results on bias. Because of the qualitative 
difference between the global judgment 
the individual cues, a separate one-way analy- 


of variance (repeated measures on each 
for the three judges) was performed on 
de global judgments. The results showed a 
macant effect with Judges 1 and 2 being 
ost different (F, = 2.98, F = 4.61). There- 
Judge 2 saw more severity generally 
3 did Judge 1, although both judges did 
See different amounts of the signs. As has 
n Pointed out before, this is consistent 
remarks made concerning the diffi- 

of achieving agreement on global vari- 


de issue of ‘identifying what the relevant 
> were in this task was simply decided by 
S experts agree on a set of attributes. 
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While this obviously is useful from an ex- 
perimental point of view, its external validity ` 
is quite limited. To investigate the clustering 
of relevant cues, the correlations in the mono- 
judge submatrices of the multitrait-multi- 
method matrix could be examined. A simpli- | 
fying method for comparing the correlational — 
structure of the matrices is to compare their 
factor structure. In order to achieve this, an 
orthogonal factor analysis with varimax rota- 
tion was performed on the correlation ma- 
trices of the cues, one for each of the three 
judges. Four factors were extracted, account- 
ing for 70% of the variance of each matrix. 
These results are shown in Table 3. 5 


_ tained for each judge, 
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TABLE 2 
ANALYSIS OF VARIANCE FOR 
Jupces X Stipes X Sicxs 


Source SS | df | MS V 

Judges (A) 2.38 21.19 <1 
Subjects X Within 3.88 3 1.29 
Slides (B) 100.70 25 6.43 14.61** 
AXB 21.10 50 44 1.47 
B X Subjects 

Within 22.75 75 30 
Signs (C) 975.63 8 | 187.80 | 257.20 
AXC 26.37 16 1.65 2.26* 
C X Subjects 

Within 17.62 | 24 73 
BN C 976.08 200 4.88 13.94% 
AXBXC 159.15 | 400 40 1.14* 
B X C X Subjects 

Within 208.93 | 600 35 


Note, All 5 are assumed fixed. 
*p <00]. 


The rotation accomplished a good degree 
of simple structure as can be seen. Factor 
loadings of .5 or greater (in either direction) 
are circled. Judges 1 and 2 seem to have a 
much more similar structure than Judge 3. 
In order to adequately compare the factorial 
structure of the judges, factor scores were ob- 
This was accomplished 


via the least squares procedures outlined in 


Thurstone (1947). The factor score, for each 
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judge, represents that j 


underlying construct 
factor. In order to con 
similarity, Nunnally (19: 
factor scores be correlate: 
analyses, Since there ar: 
four factors, the results í 
of this procedure can be ci 
a multitrait—maltimethod 
multifactors-multijudges. 
Judges x Factors matrix 
The correlations within 
matrices are all zero duc 
factor analysis. The vali 


circled. It can be seen that 


2, the factorial structure is quite similar (the 


average of these values is 
this matrix shows high di 
(in the sense of the first 


ever, note that both compari 


trices involving Judge 1-] 


2-Judge 3 only show conver; 
the first two factors. For F 
there is no convergent valid 


s score on the 
nted by that 
factor matrix 
aas stated that 
oss the different 
ee judges with 
an application 
stualized within 
nework, that is, 
le 4 shows the 
elations. 
single judges’, 
he orthogonal 
diagonals are 
Judges 1 and 


In addition, 
scriminant validity 
dition). How- 
ms of the ma- 
udge 3, and Judge 
nt validity on 
tors 3 and 4 

(and there- 


fore no discriminant validity). The fact that 


€ Let r = factor score on ith sli 
the rth factor. Then, convergent v 
structure is defined as, PL jr . 


for jth judge on 
y of factor 
ince the factors 


are orthogonal, PF jr Hr, =. Condition (b) for dis- 


criminant validity cannot be 


‘however, becomes, ens -l. 


Condition (a), 


test 


TABLE 3 
FACTOR MATRICES FOR THREE JupcEs 
Judge 1 Judge 2 Judge 3 
Signs Factor Factor Factor 
v- 
a II | m | iw I Teal seer cress ley I IL 1 
enign histioeyte — 080 tee bo Es 95 «09 
peal 6| — 112 155 1002 | —.035 | 018 
rane „191| 143 000 208 011 | —.064 ee | —095| 128 
ot ne aso —"184 | C706) —.387 00 | —.075 bear 
156 402 422 147 245 042 370 456 232 E 
ae 058 | .065 | C875) —.087 | C21 01 =.103 | —.008 429 
S60) —042 | aia] 0074 CZD) .040| 7s 23 | Goad} —.150 | = 
Lymphocytes E 3 5 y! 12 ED) 
— 414 | =. = 85 — —.119 
. S5 113 —.135 Q 174 055 232 | =. 
distribution ee. —.225 
P 222 —052 | 110 046 025 | —105 | 4332 4 
cells 10 =.140 | 081 | CED) 09 104 063 | CD .296 | —.007 | =% 


Note. Factor loadings of .5 or greater (in either direction) are circled, 


Judge 3 is invo! 
Suggests that th 
ferent factorial 
judges. 

This analysi 
judges generally 
Variables, altho 
ferent from the 


>d in these two comparisons 
judge has a somewhat dif- 
ructure than the other two 


indicates that these three 
agree on the clustering of 
Judge 3 is somewhat dif- 
er two. However, the con- 
vergent validation results generally indicate 
à fairly high degree of consensus, 
* order to exar ine the weighting process, 
duns function was assumed to be 
ag (Dawes & Corrigan, 1974). The 
e ts given to the cues were determined by 
e on the global judgment on the nine 
i 10 each judge. Although interpretation 
a 90 in regression is difficult (Darling- 
23 vs the intercorrelations of the cues 
192 ch judge were generally small.” There- 
= Problems of multicolinearity were not 
155 . these weights. The re- 
ae E analysis are shown in Table 5. 
e 5 e seen that the ability to model the 
multip aries among these three judges. The 
Bey ctor for Judge 2 is low, as 
Most 15 to the other two judges, What is 
mportant is the fact that the weights 


The avi 
lute 84 era; 
and 3 lues) 


ge intercorrelations of the cues (abso- 


are .20, .17, and .22 for Judges 1, 2 
» Tespectively, en E 
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TABLE 4 
INTERCORRELATION OF FACTOR SCORES FOR THREE Jupors 
Factor Judge 1 Judge 2 Judge 3 
2 3 4- 1 2 3 4 1 2 3 4 
1 .000 —.000 ( 050 241 087 — 177 E) 1 —.079 08 
Bai 2 100 00 000 —.097 CWD —.229 033 C64) 200 —.019 
3 —.056 238 204 —.081 (274) .182 
4 075 185 | 149 027 122 Ci 
1 00 Gi 22 366 
Judge 2 : —.034 C518) —.040 
3 -28 153 CO 
4 —.025 —.099 308 
1i 100 —.000 —.000 
7 2 | 100 000 
831 1.00 
4 
| 
Note. All correlatio ised on n = 193, The validity diagonals are circled. 


for the cues vary greatly across the three 
judges. Consider Experts 1 and 2. As we have 
previously seen, these two experts have very 
similar factor structures, yet, they weight the 
cues quite differently in coming to their global 
judgment. As a matter of fact, the cue that is 
weighted most heavily by Judge 1, Sternberg- 
Reed cells, is not weighted significantly by 
Judge 2. Consider the weighting policies of 
Judges 1 and 3. They do agree that Stern- 
berg-Reed cells should be weighted most 
highly yet, Judge 3 weights three other cues 
as being important (benign histiocytes, ma- 
lignant histiocytes, and plasma cells), while 
Judge 1 does not. Clearly then, in addition to 
there being low agreement in fact, there is 
also low agreement in principle. From our 
previous results we can say that disagreement 
in fact is mostly accounted for by disagree- 
ment in principle, since the judges seem to 
code the data in very similar ways. 


DISCUSSION _ 


The implications of the results on informa- 
tion weighting for defining expertise are 
problematic. If one requires that experts have 
similar weighting policies, the problem of ade- 
quately defining similarity is immediately 
raised. For example, Judges 1 and 3 do agree 
on the importance of Sternberg-Reed cells, 
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TABLE 5 
FOR CUES IN PREDICTING GLOBAL JUDGMEN 


— — —— —= 
Cues 
Judges |- = "a 25 1 ae 
BH | MH | Eos. | Nec. | PC | Neut. | Lymph. | S-R D ( i 
| = — 
1 | % 12 | 13" | 08 | 08 | — 16+ o so 
2 0% | 07 =| —13° 08 -s | 00 | —.05 0 27% 
3 | e, een, 07 | it eee 0 | —07 | —.ı Agnes 
oN 3 alone f = 193, fis eck stiocytes; MH = ee hist osi P 
ee = maons: PO a ane ean mitin neutrophils; Lymph = Daph ca: e . his listra 
S-R C. = Sternberg-Reed cells, 
*p <.10. 
27 20s 
b <0 
yet they do not agree on the weighting of the A second consideration no! ussed in this 
other cues. On the other hand, Judge 2 is study is how alternatives a t formed in 
clearly different from either Judges 1 or 3. the decision/judgment proce r example, 
It should be mentioned that Judge 2 was a although there may be severa to market 
resident of Judge 1 and was learning this task a new product, the expert be able to 
to some extent. His low correlation Suggests elucidate alternatives that not been 
another possible criterion for expertise—ex- thought of before. Again, thi a creative 
perts should have at least one significant cue process that we know little abo 
weight. The reason for this is that at least A third consideration, somewhat related to 


Some information in the cues should be used 
in forming the global judgment. 

Although requiring experts to have similar 
weighting policies would be a very stringent 
requirement, it would seem that the combin- 
ing of information lies at the core of expertise. 
Learning the important relationships between 
Cues and criteria should be one of the most 
important kinds of learning that is involved 
in mastering a particular area, Therefore, it 
is tentatively suggested that experts should 
exhibit similar weighting policies. 
It was stated at the outset that the condi- 
tions discussed in the present study were 
necessary but not sufficient. This clearly im- 

plies that other factors are indicative of ex- 
pertise. Consider our example of businessmen 
making a forecast of sales for a new product. 
Whereas they may agree on what some of the 
relevant attributes of the product are, the 
real expert may discern/form cues that no 
one has ever seen before. Similarly, the ex- 
_ pert may be the one who can discern and use 
contingent relationships between cues, that is, 
complex interactions among cues. These as- 
‘ects of cue formation and utilization may be 
an important aspect of expertise. 
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the first two, is how the task is tue. BY 
this is meant how the problem is defined 
the manner in which a solution i 
tempted. For example, the process of 
tion search will greatly structure 
Kleinmuntz (1968) suggests 
searches his environment for thos u 
yield the greatest amount of information, and) 
this leads to a shortening of the decision 
tree” used to reach a judgment. These types 
of problems have not been discussed in this 
study since the experiment served to structure 
the task. However, in nonexperimental = 
tings, this structuring will undoubtedly be 0 
great importance. : 

A major criticism of the conditions for 
expertise stated in the present context may be 
that too much stress has been put on agree- 
ment. This agreement has been with respect 
to clustering, measuring, and weighting cues 
As is well known, the history of scien 
replete with oddballs who did not agree “a 
anyone, yet, were proved to be correct bY 
Subsequent events. This criticism has value, 
yet it conceals the fact that eventually some 
relevant criterion other than agreement ae 
came available (possibly due to new instru- — 
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mentation or ual events). However, from Darlington, R. B. Multiple regression in psychological 
, z 

a practical | view, we cannot afford to genre reg practice. Psychological Bulletin, 1968, 
Bee periods of 1 1 bie may be Dawes, R. M. & Corrigan, B. Linear models in. de- 
3 5 t 7 i rene ave ae cision making, Psychological Bulletin, 1974, 81, 

en with simited time perspective. 95-106. 
Therefore, al n agreement is by no means Einhorn, H. J. Expert measurement and mechanical 
the only crit it is one that seems rele- combination. Organizational Behavior and Human — 

. i Performance, 1972, 7, 86-106 

vant to the í on of expertise. _ Ghiselli, E. E, & Brown, C. W. Personnel and in- 

Where a nt, or partially relevant cri- dustrial psychology. New York: McGraw-Hill, 
terion can be wn within a reasonable time 5 ss — i 12 i 

* =e . , reen, D. wets, J. A. Signal detection theory 
period, et a be determined by ones and psychophysics. New York: Wiley, 1966. 
track recor: 11 1 found that two experts Guilford, J. P. Psychometric methods. New York: 
do equally in terms of matching a cri- McGraw-Hill, 1058. 
terion (and is significantly higher than Hammond, K. R. Probabilistic functioning and the 
some base r zue), yet they do not meet 5 method. Psychological Review, 1955, 62, 
the require: discussed in the present 5 


study (espec / that of similar weighting), 
What are we cay? We can only say that in 


a highly pre :bilistic world, there may be 
many routes `< the same goal (cf. Hammond, 
1955) and t-t there may be more than one 
Way to per,, the cognitive tasks involved 
in judgment. 
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OBJECTS FROM ABSTRACT EVALUATION í 


Department of Geography, Florida State University 


Ten employees of Jowa trout hatcheries served as subjects in a stud 
human judgment behavior. In Phase 1, subjects judged hypothetical i 
streams varying in quantity of trout and driving time to stream 
factorial design). In Phase 2 subjects judged real Iowa trout streams 
estimated the quantity of trout and the driving time for each stream. 1 
estimates were used in multiple regression equations, derived from the 
in Phase 1, to predict the ratings of the real streams in Phase 2. Corre! 
between predicted and observed judgments ranged from 0.37 (individual 
ject level) to 0.85 (group level). Results were consistent with empirical 
in information processing and nonexperimental work in geography and 


spatially oriented sciences. 


Individuals daily make judgments which 
vary in their importance to the functioning 
of the person. Some judgments may be trivial 
—what to wear, what to eat—while others 
may have substantial impact—which car to 
buy, what courses to take in a given semester, 
which of several destinations to choose. This 
study attempts to apply experimental and 
analytical methodology, used in the study of 
human judgment behavior, to a practical pre- 
diction problem. 

The study of human judgment behavior 
has received increasing attention (see e.g., 
Slovic & Lichtenstein, 1971). Such behavior 
involves the operations of weighing and valu- 
ation of alternatives, At any point in time, an 
individual’s judgment may be thought of as 


_ comprised of two components: (a) an evalu- 


ation function, that is, the model which the 
individual employs to weigh and value stim- 
ulus entities, and (5) the variance in the 
attributes of these entities over time and 
space. Much previous work has concentrated 
on the idiosyncratic evaluation function of 


_ judges at one point in time (see Slovic & 
Lichtenstein, 1971). Within this research 


v oies 


paradigm, considerable attention has been 
devoted to the so-called “expert” judge 


* Requests for reprints should be sent to Jordan 
J. Louviere, Department of Geography, Florida 
State University, Tallahassee, Florida 32306. 


US 
THEIR ATTRIBUTES: 
THE CASE OF TROUT STREAMS 
JORDAN J. LOUVIERE? 

(Dawes, 1971; Hoffman, 5 & Rorer, 
1968; Slovic, 1969) whose s: ation func- 
tion may be assumed to be if the al- 
ternatives remain stable. a 

Static models may be very seful in ap- 
plied work for short-range pred n in which 
neither the evaluation function of the indi- 
vidual nor the attributes of the opportunities 
are expected to change substa: Hence, 
an experiment to test the uti of static 


models for field prediction should focus upon 
the judgment of entities in a coniext in which 
both the individual evaluation functions and 
the attributes of the entities to be judged 
can be assumed to remain fairly constant over 
time. The purpose of this study, therefore, 15 
twofold: (a) to describe mathematically the 
judgment policy (evaluation function) of a 
sample of expert judges (whose evaluation 
functions are assumed stable) in a laboratory 
context in which the attributes of correspond- 
ing real world entities are assumed to have 
remained stable over time, and (4) to employ 
these mathematical policy descriptions to 
predict a second set of data consisting of the 
judges’ evaluations of corresponding real 
world stimulus entities. 8 
The real world situation was a problem in 
recreational planning in which experts are 
called upon to recommend trout streams for 
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fishermen aí aluation of many relevant 
sources of in tion. 

This stud; refore, contributes to both 
the subarea man information processing 
known as ex ‘ecision making and to the 
application oí rmation-processing method- 
ology to the diction of behavior in the 
field. 

METHOD 
Subjects 

The subjects 10 employees of the three trout 
hatcheries ope: by the State of Iowa. Selection 
was based upo lingness to participate coincident 


with being a t 
varied, subject 1 
experience in bi 
Called upon 
mend streams 
trips, subjects 


fisherman. Although experiences 
a minimum of five years of 
ork and trout fishing. 
by the general public to recom- 
r their management for fishing 
quired to put together informa- 
tion, experienc: about the set of possible desti- 
nations (stream nd then to give some measure of 
their recomm ion. These subjects maintain, 
administer, an work at the hatcheries; they 
constantly mo: and manipulate a number of 
Variables imp to stream quality; and they 
conduct period ensuses of the trout population. 
isions regar stocking, stream improvement, 
tte. are based or ‘his information. The subjects also 
fish these strea and thus know them recrea- 


tionally and pro‘-ssionally. 
Task 
The judgment is the recommendation of 


trout streams ternative destinations for one- 


he subjects are hatchery person- 
don daily by the public to make 
Such judgments, >y are experts at making these 
commendations and are also expert trout fisher- 
men. Hence, bo eir evaluation functions and 

e stimuli to be judged (trout streams) may be 
considered to be stable. This task, therefore, is 
&sumed to hold both components of the evaluation 
Process constant, 


Experimental Procedure 


ae Experimental paradigm parallels that of func- 
is umdeasurement (Anderson, 1970) because it 
25 Upon a factorial design, a numerical response 
l tial v. and a within-subjects format. The ana- 
Pee tod, because of its reliance upon multiple 
ment “on, is more in accord with that of judg- 
Policy capturing (Hammond, Hursch, & Todd, 
1972) Hoffman, 1960, 1968; Stewart & Gelberd, 


p experimental design derives logically from an 

5 ee that judgments about entities (in this 

Subjects trout streams) can be described as if the 

attribus Weighed and valued each of a set of relevant 

ae es (stimuli) of the entities in turn. Thus, one 

sey et a set of relevant attributes and then 
em in combinations to be judged. 


i ds Ener) * * a 
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Phase 1 


A list of stimuli was derived from conversations 
with the hatchery personnel and other trout fisher- 
men; trout fishing literature was also consulted. 
Out of a large number of potentially relevant stim- 
uli,? travel time and number of trout were selected 
and varied in a 5 X § factorial design (travel time: 
15, 60, 105, 150, 195 minutes; number of trout: 
2, 14, 26, 38, 50 trout).3 

The response scale was a 150-millimeter continu- 
ous line scale labeled “Recommendation to Fish.” 
Both end anchors and filler items were employed to 
provide a range of values within which the subjects 
might couch their judgments and to avoid possible 
nonlinearities in scale usage. Verbal end labels were 
“Strong Recommendation Not to Fish” and “Strong 
Recommendation to Fish,” respectively. Subjects 
responded by making a slash mark at that point on 
the line which best described their recommendation 
of the stream described by the informational items. 
Slash marks were measured to the nearest 5 milli- 
meters by ruler, yielding a 30-category scale. 

Each stimulus combination was centered on a 
Separate one third of a sheet of 8} by 11 type- 
writer paper, with the response scale centered im- 
mediately below. Presentation order of the stimulus 
items was alternated and the combinations (25) 
were arrayed in a different random fashion in each 
of four replications; all replications were placed in 
a packet. The subjects were instructed to proceed 
through the packet as carefully as possible, and 
insofar as possible, not to refer any of their re- 
sponses to previous items. 

Subjects were instructed to imagine that they 
were in a new state in which they had never fished 
and were acquainted with a fellow trout fisherman 
at work. This fisherman offered to supply them 
with information about a number of quality streams 
in the new state that they could fish in the coming 
season. The friend asked the subjects to indicate 
how strongly he would recommend fishing or not 
fishing each stream during the forthcoming fishing 
season. 

Quality streams were defined as those which had 
minimally acceptable levels of competing fishermen, 
size of fish, water temperature, habitat, and en- 
vironmental aesthetics. To insure that the subjects 


2 Three variables initially appeared relevant: num- 
ber of trout, travel time, and length of fishable 
stream, Pilot experiments revealed that subjects 
discounted length of stream, and subsequent dis- 
cussions indicated that this variable, along with 
most others, could be held constant. 

Values of these variables were chosen from 
representative data supplied by the Iowa Conserva- 
tion Commission: 2 trout per quarter of a mile was 
the lowest count, while 48 was the highest in the 
spring-stocking surveys. Stream values for real 
streams range in between these figures, and there is 
considerable variance. The travel-time values were 
expressed in hours and minutes: for example, one 
hour and 45 minutes. 
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Ficure 1. Plot of the average rating (R:;.) for each stream as a fu 
of number of trout and travel time. 


realized that they had to assume these factors were 
Present in each stream, a list was drawn up of the 
things to assume about each stream. It was left 
with the subjects so that they could refer to it if 
need be during the course of the task. A session 
which averaged 45 to 60 minutes was held at cach 
hatchery. A total of 116 judgments was required 
[(25 Combinations x 4 Replications) + (4 Fillers X 
4 Replications)]. The four fillers represented items 
that were more extreme than those under experi- 
mental investigation. The first replication was treated 
as practice and was not analyzed. 


Phase 2 


In the second phase of the study, subjects sup- 
plied information about real Iowa trout streams, 
The purpose was to generate data to be predicted 
by the equations derived from Phase 1. The subjects 
first listed up to five of their favorite trout streams. 
The subjects then rated each favorite stream as they 
had in Phase 1. Following a stream rating, the 
subjects supplied a written reason for their rating, 
Finally, the subjects estimated the average number 
of trout per quarter of a mile and the average 
driving time from their residence to each real 
stream. The subjects were then requested to supply 
the same information for 10 streams drawn at ran- 
dom from the Iowa trout program. 


Depending upon whether one chooses to include 
experimentally stocked streams or just those part of 


the regular stocking schedule, there are between 
46-60 trout streams in Iowa. 
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105 150 195 


Analytical Procedure 


applied te 


Analysis of variance (ANOV 
are assumed 


the response data. Because t 
to have been collected on an 
son, 1972) and because they v 
outcome of a repeated measure 
ment, hypotheses about the m 
effects were tested. The ANOV 
for possible equation forms which 
data. ~ 405 
In addition, graphs of the marginal response me Jus 
plotted as a function of the expe al sirme 
values served to compliment the OVA resu a 
For the two-factor design in this study, each on 
corresponds to a single term in the ANOVA. a 
supplied visual information suggesting Ian 
istic graphical forms of algebraic expressions = 
derson, 1974; Shanteau & Anderson, 11 
yielded insight into interaction effects. Both a me 
and graphical data plots are common 1 
procedures of functional measurement (Anderson, 
1972). 
Multiple regression procedures were ann 
derive predictive equations for each individua a 
ject and the group data. These equations were a 
employed to test their ability to predict the Te 
sponses of the subjects in Phase 2. Multiple 51 85 
sion is commonly employed in Daly ores 15 
search and has considerable empirical support end 
Dawes, 1971; Goldberg, 1968; Stewart & Ge 
1972; Yntema & Torgerson, 1961). 


d interaction 
ed as a test 
would fit the 
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RESULTS 
A graph < esponse data from the fac- 
torial design re 1) suggests that the top 
four curve v a linear integration rule; 
the bottom rout) .curve deviates from 
this form. latter effect may indicate 


ts failed to distinguish this 
trout) from the filler value 
the end anchor value (0 
Th eraction effect (Trout X 


simply that 
extreme val 
(1 trout) : 
trout). 


Travel Tim: s significant (F = 2.44, df 
= 16/128, 1), but localized in the 2 
trout curve oval of this curve rendered 
the interact msignificant (F = .76, df = 
12/96, p > The single factor graphs are 
in Figure 2. e plots support a linear rela- 
tionship betw the objective and subjective 
values.“ 

The indiv subject graphs exhibit over- 
all similarity the group graph (Figure 1). 
All but one s ect appeared to follow a linear 


trend, howe, 
able noise. Aj 
tion clearly 


the data exhibited consider- 
ugh a linear predictive func- 
dd fit the group data, linear 


and log-linear iunctions were fit to the indi- 
vidual subject data for comparison. 

The results (Table 1) suggest that the 
linear models a:e good predictors of the mean 
judgmental responses of the individual sub- 


jects and the g 
and 8) deviate 
log forms fit bet 
these functions are theoretical models, be- 
cause they are not, they are predictive func- 
tions. The next logical question becomes, 
How good, in terms of some criterion, are 
these functions as predictors of Phase 2 data? 

Subjects’ estimates of the number of trout 
and travel time (Phase 2) were employed 
directly to predict their ratings of the real 
Steams (Phase 2). First, the estimates of 
trout and time were substituted in the multi- 
cession equations derived from Phase 1 
1 5 tain Predicted values for Phase 2 judg- 
: S. Correlations between the predicted and 

& obtained judgments were .37 for the group 
“quation (Table 1: linear group) and .47 for 
— 


p, although two subjects (5 
om this trend; that is, the 


5 
tiring mal trend analysis also supports this 
for the his, of course, is a necessary assumption 
aud Licht ear regression model, although Slovic 
Will do 1501 05 (1971) report that the linear model 
Nonoto; well, so long as each independent variable is 
À mically related to the dependent variable. 
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Fıcure 2. Graphs of single variable functions: The 
average recommendation (R.,; R..) for each value 
of the number of trout or travel time, holding 
values of the remaining stimulus constant. 


each individual subject’s equation (Table 1: 
Linear, S 1 to S 10). Thus, both equations 
were used to predict all of the single subject 
judgments (n = 68). The linear model was 
employed because the plotted group data 
(Figure 1) were clearly linear and because 8 
out of 10 single subjects were better fit by 
the linear equations. 

Second, the subject’s average estimates of 
trout and time (Phase 2) were employed to 
predict the average rating given to each real 
stream (Phase 2) by the subjects. Subject’s 
average estimates of trout and time were sub- 
stituted in the group level equations derived 


TABLE 1 
lixar axo Le UKA FonCrions 
Prr to Data 
Subject | e | a Ig (Trout) |B: ine) 
Lincar model 
Group * 10.41 20 —. 0 
81 si | 25.13 a —.10°* 
8114 2 18.78 26° — .06"* 
83 0 23.05 ry eg —.11°* 
S4 M 1L35 As’ —,08** 
ss 78 9.28 or —.07 
Só 19 13.03 31 —. 04. 
8 7 20 10.36 ao" n 
82 14 6.39 2'006 
So | $8 | 1846 | 25 | ar 
$10 58 28.36 or —.01° 
Log data model 
Group 87 1.93 S 21 
$1 A8 3.14 8 ˙˙ —.38** 
8 2 81 2.52 2 —.23˙⁰ 
84 79 3.16 err -45% 
84 8 89 1.00** —.37* 
183 82 1.29 90˙⁰ —.51 
5 6 73 1.36 12° —.23** 
8 7 89 1.25 04˙⁷ —.10°* 
$8 89 .23 40˙ 30˙˙ 
8 9 47 2.14 38˙ —.07 
S 10 49 3.31 05 —.03** 
Note. S = Subject. 
S. 
Si. 


from Phase 1 (Table 1: Linear, Group; Log- 
linear, Group) and these predicted judgments 
te correlated with the average stream judg- 
ments (n= 12). Interestingly, the log-linear 
equations predicted the average rating data 
better than the linear equation (R=.75 
compared to .67). This same result obtained 


equation employing the average trout and 
ume estimates as independent variables and 
th average rating for each of the 12 streams 
as the criterion (n= 12). The linear form 
correlated more poorly (.47) with the average 
ratings than did the log-linear form (.86). No 
en for this effect can be given at this 


Discusston 


‘Jorvan if — 


& Gelberd, 1972), simp! le linear re 
gression models account much of the < 
variance in the laborat: (Phase 1). 
The application of these í s to the field 
data must be viewed ly, however 


because there are a r of possible 
sources of error that are tly detecta. — 
ble. Among these are the ng: (a) The ` 
theory and models may (b) Some — 
important or relevant stin y have been 
omitted. (c) Subjects mas have in the 
real world as they do in 

Regarding the first poin if the true 
static model were nonlin: ear models 
have considerable empiri: wrt (Slovic 
& Lichtenstein, 1971; Ynt Torgerson, 
1961) for their predictive ty. Concern- | 
ing the second point, there ideed be an 
infinite number of potentia evant stim- 
uli important to various rs, but a 
most important scientific is, “How 
well will only a few do as p: rs?” Avail- 
able evidence (see, e.g., Slovic Lichtensteig, 
1971) suggests that less than © will usually 
be adequate. Data collected dase 2 sup- 
port this view. Using only t ibjects’ esti; 
mates of trout and time to ict subjects 
ratings of real streams, much the variance 
in average stream evaluations could be ac 
counted for. Therefore, while here may be 
other factors operative, these ‘wo were suf 
ficient to predict much of the data. Whether 
subjects behave in the lab as in ihe real world 
is a very important question in applied re- 


search. The recent presidential polls suggest 
that individuals will behave consistently with 
their stated intentions (at least in the short 
run). However, more research is needed t0 
provide more definitive answers as to bes 
validity and temporal stability of individuals 
intentions. 

Much research in geography, urban and 
regional planning, and spatial economics has 
been directed towards the question of a 
dicting the allocation of trips by individu 
over a number of different spatial alternatives 
(e.g., shopping trips). These studies Ca 
reviewed in Isard, 1959; Olsson, 1965; WI 
son, 1968, 1970) have consistently show? 
that a function of the following form 
describes these trip allocations between 


one 
Faas ints in 
or more individuals and one or more points 


~~ 


pace: 


ig My = k by log A, + 


by log Dy + ey, 111 


»bserved number of move- 

time span) between an in- 
-gregation of individuals) é 

is some objective measure 
ess of place j as a move- 
and Dy is a measure of the 
ug from point į to point j; 
e constants to be estimated 
nd éy is an error term. 


where My 

ments (over 
dividual (o0 
asd a place 
of the attra 
ment destin 
difficulty of 
CG, 015 and 

from the da 


This equ: of course, is of the same 
general for that derived from the re- 
gression of ze estimates of trout and 
travel time st average real stream rat- 
ings. Interes the group level log-linear 
equation of ame form derived from the 
factorial des Phase 1) also fit the average 
fatings in > 2 better than its linear 
counterpart. explanation can be given for 
this result, it may be some artifact of 
averaging, a: group equation in Phase 1 
(Figure 1) ed to be. Nonetheless, the 
results are cle consistent with a large body 
of research fi: ngs in geography and other 
spatial discipi cited previously. 

ONCLUSIONS 
This study 1:4; attempted to apply experi- 


mental and analytical methodology in human 
judgment behavior to a practical prediction 
Problem. The methodology appears to be 
generalizable to a wide range of social prob- 
dealing with human judgment behavior. 
Th particular, it provided insight into the 
‘valuation of alternative destinations which 
ae differentially located in space a problem 
of major interest to geography, urban and 
regional planning, spatial economics, and re- 
lated disciplines. 
Sreat many theoretical and technical 
Problems await resolution in this type of 
"search, Nonetheless, it is felt that present 
Methodology can be applied to a number of 
Practical problems; moreover, such research 
Serve as an additional data base for 
a reticians. The results of the practical pre- 
tion exercise are believed to be significant 
Ne they showed encouraging corre- 
ence with data they should have, in 
ty at least, been able to replicate. 


PeepreTine ReaL Opyect EVALUATION: Trout Steams 


REFERENCES 


Anderson, N. H. Functional measurement and 
psychophysical judgment. Piychological Review, 
1970, 77, 153-170, 3 


Anderson, N. H. Looking for in ciini- 
cal judgment. Psychological Bulletin, 1972, 76, 
93-102. 


Anderson, N. H. Algebraic models in perception. In 
E. C. Carterette & M. P. Friedman (Eds), 
Handbook oj perception (Vol. 2). New York: 
Academic Press, 1974. 

Dawes, R. M. A case study of graduate admissions: 
Application of three principles of human decision 
making. American Psychologist, 1971, 26, 180-188, 

Hammond, K. R., Hursch, C. J., & Todd, F. J. Anas 

the components of clinical inference. 
Psychological Review, 1964, 71, 438-456. 

Hoffman, P. J. The paramorphic representation of 
clinical judgment. Prychological Bulletin, 1960, 
$7, 116-131. 

Hoffman, P. J. Cue-consistency and configurality in 
human judgment. In B. Kleinmuntr (Ed.), Formal 
representation of human judgment, New York: 
Wiley, 1968. 

Hoffman, P. J., Slovic, P., & Rorer, L. G. An analy- 
sis of variance model for the assessment of con- 
figural cue utilization in clinical judgment. Psy- 
chological Bulletin, 1968, 69, 338-349. 

Isard, W. Methods of regional analysis. New York: 
M. I. T. Press, 1960. 

Olsson, G. Distance and human interaction: A bib- 
liography and a review. Bibliography Series No. 
2, Philadelphia Regional Science Association, 1965. 

Shanteau, J. C., & Anderson, N. H. Integration 
theory applied to judgments of the value of infor- 
mation. Journal of Experimental Psychology, 1972, 
92, 266-275. 

Slovic, P. Analyzing the expert judge: A descrip- 
tive study of a stockbroker’s decision processes. 
Journal of Applied Psychology, 53, 1969, 255-263. 

Slovic, P., & Lichtenstein, S. Comparison of Bay- 
esian and regression approaches to the study of 
information processing in judgment. Organisa- 
tional Behavior and Human Performance, 1971, 
6, 649-744. A 

Stewart, T. R., & Gelberd, L. Capturing judgment 
policies: A new approach for citizen participation 
in planning. In J. E. Rickert, C. F. Davis, v. 
DiMatteo, & A. Bose (Eds.), Urban and regional 
information systems: Information research for an 
urban society (Vol. I). Claremont, Calif.: Clare- 
mont College Printing Service, 1973. 

Wilson, A. G. Models in urban planning: A syn- 
optic review of recent literature. Urban Studies, 
5, 1968, 249-276. E 

Wilson, A. G. Entropy in urban and regional mod- 
eling. London: Pion, 1970. 2 

Yntema, D. B., & Torgerson, W. S. Man- computer 
cooperation in decisions requiring common sense 
IRE Transactions of the Professional Group on 
Human Factors in Electronics, 1961, HFE-2, : 
20-26. x 


(Received September 24, 1973) 


f 


University of New York, 


$ 
3 
3 
3 

N 

k 
: 4 


Journal of Applied Psychology 
1974, Vol. 59, No. 5, 575-552 


STATISTICAL PROCEDURES FOR EVALUATING TH! ( 
UTILITY OF A MODERATOR APPROACH TO PRI 


9 


PICAL 
PION? 


ALAN L. GROSS, JANE FAGGEN-STECKLER, AND KAREN Mc‘ y 
City University of New York 


Three statistical procedures for evaluating the practical utility of a n 
approach were described. Each procedure gives tests of significance for t 
parison of three prediction strategies: (a) using the moderator variable as 
for subgrouping subjects and computing separate regression equations; 
the moderator variable as a predictor variable; or (c) excluding the modera: 
able from the prediction model. It was concluded that when job perforn 
measured along a continuum, the average y score of the selected group (Pr 
1) should be considered. However, when two levels of job performance are ms 
for example, pass-fail grades, then either the number of selection errors (Pro 
2) or the number of selection errors when a selection ratio constraint is | 


(Procedure 3) should be used. 


The purpose of this study is to describe 
statistical procedures that can aid a practitioner 
in evaluating the practical utility of a moder- 
ator approach to prediction. By the expression 
“moderator approach to prediction” we mean 
using some variable (moderator) to subgroup 
a sample of subjects into g > 2 groups and 
then computing separate regression equations 
within each group. 

The prediction problem commonly facing a 
practitioner such as a personnel officer is that 
of selecting job applicants. In this context 
the moderator approach has practical utility 
only if it enables the practitioner to select 
applicants who are superior in some sense to 
those who would have been selected had the 
moderator approach not been used. More 
Specifically, the moderator approach must be 
compared with two simpler prediction strate- 
gies: (a) predicting job performance from a 
single regression equation consisting of p 
predictor variables (ri, x2, ++, x) excluding 


_ the moderator (Strategy 1); (b) predicting job 


performance from a single regression equation 
where the moderator is simply included as the 
$ + Ist predictor variable (Strategy 2). Strate- 
gies 1 and 2 clearly require the expenditure of 
substantially less time and effort on the part of 


A N Was supported by a grant from the 
esearch Foundation of the City University of 

ö ity University of New 
* Requests for reprints should be sent to Alan L. 

Gross, Graduate School and University Center, City 


33 West 42nd Street 
York, New York 10036. ee 


the practitioner than do: moderator 


approach (Strategy 3). Cons tly, if either 
Strategy 1 or Strategy 2 allo ne to select 
a group of job applicants w re equal or 
superior in some sense to th up selected 


by the moderator approach, ‘here is little 


justification for subgroupin ibjects and 
computing separate regression equations. 

Let us now consider exactly how the prac 
titioner can assess the superiority, if any, of 


the moder- 
tegies 1 and 
gested by 
e statistical 
zle a prac- 


the group of applicants selecte: 
ator approach as compared to Stra 
2. Following the guidelines 
Abrahams and Alf (1972), th 
procedures are described which € 
titioner to evaluate the practic lity of the 
moderator approach to prediction. In Pro- 
cedure 1, the three prediction strategies are 
compared in terms of the average performance 
of the selected group. In Procedure 2, the three 
prediction strategies are compared in terms of 
the number of selection errors. In Procedure 
3, the prediction strategies are again compared 
in terms of the number of selection errors, but 
with the additional constraint that only 4 
fixed number of applicants can be accepted. 
For each of the three procedures, tests © 
significance are given for the comparison of the 
three prediction strategies. 


PROCEDURE 1 


Assume the investigator is interested 1 
developing a selection model that will lead 
the selection of that group of applicants having 
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the highest se job performance (subject 
to the selec tio constraint that only u, 
applicants < selected). The investigator 
has availab i for a group of employees 
on p> ip r variables (x1, a2, . Xp), 
a potentia rator variable (m), and a 
measure of formance (y). The predictor 
and moder: cores for these employees 
should be ed when these people are 
applying f loyment and, furthermore, 
should not sed as a basis for selection. 
The sample idomly divided into a vali- 
dation samy d a cross-validation sample. 
In the valid sample, three different pre- 
diction stra s for predicting y are con- 
structed : 


Strategy 1 igle least squares regression 
tion predicting y from the xs 
mputed. 
ingle least squares regression 
ion predicting y from the x 
ables and the potential moder- 
sior variable m is computed. 
Strategy 3—i sing the potential moderator 
vaviable, the validation sample is 
rouped in some manner into 
groups and separate regres- 
equations for predicting y 
the as are computed within 
each group. 


In the cro: 
Tate predicte 


Strategy 2 


lidation sample, three sepa- 
` scores are computed for each 
Subject by applying the validation sample 
Tegression weights computed for Strategies 1, 
2, and 3 to the cross-validation data. The 
Ctoss-validation sample predicted y scores for 
trategies 1, 2, and 3, respectively are given as 


de = bo + 141 * F Dez [1] 
Yon = Co arip --. + Cyt p + Cpp, [2] 


4 g 
n= X dh, [3] 
J=1 


oe d; is one if a subject is a member of 

755 J as defined by the moderator variable, 

score „ S Zero otherwise, f; is the predicted y 

“ore for subjects in Group J. 

wald Predicted y score (9,) in the cross- 

Gin ‘tion sample for the subgrouping strategy 
tegy 3) can best be understood in terms 


of 
an example. Suppose g = 2 groups have been 


r 
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formed where Group 1 (d; = 1, d; = 0) con- 
sists of all subjects for whom m is less than 
some value m', and Group 2 (di = 0, d: = 1) 
consists of the remaining subjects. The pre- 
dicted y score for a cross-validation subject 
for whom m < me is obtained by substituting 
his/her x scores into the validation sample re- 
gression equation for Group 1. In a similar 
manner the predicted y score for Group 2 
members is computed. 

When there is a positive correlation between 
yand predicted yin the cross-validation sample, 
the optimal selection strategy consists of order- 
ing all cross-validation subjects from high to 
low on the predicted y and then selecting the 
n, subjects having the highest predicted y 
scores (Cochran, 1951). This strategy is 
optimal in the sense of leading the practitioner 
to select the group of u, subjects having the 
highest average y score. , 

Using Strategy 1, subjects are ordered on 
Ves and J, (average y score for the u, subjects 
having the highest J. scores) is computed. 
Under Strategy 2 subjects are ordered on Hen, 
and Fem (average y score for the u, subjects 
having the highest zm scores) is computed. 
Similarly, using the moderator approach 
(Strategy 3), subjects are ordered on V, and 
Js (average y score for the n, subjects having 
the highest s scores) is computed. To evaluate 
the moderator approach the practitioner then 
compares Js with both Ym and J+. To justify 
subgrouping, it must be shown that J. not 
only exceeds jj» but also exceeds J. In other 
words, we must demonstrate that the average 
performance of the selected subjects is highest 
when m is used to subgroup subjects (Strategy 
3) rather than used as a predictor variable 
(Strategy 2) or simply not used at all (Strategy 
1). Since the potential for sampling error is 
greater for a p + 1 variable regression equation 
than a p variable regression equation, it is 
possible to find in the cross-validation sample 
that J exceeds Jem but does not exceed Jr. 
In this case it is best to predict y from the » _ 
variables alone and disregard the moderator 
completely. If J, does not exceed Jan, but Jem 
exceeds ¥,, the moderator has practical value 
only as a predictor variable. 

In comparing the three means, Yz, Yom, and 
Ys, it is useful to have a method for testing the 
statistical significance of the observed differ- 
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ences. One major difficulty in deriving a 
Significance test is the fact that the m, y scores 
‘used in computing any one of the means are 
not independent of one another. This de- 
pendence follows from the direct ordering of 
_ the predicted y scores and the consequent in- 
direct ordering of the y scores (Watterson, 
1959). If the y scores are independent, the 
means of the selected groups may be compared 
using a nonparametric technique such as the 
~ Wilcoxin test (Hays, 1973). However, because 
of the dependence, this test is only approximate 
when used to compare J., Jem, and , 


PROCEDURE 2 


In many problems an investigator is inter- 
_ ested in developing a selection procedure that 
will minimize the probability or proportion of 
election errors [P(SE)]. A selection error 
can occur in one of two ways: (a) selecting an 
individual whose performance (y) falls below 
Some minimal satisfactory level (yori); or (b) 
not selecting an individual whose perform- 
ince would have equaled or exceeded yeri 
had he/she been selected. The proportion or 
Probability of selection errors, P(SE), provides 
a meaningful index for comparing the sub- 
grouping strategy (Strategy 3) with Strategies 
1 and 2. 
As in Procedure 1, suppose the three different 
regression equations (Strategies 1, 2, and 3) are 
"computed in the validation sample and each 
applied to the cross-validation sample. Let us 
t consider the proportion of selection errors 
the cross-validation sample that are incurred 
by selecting subjects on the basis of their 7, 
Scores. Assuming that y and 9, are variables 
that are normally distributed in the cross- 
validation sample, one minimizes P(SE) if 
ill subjects are selected whose 9, scores exceed 
the cut score, k(x), where 


e (it = 
0 „ 4000 


where sf, is standard deviation of predicted y 
the cross-validation sample; s, is standard 
deviation of y in the cross-validation sample; 
mean y score in the cross-validation sample; 
is mean predicted y score in the cross- 
idation sample; r(y,9,) is correlation be- 


„ 4d 
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tween y and predicted y in th 
sample. 
The derivation of this for: 
ized form is given by Alf and 
To compute the actual pr 
tion errors, resulting from 
P(SE/9,), that is, predictio: 
investigator counts the n 
validation subjects for who 
¥< Yaw Or Je < k(x) anc 
divides this number by the 
subjects in the cross-validati: 
The proportion of selectio: 
in the cross-validation samj 
the predictor, P(SE|9..), is 
same manner as P(SE|j. 
subjects are selected only) 
where the cut score, &(x,m), i 
Stituting into Equation 4 
formation on Jem, Li. e., Sem, | 
In the subgrouping strateg; 
proportion of selection errors 
separately minimizing the P(S 
each subgroup defined by | 
Subjects in Subgroup j are < 
predicted y score (/) exceed: 
ky, given by 


. j= 12, 5 


bie . (erit = ui) 
J= 
* j 


where sf; is the standard deviation for 

predicted , in Group j; sy, is the sta 
deviation of y in Group j; J; is the met à 
score in Group j; 9; is the mean f; scored 
Group j; r; is the correlation between A 


of selection errors incurred by subgroupim 
P(SE|9,), is computed by counting the m 
ber of subjects in Subgroup j (j = 1, 2, 
for whom ĝ; > k; and y < Yerit or U < kj i 
Y > yerit. If we denote this number as Mj th 


subgrouping strategy is given by 
g 
P(SE |9.) = Mynovs 
= 


where noys is the total number of subjects 
the cross-validation sample. 


The subgr trategy (Strategy 3) is of 
ical val if (SE.) < P(SE| fin) 
id E . SE V%). In other words, it 
must be dem ed that the proportion of 
gabjects “m ed” using the moderator 
to subgroup s is less than the proportion 
misclassified! ither the strategy where the 
moderator is tor variable or the strategy 
where the tor is not used at all. 
li CE SE J.) and P(SE|fs=) 
< P(SE|4 moderator is of practical 
value only sed as a predictor variable 
(Strategy 2 it is demonstrated that 
PISE|¥.) < Ñ.) and that P(SE|9,) 
ISE V. should predict y only from 
the x variab trategy 1). 

Testing fi significant differences be- 
tween P(SE (SE ĝm), and (SE ,.) is 
simply the em of testing the difference 
between corre!icd proportions. A simple chi- 
Square test d bed by Hays (1973) is appli- 
table to this em. 


"ROCEDURE 3 


In some selc< tion problems, one cannot apply 
Procedure 2, since the use of cut scores given 
by Equations i and 5 may lead the investigator 
to select more applicants than there are avail- 


sble “openings.” The investigator desires to 
minimize the proportion of selection errors, 
but at the same time can only select a certain 
Sumber (.) or proportion (a) of the appli- 
‘ants, Thus the cut scores given by Procedure 
2 must be moditied. 
Using Strategy 1, the u. subjects having the 
highest f, scores are selected, and (SE 
8 Computed. For Strategy 2 the m, subjects 
ving the highest han scores are selected, and 
PISE] im) is computed. 

In the subgrouping strategy (Strategy 3), if 
J and 9; are variables that are normally dis- 
tributed in subgroup j (j= 1, 2, „ 8) 
p (SE|9,) is minimized subject to the selection 
fatio constraint if subjects in Subgroup j 
ate selected only if f; equals or exceeds hy. 

© £ cut scores, ky, ke, +++, kg, are computed 
Y iteratively solving the following equations: 


PI Pz P [7] 
g 
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— = 5 > % cs]. 


where the probability Py is given as 
P; = NE A (Yann = rAr) — 07), | 


where Z is a unit normal variable, y*n is 
the value of Yeri, standardized in Group J, 
r; is the correlation between y and , in Group 
j, kë is the value of k;, standardized in Group 
j, q is the proportional size of the jth sub 
group in the cross-validation sample relative 
to the entire cross-validation sample, a is 
the proportion of individuals in the cross 
validation sample to be selected, 
The derivation of Equations 7 and 8 is 
given by Gross and Su (1973), i 
To better understand the computation of the 
cut scores given by Equations 7 and 8, con- 
sider an example where there are g = 2 sub- < 
groups in the cross-validation sample. In sub 
group j (j = 1 or j= 2) the mean y score” 
(Ñ), the mean predicted y score (5%, the stan- 
dard deviation of y (sy;), the standard devia: 
tion of predicted y (s$;) and the correlation 
between y and predicted y (r can be computed. 
If there are n; subjects in the jth group, 9 
is computed by dividing n; by the total num- 
ber of subjects in the cross-validation sample. 
To standardize yer in group j (Yent), sub- 
tract 75 from Verl and divide by SY}. The 
standardized value of k; (k;*) is similarly ; 
obtained by subtracting 9; from &, and then 
dividing by sĝ;. To find kı* and & a table of 
unit normal probabilities is consulted and 
values are selected that make P, equal to :? 
(ie, satisfy Equation 7) and also yield a 
selection ratio of a, (i. e., satisfy Equation 8). 
Once the standardized values for the cut scores 
(er, Ez“) are found, the raw score values 
(kı, Rs) are readily computed. As in Procedure 
2, the values for P(SE|}:), P(SE F,) and 
(SE J) are then compared using the chi- 
square test for correlated proportions. 4 


DISCUSSION 


Any one of the three procedures described in 
the previous section can readily be used by a 
practitioner in determining the practica 
utility of the moderator approach. The type 
of selection problem facing the practitioner — 
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UTILIZATION OF MANPOWER: 
OPMENT AND EVALUATION OF AN ASSESSMENT- 
SIFICATION MODEL FOR MATCHING INDIVIDUALS 

WITH JOBS 


LYLE F. SCHOENFELDT* 
University of Georgia 


for the assessment of individual characteristics, the identification of 
gical requirements of jobs, and the classification of applicants to job 
ities was developed. The validity of the model was investigated with 
sample of students (N = 1,934) working toward a college degree. 
ps, formed on the basis of previous behavioral data collected during 
man year, differed with respect to criteria (arts-sciences vs. applied 
zrade point average, etc.) measurements taken four years later. More 
tly, the subgroups differed with respect to the curricular paths 
luring college. The results indicated that it was possible to diferenti- 
ple in meaningful ways, to identify “job families,” and to match 


vith jobs. 


is been the emergence of a 
growing chorus of behav- 
| the public that it should 

better job of predicting 
\ complicating factor for 
¿tional psychologists is the 
er (1972) that the world 

going profound and rapid 
The dual dilemma of (a) the ceiling 

predict behavior and (b) 
ange is receiving increasing 

varied publications (Owens, 
and contexts, for example, clinical 


ita t psychology (Vale & Vale, 1969), the 
Rational setting (Novick & Jackson, 1970), 
of course, the industrial setting (Bar- 
51972; Campbell, Dunnette, Lawler, & 
tk, 1970). 

est in this dilemma represents a grow- 
atisfaction with the state of the art; 
atisfaction which culminated in the for- 
Hon of a model directed toward (a) the 
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assessment of individual characteristics, (b) 
the identification of the psychological require- 
ments of jobs, and (c) the classification of 
manpower to available opportunities. The 
purpose of the present research was to review 
the evidence relative to the assessment-clas- 
sification model and to evaluate the potential 
gains offered by this approach in the context 
of manpower utilization. 

As far back as 1948, Toops (1948) dis- 
cussed the value of analyzing relationships in 
terms of homogeneous groups. Knowledge of 
an individual’s group membership enabled the 
psychologist to predict future behavior with 
increased facility (Toops, 1959). Toops’s line 
of reasoning has become the intellectual ante- 
cedent for a class of systems prediction mod- 
els. The approach of systems models in 
contrast to that of statistical models (exem- 
plified by the work of Ghiselli, 1956, 1960 
Lee, 1961) is typically more complex and 


nebulous, or at least subject to varying in- 


terpretation by differing researchers. 

One line of descent from the Toops (1948) 
conception has been the person—process—prod- 
uct models by Dunnette (1963), Campbell 
et al. (1970), and Cirino-Gerena (1972) em- 
phasizing the individuals configuration of 
differences and the impact this has on job 
effectiveness. The models put forth by each of 
these researchers view the product as a result 
of the configuration, that is, a person inter- 
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an A 


acting with various task demands or organi- 
zational forces. 

A second line of descent from Toops was 
offered by Owens (1968). His developmental- 
integrative model was based on the premise 
that past behavior is the best predictor of 
future behavior. As with the other schemes, 
the point of entry was a configuration of indi- 
vidual differences, except in his model the 
quantification of these differences was heavily 
laced with the measurement of antecedent or 
life-experience characteristics. Prediction was 
to be accomplished by forming subgroups with 
respect to previous experiences and relating 
the subgroups to criteria of interest. Sub- 
groups would be expected to display differen- 
tial profiles across the criterion behaviors. It 
must be added, however, that the individual 
was not lost in this conception. Owens (1971) 
had fully detailed the implications of his ap- 
proach for individual assessment. 

The assessment—classification model, shown 
in Figure 1, follows from the other systems 
approaches. It incorporates the evaluation of 
person—process—product models with the sub- 
group conception of Owens (1968). It in- 
volves the assessment of individuals, measure- 
ment of jobs, and the prediction of job suc- 
cess. 

Individual assessment or the inventory of 
the psychological capabilities the individual 


INDIVIDUAL ASSESSMENT 


job family. 


Individuals Life History 


MATCHING INDIVIDUALS WITH JOBS 


Establishment 


Discriminant analysis to 
determine the probability of 
success and satisfaction in FM 


given that eis a member of SL 


Use of the Model . 


New individuals are classified to the 


life history group (SD. they most closely 
resemble, and are compared to each 


Employment recommen— 
dations are for the job(s) where the 
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brings to the job market ) aspects. The 
first involves using stan: edictors found 
to be valid for the jobs tion, the indi- 
vidual differences varial he Campbell 
et al. (1970) model. T nd aspect in- 
volves implementation o approach de- 
scribed by Owens (1968) ig the forma- 
tion of subgroups with o the major 
dimensions of anteceden r and relat- 
ing the subgroups to re riteria. This 


would inyolve administe 
questionnaire to assess th 
iors. The individuals woul 


viographical 
dent behav- 
be classiñed 


on the basis of their respo o life history 
subgroups or subsets hony tous with re- 
spect to important dimensi: life behavior. 

After lengthy neglect, ji lysis is seen 


ure of work 
Several in- 


as a way of understanding 
behavior (Prien & Ronan, 


struments (Cunningham, Floyd, & 
Bates, 1970; McCormick inneret, & 
Mecham, 1969) and operati procedures 
(Fleishman, 1967) have be: veloped and 
found to be of use for me g jobs in 
terms of the psychological ads required 
for successful performance. Thus. in the same 


subgroups 


way individuals are placed U 
ast behavior, 


homogeneous with respect t 
jobs can be classified into fa } 
ous with respect to their psy¢ 


JOB STRUCTURE 


f th 


Probability of success and satisfaction Job 
Subgroups would be maximal, Families 


Ass! 
aulysis of the y a technique such as the 
Position Analy suestionnaire (McCormick 
e al, 1969) he classification of jobs 
based on the r: es to the analysis into 
job families ho 1eous with respect to im- 
portant activit and hence job require- 
ments). The nent-classification model 


the use of a maximiza- 
as discriminant analysis) 
obability of success and 
rticular job family given 
s a member of a particu- 
group. The goal of the 
iction of job success and 
xew individual whose sub- 
who would be performing 
ne of the job families. 
at the assessment-classifi- 
prediction is feasible came 
ackson (1970). These au- 
at the logical approach is 
i. Thus, although the usual 
res (linear regression) have, 
ite effectiveness, the overall 
ve increased if recognizable 
subgroups, devi from the average in pre- 
dicted success, c-n be removed. In contrast 
0 statistical models for improved prediction, 
which have resulted in a mixed picture of 
successes and failures (Zedeck, 1971), the 
assessment-cla: tion model has never been 
sven a full and complete evaluation. 
This article describes a research effort to 
vestigate the usefulness of the assessment- 
classification model, The evidence, hopefully, 
wil Point to a successful generation of new 
predictive procedures and increased under- 
Standing of job related behavior. Specifically, 
this article develops an assessment—classifica- 
tion model in a college situation and explores 
the stability and validity of that model. 
e of the systems models have been con- 
3 ed with prediction of work performance, 
pally Work at the managerial level. How- 
ever, It seems appropriate to use college stu- 
Mts in a study of the relationship of human 
ifferences to job effectiveness. College stu- 
“its have much in common with job incum- 
a In each case (college or job) the indi- 
10 a brings with him (or her) a configura- 
N of differences. The course (as well as the 


is then develop: 
tion procedure | 
to determine 
satisfaction in 
that the individ 
lar life histor: 
model is the 
satisfaction fo 
group is known 
a job belonging 
Some eviden 
tation approach 
from Novick a 
thors conclude: 
a Bayesian m 
predictive proc 
at least, intern. 
effectiveness ca 
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subelements within the course) is equivalent 
to task demands—demands which result in 
behavior that is inevitably evaluated as to its 
effectiveness. The educational processes occur 
within the context of an organizational cli- 
mate which offers constraints of various types. 
The college reward structure has many levels, 
several facets at each level, and if nothing 
else, is well imbued on the individuals. In the 
college setting, as in an organization, there is 
feedback, both immediate and delayed, as to 
the effectiveness of behavior. Finally, for both 
tasks and courses, training and development 
experiences are optional and may or may not 
occur. In addition to these similarities, the 
advantage of the educational setting of per- 
mitting a broader range of research studies 
than would otherwise be possible in the in- 
dustrial context more than offsets the assumed 
loss in generalizability of results to the world 
of work. 


METHOD 
Subjects 


The subjects were 1,037 males and 897 females, 
an 80% sample of the 1968 freshman class at the 
University of Georgia. The study was longitudinal 
extending over a four-year period. 


Biographical Data 


The biographical data were collected during the 
first quarter in residence. A detailed discussion of 
the methodology regarding the development of the 
biographical inventory is available elsewhere (Schoen- 
feldt, 1970). The goal was to identify subsets of 
subjects with comparable patterns of prior experi- 
ence. A 389-item biographical inventory was con- 
structed such that each item was multiple choice, 
with the responses arranged to form a continuum. 
Scales were formed from the items by separate 
principal-components analyses and varimax rota- 
tions for the sex groups, which resulted in 13 (Table 
1) and 15 (Table 2) interpretable factors, respec- 
tively. 


Subgroups 


Each subject was profiled on the life-experience 
dimensions and each profile was then transformed 
into a matrix of intersubject profile similarities. 
Profiles were grouped into “families” using the pro- 
cedure developed by Ward and Hook (1963). One 
deficiency in the grouping procedure has been that 
once assigned to a group, the individual remains in 
that group, thus the assignment of individuals to 
subsets is usually less than optimal at the conclu- 
sion of the grouping (Ward, 1963). A two-part 
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TABLE 1 


DESCRIPTION OF MALE BIOGRAPHICAL Factors 
a ————— —-— 


Factor 


1. Warmth of Parental Relationship 
Had a close, warm relationship with both parents; parents were very likely to gi 
affection, praise and attention; wanted to imitate and be like the father. 

2. Academic Achievement 
High standing in high school class; often on the semester honor roll; very comp 
and successful in academic situations. 

3. Social Introversion 
Compared to others in high school: fewer casual friends, fewer dates, less popula: 
more self-conscious and ineffective in meeting demands of social situations. 

4. Athletic Interest 
Very active in athletic activities; often engaged in team sports; enjoyed physical 
education courses; rated past performance in physical activities very high. 

5. Intellectualism 
Regularly read literary, business, or scientific magazines; watched educational and 
cultural TV shows. 

6. Verbal Aggressiveness 
Enjoyed discussion courses, and tended to try to make others see their point of 
view; questioned teachers on subject matter a lot; were often regarded as radical 
or unconventional; often wanted to be alone to pursue own thoughts and interests. 

7. Socioeconomic Status 
High parental educational level, average family income, and father’s occupational 
level is high. 

8. Parental Control vs. Freedom (bipolar) 
Parents were more strict, critical, or punitive; anger was more often shown by or 
at parents; were allowed much less freedom or independence. 

9. Positive Adjustment Response Bias 
Rarely wished to become more socially acceptable; less often chose parents or friends 


as someone to “take things out” on; less often felt downcast, dejected or selfconscious 


not typical to daydream. 
10. Scientific Interest 


Enjoyed science and lab courses, and found them relatively easy; worked with scientific 


apparatus and equipment, often outside of any required school assignment. 
11. Positive Academic Attitude 
Liked school and teachers very much; teachers were more successful in arousing 
academic interests; enjoyed specific courses more; did more hours of homework. 
12. Religious Activity 
Very active in church, religious or charitable organizations; compared to others of 
the same age, more often went to church, and had stronger religious belief, 
13. Sibling Friction 
More often argued or fought with siblings; had more younger brothers and sisters, 
close to their own age; more friction and feelings of competition. 


90 


97 


93 


94 


96 


94 


94 


94 


91 


96 


6 


91 


93 


N, e. Biographical factors are listed in descending order of variance explained. 


ot 
Multiple correlation using 118 biographical items (short form) to predict each factor score. 


e unbiased estimate of the cross-validated population correlation, which is the multiple correlation obtained by app 


lying 


the sample weights to the entire population (Claudy, 1970). 


procedure was developed to evaluate the fit of each 
individual to the assigned group and remove him 
from the structure if he failed to fit any group well 
(isolate), or fit more than one group about equally 
(overlap) (Schoenfeld, 1970). 

Using this Procedure, 23 male and 15 female sub- 
groups were formed. Approximately 75% of the 
total sample was allocated to one of the groups 
with the remaining subjects either not fitting any 


of the groups or, alternately, matching two or more 
groups. 


As might be expected, differences between E 
groups on the biographical factors were hig ch 
Significant, thus allowing the characterization of 1 
group in terms of the dimensions that distinguis E 
it from all or most of the remaining groups: 0 
addition, each subject had completed a battery a 
standard and experimental tests when originally 159 
veyed in 1968, These measures have been desci 8 
by Owens (1971), and included, among other 2101 
the Scholastic Aptitude Test (SAT), the St 
Vocational Interest Blank (SVIB), the Californi 
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Sale, the Inver of Social Attitudes (tough- and hierarchical complexity), and the Purdue Values 
mindedness versi der-mindedness and conserva- Inventory (cognitive values, economic values, and 
tive versus radic he General Outlook Inven-  social-religious conformity). Finally, these subjects 


tory (integrative y, conceptual simplicity, had been involved in a number of field studies, such 


TABLE 2 
DESCRIPTION OF FEMALE BIOGRAPHICAL FACTORS 
Factor R pè 
1, Warmth of M | Relationship | %4 | 92 
In high sci close to the mother; mother more often provided emotional | 
support ar rest; more often discussed intimate and/or important matters with | 
mother. j 
2. Social Leade 95 94 
More often d to guide or direct others in group activities; participated in school 
politics; he re leadership positions in high school. 
3. Academic Aci ment 90 95 
High grade ry high standing in high school class; more competitive in academic 
situations + often expected to be successful in academic tasks. 
4. Socioeconomi tus 96 95 
High educa and occupational level of father high family income and social class. 
5. Parental Con vs. Freedom (bipolar) 93 91 
Parents w ore strict, and allowed less freedom or independence, and were more 
punitive | ing away privileges. 
6. Reading Int: 88 85 
Regularly r=: | national news magazines; did much more nonrequired reading; often 
watched TV ens programs and special reports. 
7, Athletic Pari ition 97 96 
Rated pas! erformance in physical activities very high, very active in athletic 
activities; often engaged in individual and team sports, enjoyed physical 
education cour 
8. Scientifc-Artis Interests P = — 
Enjoyed courses in the sciences, music, or art, more often worked with scientific i 
equipment or apparatus; physical and biological science subjects; active in dramatic, 
ar sic gr 
t or music groups. 88 84 


9. Conformity to lemale Role 
More often suffered “attacks of conscience” when they felt they had done wrong by the 
hed to become more socially 


standards of society, the church or parents; more often wis 
acceptable, cr better prepared as a responsible family member. 
10. Maladjustment a = 
Often typically felt downcast and dejected, felt so upset that they brooded DYER 5 
meaning of life; daydreamed to a greater extent; were more sensitive to criticism. 87 84 
Xpression of Negative Emotions , : : 
Very often openly expressed anger with a close friend; very often tried to Boua med 
Someone close hurt or upset them; more often wanted to “take things out” on friends. 
ocial Maturity X = 
More likely to give help or advice to friends wi 
freedom to work as they pleased in class projec 
opularity with the Opposite Sex 
11 often went on dates; started dating regularly 
Ounger age. 
ositive Academic Attitude 55 É. 
Teachers were more successful in arousing academic interes 
Participation and discussion; liked high school teachers to a gre 
15, 0 ate was adequate. 92 90 
ere very close to the father, and spent relatively more time with him, father provided 
emotional support, interest, and gave more attention. 


IL E. 


12 g 
th personal problems; enjoyed complete 
13. ts or tasks; were respected by classmates. 
P. 
and started going steady at 


14. p 
ts, and allowed much class 
ater degree; felt high 


Nove. Bi 7 
. Biographi N TuL 85 nce explained. 
Multi ic phical factors are listed in descending order of varia hiet each factor score. 


bpp atple correlati si i ical i short form) to predict eac! A p R i 
the e ne digte e ee ice e correlation, which is the multiple correlation obtained by applying 
le weights to the entire population (Claudy, 1970). 
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as leadership (Boardman, Calhoun, & Schiel, 1972) 
and over- versus underachievement (Klein, 1973). 
Tables 3 (males) and 4 (females) summarize the 
Subgroup differences in terms of these several sets 
of variables and research results. For example, Group 
1 of the males is distinguished by virtue of the fact 
that its members scored extremely low on the di- 
mensions of intellectualism, athletic interest, and 
verbal aggressiveness. Group 1 males were also dis- 
tinguished from the remaining 22 groups in that 
they scored high on the Music Teacher scale of the 
SVIB. Groups that were high on a particular life- 
experience dimension generally scored in the same 
direction on comparable tests from the supplemental 
battery, although there was not always room on 
the table to indicate this duplicative information. 


Stability 


A critical aspect of the logical flow concerns the 
information that the life-experience dimensions and 
Subgroup structures have been found to be stable in 
several important respects, In 1970, 1971, and 1972, 
over 6,000 additional subjects completed 118 of the 
biographical data items with the highest loadings on 
the factors. Using the estimated factor scores com- 
puted from the 118-item form (see Tables 1 and 2 
for the multiple correlations), each new subject 
was classified to the group he (or she) most re- 
sembled by means of using the discriminant-classifi- 
cation procedure described by Cooley and Lohnes 
(1971). 

Stability was assessed by comparing the alloca- 
tions of subjects to groups for the years subsequent 
to 1968, as well as by comparing the allocations of 
these years to the allocations of 1968. With regard to 
1970 through 1972, the difference in the percentage 
of subjects classified to each group exceeded 3% in 
only one of the 38 (23 male and 15 female) com- 
parisons, the exception being female Group 13 
which had 6% of the subjects in 1970-1971 and 
10% in 1972. For 1970-1972, 35% (versus 27% in 
1968) of the males and 32% (versus 26%) of the 
females were designated as overlaps or isolates. 
Other than these differences, only two of the 38 
groups formed in 1968 differed by more than 4% 
in 1970-1972; male Group 3 decreased by 6%, while 
female Group 10 increased by the same amount. In 
summary, the subgroup structure established with 
Subjects tested in 1968 was replicated during the 
three subsequent years. The subgroup structure was 
wholly applicable to other Samples surveyed at the 
same institution over a four-year period. 

Another finding has been that the life-experience 
dimensions were not specific to sex or institution 
(Anderson, 1973). This study confirmed that finding. 
Approximately 5,400 subjects representing three 
diverse universities completed a large number of 
biographical items, 248 of which were selected for 
analysis, including sex and institution which were 


coded as dummy variables. The item responses were 


factor analyzed as described previously, except that 
Sex and institution were forced to become the first 


> 
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two factors (Cooley & Lo! 71). There was ne 
meaningful institutional va In addition, it was 
possible to define a factor cluded all of the 
sex variance, and then de set of 16 addi- 
tional factors (similar to t scribed in Tables 
1 and 2) that cut across bi 8. 

The successful the assessment- 


applicati 
classification model hinges 
tion of stable, meaningful sı 
antecedent behavior but 
subgroups to job criteria, I. 
with groups formed on the 


on the forma- 
s on the basis of 
relevance of the 
oups must mesh 
job activities. 


Validating the Model 


The validation of subgrou; 
respect to three types of edu 
group differences were evalua‘ 
criteria. Second, course profile s 
and related to the life-experic 
analysis was concerned wit! 
the components of predicted ar 
that is, the SATs and high s 
ate the generalizability of res 
period. All educational criteria 


rences was with 
| criteria. First, 
five univariate 
ups were formed 
oups. The third 
differences on 
point average, 
ides, to evalu- 
er a two-year 
obtained from 


transcript records of the 1,93 bjects tested in 
1968. 

A count was made for each sul) of the number 
of courses (i.e., job activities) co) ted in each of 
the following 12 areas: (a) fine b) languages, 
(c) humanities, (d) biological sci , (e) physical 
sciences, (f) social sciences, (2 iculture, (h) 
business, (i) education, (j) home cconomics, (k) 
speech-journalism, and (l) p 1 education- 
ROTC. In the same manner 3 used to form 


subgroups 


life-experience subgroups, cour 5 
s combined) 


were derived for the 1,450 subj 


who had completed 90 or more hours over 
a period of 13 quarters (summer included) following 
matriculation. The result was 11 course profile sub- 
groups or job families including 6 of the sub- 


jects and corresponding to the 12 course areas except 
that the biological and physical sciences were com- 
bined. A twelfth group, fewer hours group, 7 
distinguished by the fact that they averaged 4 
fewer credit hours than the remainder of the sub- 
jects (although all completed more than 90 hours). 


RESULTS 


Criterion Differences 


The 23 male and 15 female life-experience 
subgroups were compared on five educational 
criteria. Chi-squares were computed compar- 
ing for each group the number who had (4) 
been on the dean’s list one or more times ver- 
sus the number on at no time; (5) been on 
academic probation and/or dismissal one or 
more times versus at no time, and (c) had 
one or more course withdrawals versus no 


Ass 


withdrawals.* 
were divided 
applied studies 
compare the s 
criterion. The 
average, and t! 
this variable by 


11 course major profiles 
arts and sciences versus 
a chi-square was used to 
ups on this dichotomous 
riterion was grade point 
ups were compared on 
ins Of a one-way analysis 


of variance (A A). 
The results presented in the center 
columns of Tab! (males) and 4 (females). 


The average pe 
F) are provide: 
Groups deviatir 
age value are r 
drawals for the 
on which the 
cantly.“ As can 
principal interr 
experience sub 
with respect to 
and grade poin 
teria, withdraw 
relatively indepe 
group members!“ 


Assessment—clas 


tages and chi-squares (or 
the base of each column. 
bstantially from the aver- 
as high or low. With- 
les was the only criterion 
ips did not differ signif- 
seen from the tables, the 
mships between the life- 
ps and the criteria were 
list awards, probations, 
erage. The remaining cri- 
and arts-sciences, were 
‘ent of life-experience sub- 


ication Relationships 


It was of interest to see how the antecedent 
behavior subgrouns related to the course pro- 
dle (or job family) subgroups. To answer 
biographical data by course 
s were formed, a 23 X 9 
(languages, education, and home 
fonomics were null sets) and a 15 X 10 for 
females (agriculture and physical education 
Were null sets). ‘The highlights of each row of 
these matrices a described in the right-most 
Columns of Tables 3 and 4. 

a chi-square assumption could not be 
Met, that is, more than 20% of the expected 
[——— 


8 
Among the students with greater than 90 hours, 
Sse students with one dean’s list, withdrawal, or, 


ti 
0 a lesser extent, probation were likely to have 


Se 


a than the number with one or more exag- 
et os the strength of the dependent relationship 
ween subgroup membership and the criterion. 
os resulted in restriction of the range of 
su 50 Scores, thus diminishing the strength of 
tent up-criterion relationships to an unknown 
„ Previous results (Schoenfeldt, 1970) showed 
hye Pous were not evenly distributed across 
dtom Ps, but that some groups had less than 10% 
uts after one year, while the rate was as high 


5 30% for others, 
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tral Using the total dean’s lists, for example, 
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values were less than five, two alternate pro- 
cedures were used. The first technique, devel- 
oped by Kundert (1973), in conjunction with 
a Fisher-Lancaster categorical scaling pro- 
gram, involved the calculation of a chi-square 
and, then the generation of a random matrix 
with the same marginals as in the original 
data matrix. For each observed matrix, a 
random data table was constructed and a 
chi-square computed. Thus, for each analysis 
an empirical sampling distribution was gen- 
erated and related to the observed results, 
One limitation was that the program could 
only handle a 10 x 10 matrix, necessitating 
the formation of 10 observed matrices from 
the original 23 x 9 and three from the 15 X 
10 such that each pair of groups were in- 
cluded in at least one of the analyses. For 
the males, six of the 10 chi-squares were sig- 
nificant at less than the .05 level, and eight 
at less than the .08 level. All three of the 
chi-squares calculated from the female data 
were significant at less than the .01 level. 

A second feature involved the assignment 
of dummy variable codes to the categories, 
which were then used for calculating a canon- 
ical correlation between the biographical data 
and course profile subgroup categories. These 
codes also served as input data for computing 
a discriminant function between the bio- 
graphical data subgroups (with course pro- 
files as criteria) and between the course pro- 
files (with biographical data groups as cri- 
teria). This information was then used for 
determining which rows or columns were ac- 
counting for differentiation between the 
groups in the data matrix. For the males, 
science and business contributed most to the 
linear combination of the course profile groups 
in the discrimination between the 23 bio- 
graphical data groups, while biographical data 
groups 3, 21, and 16 (in that order) con- 
tributed most of the differentiation between 
the course profile groups. The science and 
fewer hours groups contributed most to the 
female course profiles in the discrimination 
between the 15 biographical data groups, 
while biographical data groups 2, 7, and 12 
contributed (equally) most to the differentia- 
tion between the course profiles. 
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ortant than the statistical 
ict that the relationships 
nse. For example, male 


Perhaps mo 
evidence was 
make reasona 


Group 1 had h proportion of fine arts 
majors, and y haracterized as being low 
in athletic int low in verbal aggression, 
and high on t lusic Teacher scale of the 


SVIB. Male ( 
gression, low 
plexity and h 

of business ma 
features of ini 
this discussior 
dally the bas 


3 was low in verbal ag- 
> SATs, and low in com- 
extremely high proportion 
. Obviously, there are more 
t than can be detailed in 
omparing the tables, espe- 
tes, indicated the tremen- 


dous sex diffe s in the popularity of the 
majors, as we differences resulting from 
the number o! rse hours. 


Cross-validatic 


at remains concerns whether 
d among biographical data 
r will hold up in succeeding 
ted grade point average and 
high school grades and the 
used for this aspect of the 
ss-validation was studied by 
ık order of groups evolved 
d grade point average with 
groups on this same varia- 
e known 1968 groups and 
riables (high school grade 
the SAT scores) allowed 
criminant analyses, one for 


One questi: 
differences ob 
groups in one 
years. The pre: 
its component: 
SAT scores, w 
investigation. < 
comparing the 
in 1968 on predic 
tank of the 19 
ble. In additior 
Scores on three 
Point average pli 
‘or canonical dis 


It is not necessary to list predicted grade 
Point average means for the 1968 groups 
‘ince they are similar to the actual grade point 
Average means shown on Tables 3 and 4. The 
ank order correlations between predicted and 
actual grade point averages for the 1968 
a was .85 for the males and .84 for the 
“males, highly significant (p < .001) in both 

es, The rank-order correlations between 
mie Means on predicted grade point average 
ae and 1970 was .89 (p < .001) for the 
= and 3 (p < :01) for the females (both 
A cant), indicating that the criterion dif- 

“aces were maintained over the two-year 
Period, 

35 © results of the discriminant analyses 

Summarized and discussed in detail (in- 
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cluding plots of the 1968 and 1970 groups in 
the discriminant space) elsewhere (Schoen- 
feldt, 1970). Two discriminant functions ac- 
counted for 78% of the discriminating vari- 
ance for the males and 73% for the females. 
For each sex, the first function was a high 
school grade point average dimension and the 
second an SAT total dimension. The two func- 
tions resulted in plots with four quadrants 
with the origin representing average grade 
point average and SAT scores. Four of the 
male groups formed in 1970, Groups 5, 8, 9, 
and 11 were in different quadrants from the 
corresponding 1968 groups. For the females, 
all but one of the 1970 groups was in the same 
quadrant as the 1968 counterpart. Group 5 
was below the origin on the high school grade 
point average dimension in 1968 and above in 
1970. On the basis of these results it can be 
concluded that criterion relationships do cross- 
validate. 


Discussion 


These several validity studies indicate that 
the assessment-—classification model is viable. 
The subgroups formed to be homogeneous 
with respect to antecedent behavior differed 
on univariate criteria related to job success. 
These groups were also related to job family 
subgroups, that is, groups formed due to 
similarities with respect to the path walked 
in working toward a baccalaureate degree. 
Finally, the subgroup differences observed in 
1968 held up in a new sample surveyed in 
1970. 

What are the implications of this model on 
the profession and society? What is the role 
of the psychology of human utilization in the 
seventies and beyond? The assessment—clas- 
sification model might serve the major pertur- 
bations resulting from the governmental reg- 
ulations regarding validation. One trend the 
regulations seek to redress is the emphasis 
placed on an applicant's credentials, specifi- 
cally the number and character of his de- 
grees, rather than on his personal competence 
(Dailey, 1971). Indeed, a major thrust of 
the regulations is a shift away from the 
selection model that has dominated industrial 
psychology since its inception to a realization 
that most people can do many (if not most) 


S44 
of the jobs that need doing. Towards this 
end, the assessment-classification procedure 
should be in harmony with the regulations. It 
is a model aimed at matching individual 
tendencies (especially nonintellective) with 
the job requirements, and assumes both are 
competently measured. To the extent that 
there might be an excess of people to jobs, 
selection would be involved, although it is 
envisioned that application of the model 
might extend to many employers in a region, 
thus emphasizing classification. 

Another realization made explicit by the 
regulations is the fact that personnel policy 
involves more than the hire decision. The 
principal transition points into the world of 
work are upon completion of secondary school 
or some form of higher education. The model 
is directly applicable to these entry levels. 
Beyond this, the model can be used for per- 
sons already employed, as, for example, in 
conjunction with the assessment centers which 
have received much attention during the past 
Several years (Bray & Grant, 1966; Byham, 
1071). 
It is probably only a matter of time until 
the regulations will be extended to cover 
_ those experiences which qualify individuals 
for particular careers, such as technical train- 
ing, college, graduate school, etc. The assess- 
ments classification model is concerned with 
matching persons seeking opportunities, 
whether jobs, or other types of experiences, 
wich available openings. Utilization of the 
= model is not limited to traditional employ- 
ment settings. 

4 The complete list of benefits anticipated 
from the assessment- classification system is 
_ too lengthy to describe in detail. In summary, 
the model’s significance is twofold. First, the 
= model provides a new approach to the utiliza- 
tion of talent, an approach that promises to 
A improve the fit of individuals and jobs and at 
the same time be in concert with the major 
_ thrusts of the governmental regulations re- 
garding validation. Second, the model allows 
for the continuous collection of data of the 
Find needed to anticipate and study person- 
nel problems before they become critical. 
When applied to a large employer or region- 
ally, for example, it would Provide feedback 
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regarding shifts in ap characteristics 
or job requirements. 
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The authors have never heard of the term 
_ agricultural psychology. At the time that 
Miinsterberg’s (1913) book, considered to be 
the first dealing with industrial psychology, 
was published, about 30% of the United 
States work force were still employed in agri- 
culture (Taylor & Jones, 1964). However, 
_ farming appears to have largely missed out on 
the rise of scientific psychology. A recent 
review of Psychological Abstracts revealed 
‘only seven articles in American psychological 
Journals between 1945 and 1970 that dealt 
directly with any aspect of agriculture (Rich- 
ards, 1973). As the last half century has been 
characterized by a secular decline in the pro- 
Portion of farmers in the United States work 
force (Taylor & Jones, 1964), it is easy to 
understand why the United States studies of 
the psychology of work have ignored the 
farmer. However, although industrial psy- 
chology has thrived for so long without the 
farmer, the study of work could be advanced 
by broadening its scope to include a type of 
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he Kibbutz, 


Kibbutz communities with a mix of factory and farm jobs were su 
study the effects of industrialization on work and workers while 
for the effects of urbanization. Questionnaire responses of 476 work 
representative sample of factories were compared to those of 175 w 
location- matched agricultural branches in 27 kibbutz collective se: 
Factory workers rated their jobs significantly lower than did farm wc 
plant managers leadership, opportunities for self-realization on 
participation, control, peer relations, and information about the job. H 
factory jobs were rated cleaner, easier, and more mental. Factory ar 
workers were similar in outcome measures, including supervisory r=! 
performance, job satisfaction, mental health, and alienation. Implicat 
kibbutz and nonkibbutz work are discussed. 


work hitherto neglected. veloping na- 


tions, based solely on the of workers 
involved, agricultural psy: would be 
relevant to the lives of n ore workers 
than would industrial psy y. In addi- 
tion, cross-occupational con ns could be 
interesting and enlightening 1 regard to 
job content and employee atti es. 

In Israel, kibbutz ® communities have been 
undergoing rapid industriali-- *. At present 


many of these so-called “collective farms” 
have a mix of industry and agriculture in 
which industry dominates the (arm branches 
in total revenue, profits, number of workers; 
and in setting economic and social priorities 


3A kibbutz (plural kibbutzim) is a collective $t- 
tlement governed in principle by a general assembly 
of all members. Each kibbutz operates as 4 
economic unit and pays its members no wits, 
Food, clothing, housing, social services, and a nom 
inal discretionary allowance are distributed by cone 
munal institutions on an agreed, egalitarian bass 
A typical kibbutz has from 300 to 450 members 
1970 the kibbutz population numbered 84,900, 0 
whom 27,200 were employed adults aged 14 1 
above, that is, 3.9% of all employed adult meg 
the country (State of Israel, Central Bureau of 
tistics, 1971). Work is organized collectively; 
economic branch managers are elected by 1 
branch’s workers or by the general assembly 
all members. In 1970 there were 170 factories 
224 kibbutzim (Leviatan, 1973). For sadia 
readings about the kibbutz, see Darin-Dra 
(1963) and Rosner (1972). 
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Leviatan, 1973). Indeed, 
ent as a whole is ripe for 
hology. 
the resultant jump in 
ndustrialization is widely 
brought dehumanization, 
ind the like to the worker 
e same results are often 
ization. Since industriali- 
ition commonly go hand in 
3), it has been difficult to 
1e extent to which the oft 
condition of modern man in in- 
should be attributed to 
ni trialization, or to both, 
e industrializ ion of the kibbutz pro- 
fare opportunity to tease out empiri- 
effects of industrialization and ur- 
n. Kibbutzim are incorporated as 
lomous settlements and may: be regarded 
communities. At the same time, Ts- 
me of the most highly urbanized soci- 
i the world. Thus the partially indus- 
Kibbutz provides an opportunity to 
farm and factory work in small set- 
in an urbanized society. Whether 
onment of the working kibbutz 
should be regarded as rural or urban 
and not crucial to the present article, 
Macroenvironmental effects are 
lled by comparing farm and factory 
in the same kibbutzim. Thus, in the 
zim it is possible to assess the effects 
alization while controlling urbani- 
Income is controlled, since all mem- 
a given kibbutz, farmers and factory 
alike, share equally the fruits of their 


present study was part of a research 
on kibbutz industrialization under- 
iy the Center for Social Research on 
»butz. (Previous analyses of some of 

from part of the sample have been 
ted by Levitan, 1970). This study at- 
ted to examine the effects of industriali- 
on the worker while controlling urban- 
t was hypothesized that if industrial 
erves its ill-repute in comparison to 
work, then scores on measures of 
| well-being of workers in industry 
ntal health, job satisfaction, etc.) 
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would be lower than those of their agricul- 
tural counterparts, Further, if there are simi- 
lar scores on such psychological measures for 
farm and factory, then it may be concluded 
that industrial work per se does not bring ill 
effects upon workers. This study also com- 
pared the features of farm and factory work 
to gain some understanding of the conditions 
that may give rise to differences in psycho- 
logical outcomes. 


MeTHOD 
Populations and Samples 


A population of factories and a population of 
farm branches were sampkd. The industrial popu- 
lation was defined as all industrial plants that, fa 
1970, (@) were members of the Kibbutz Industrial 
Association (KIA), (6) were owned by individual 
kibbutzim (and not by interkibbutz conglomerates), 
(c) employed at least five workers, of whom no 
more than 30% were hired laborers (i.c., nonmember 
wage carners), “ and (d) had been established at 
least one year prior to the study. Of the 157 plants 
affiliated with the KIA in 1970, 97 met the other 
criteria for selection. Ten of these were barred from 
the sample, since they had participated in a some- 
what similar study only a year earlier (Rosner, 
1971; Tannenbaum, Kavcic, Rosner, Vianello, & 
Wiesner, 1974). A three-dimensional classifica- 
tion of the remaining 87 plants was created using 
information about their movement affiliation,’ size 
(small: 5-17 workers; medium: 18-30 workers; 
large: 31 or more workers), and technological level 
(mostly manual; semiautomated without feedback 
about deviation from standard operation; and semi- 
automated with feedback about deviation but with- 
out self-adjustment, as in Turner & Lawrence, 1968; 
fully automated plants did not exist in kibbutz in- 
dustry at the time of the study). The sampling 
design called for choosing one plant at random from 
each of the 27 cells in the resulting 3 X 3 X 3 table. 
The distribution of the 87 plants in the table was 
such that eight cells had zero entries. The original 
sampling design was therefore compromised by either 
randomly selecting a plant from the same movement 
and of the same size, or, if there was none, then 


plants with more than 30% hired labor were 
eliminated so that the kibbutz factory in its purer 
form, in correspondence with the normative model 
for kibbutz enterprises, would be studied. This re- 
striction of the population excluded that minority 
of the larger plants that employed most of the 
hired labor in the kibbutz industries. 

5 There are three major interkibbutz federations, 
known as movements. They differ in ideological 
coloring and in the kind and degree of normative 
organizational structure they advocate for their pro- 
ductive enterprises. 


aed application of technology to modern farm- 
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shown in Table 1 are based c T 
sposdents in both sampks t 
listed in Table 1 are ordered 4 


(1967) distinction between cau 
end-result’ variables. With the 


based on responses given by al * 

pleted the questionnaire. 
The five indices which were 

bles included the following. 
I. Plant manager's leadership hich 
bined items that assessed the to 
the plant or branch manag aware 
workers’ everyday job probk „ * 
technical and organizational! ir os, 
adaptability to change, and his 
general; 
2. Consideration or the extent cb man 
ment was really interested in kers" 
fare and improved workin: jons i 
when not forced to do so; 
3. Supervisory leadership w) s ase 
by worker ratings of how ír: v the: 
mediate supervisor asked the opinia 
how good he was in relating to bord 
how friendly and approachabk „ and Bi 
Success in coordinating his work s interest 
with those of other groups in int} x 
4. Opportunities for selj-realis« hich mes 
sured the extent to which the jo» :-ovided Op 
portunities for learning new t! using one 
own ideas, doing interesting w ising 0. 
abilities and knowledge, and unco 
tional work methods; 


S. Participation which gauged ș rs" fi 
regarding what part they took in ms 
portant decisions concerning th 
and general problems in the plan: 

in this index were workers’ 2 
actual authority of the work: 
kibbutz institution designed to 

for worker participation in managing 


The six indices considered intervening 
included the following. 


1. Control index or the extent to which the job 
gave the worker authority over other people 
the degree of responsibility the worker had 
the plant, and the degree of influence eae 
worker felt he had personally; Pe 
2. Peer relations index which was comprised 
items tapping group spirit and mutual aid, 
understanding of one’s job problems, pre 
Present team to any other, peer work f 
tion (Bowers & Seashore, 1966), and an ov 
rating of peer relations in the plant; 

3. Information index which was constructed 
averaging ratings of the extent to which 
worker was “in the know” in his plant cone 
ing the production plan, the investment P 
economic performance, and technical probi 
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o, and a question asking 
felt be knew how his job 
eration of the plant; 

e Communication which 
s of four items that mea- 
hich both the superior and 
ed and provided informs- 
performance; 

he degree of responsibility 
ork, for one's work group, 
ant; 

hich included measures of characteristics 
of better and faster work mantic 
a work beyond that re- tached 
it innovations on the job. 


red the end-results variables. 


were constructed from items The 
the four included: formed 


letter 
index which was comprised search 
verall satisfaction with the One 
of overall satisfaction with since 
ou do“; expansion 
health index which included at random 
each of three dimensions: from the 
nt, and self-esteem; arranged date and 
following Seeman’s (1959) sembly 
‘alization, was quantified by questionnaires. 
i items to measure each di- at 
powerlessness, meaningless- received 
vent, and normlessness. The questionnaires 
ned into a single index; jy 
ings (rather than self-report 
were used to assess work- istration 
Supervisors were presented dispersion 
- subordinates and were asked Therefore, 
‘cree 7-point scales in terms of naire 
rowledge, work quality, and all, 47 


taing 

„.. ee 
Physical MPa Se Mental 

Dirty 


H 
fi 


Dangerous 


Together with others 


nt upon others 1.88 
Independent of others . Depende: 


1 2 3 4 5 6 7 


Ficurr 1. Mean scores for industrial (I) and agricultural o workers on semantic differential 
scales describing dimensions of jobs ( p< .01). 
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Farm managers were harder than their industrial cantly fewer opportunities lf-realizathee 
| Counterparts to locate for the interview, resulting in (% S 05) and for participa P< DI) ia 
2 somewhat lower proportion of supervisory ratings decision making on their jo! 
C the spring of 1970. Consistent with the repo less partici. 
| pation in industry was the cantly (p< 
Resutts 01) lower feeling of contr: erienced by 
The comparison of factory and farm work- factory workers. The differ: n participa- 
ers’ scores on indices measuring causal, in- tion and control are the lar nes in Table 
; tervening, and end-result variables is shown in 1. An examination of the intervening 
t i lati and 
Table 1. variables revealed that pe lations 
e An analysis of the causal variables indi- information were also signi! y (p < 01) 
cated that agricultural branch managers were lower in industry. Vertical, nication be- 
_ rated significantly ( < .01) higher by their tween workers and their in te superiors 
= subordinates than were industrial plant man- was similar in agriculture ndustry. In- 
agers by theirs. Nevertheless there was no dustrial workers considered | elves signif. 
difference in management consideration of cantly (p < .02) less respon han did the 
_ farm and factory workers. The third row of farm workers. Factory and workers ex. 
© Table 1 shows no difference in workers’ rat- hibited equivalent levels ative-taking 
ings of their immediate supervisors’ leader- behavior. 
_ ship, The two remaining causal indices showed The bottom third of Ta shows that 
that industry provided workers with signifi- factory and farm worker: » strikingly 
TABLE 1 
RELIARILITY Estimates AND MEAN SCORES FOR INDUSTRIAL AND AGRICULTURAL U s ON 
CAUSAL, INTERVENING AND Enp~RESULT VARIABLES 
Indust ure 
Index No. | Coefficient — | a fer. 
items| alpha | 
f | SD n M 5 t 
Causal variables C 
Plant manager 's leadership 4 73 | 411] 74) 430 | 435 | 62 | 126 | 330% 
Consideration 2 63 3.66 | .93 | 449 | 3.56 1.00 16 1.09 
Supervisory leadership 4 lis 3.84 7 1.48 
ities for self-realization 5 84 |301| 99 |462 3.20 81 | 173 |218 
Participation 9 88 |274| .93 | 387 | 3.35 | 1.17 | 147 | 630%" 
Intervening variables | 
ee control 3| „ | 3.12. | 1.12 |472 | 3.71 | 1.00 | 171 | 6.098% 
Freer relations 5 33 | 3.30 60 452 | 3.52 | 70 171 | 3.91%" 
Information 5| 86 |340| 1.01 | 466 | 3.65 | 102 | 168 | 601 
Superior-subordinate communica- | i 
1 4 81 40% .79 | 443 | 4.16 | .70 42 <i 
i Responsibility 3 65 3.98 | .69| 407 4.4 63 | 138 co 
mitiative 3 50 3.65 | 1.65 | 468 | 3.60 | 1.47 | 17 < 
: 8 variables | A 
i ob satisfaction 2 76 3.60 88 446 | 3.59 93 171 
1 6| 78 | 437 | 86] 451 | 441 | 34|167 | <i 
penta £ 10 «83 3.82 | 82 462 | 3.85 | 81 | 167 Si 
uber visors ratings of workers 3 80 5.26 | 1.19 | 558e | 5.37 | 1.35 | 130° 


This u is larger than the self-report i 
did not subsequently return com ecient 
This n is smaller than the 
_ available for interviewing, 
*p <.05, two-tailed test. 
** p < .02, two-tailed test, 
* p <01, two-tailed test. 


leted questionnaires. 


z we 
-report agricultural sample, since managers of six of the participating farm branches 


A F a of whom 
ial sample, since plant managers filled out ratings on all their workers, some 


re not 


gical end-result variables, 
ction, mental health, and 
t row of Table 1 shows 
ratings of workers were 

factory and farm work- 


hat factory and farm jobs 
d differently by workers, 
iaracteristics measured by 
differential scales, five 
nificant differences (p < 
sts based on responses from 
kers and 180 agricultural 
ying slightly from scale to 
onal missing data). Indus- 
considered more mental, 
er, but also less varied in 
alone. 


! ISCUSSION 


lifferentials used to measure 
offer strong evidence that 
differences between work 
> kibbutz farm and factory. 
ifficient to cast serious doubt 
place contention that the 
ioe of agriculture” in the kib- 
iced that the kibbutz agri- 
effect become an industrial 
one accepts this view, the 
Figure 1 indicate that the 
work performed by this new 
is still quite different from 
z the industrial worker in the 
any case it is worthwhile to 
the diffe: rences between the two. It is 
not at all surprising that industrial 
Ik is perceived as less physical, cleaner, and 
than farm work. These are hygiene 
according to Herzberg, Mausner, and 
nan (1959). These differences have a 
eal of face validity and corroborate 
ations of the investigators during nu- 
US site visits. 
It is interesting that factory workers feel 
r work is performed more alone. The 
Number of workers in the sampled 
and agricultural branches was 26 and 
ectively. Thus, the factory workers 
Ay reported themselves as being lonely 
wd. Farm workers reported a greater 


feeling of working together with others, de- 
spite the smaller size and wider geographical 
dispersion of their branches, 


Table 1 shows that, in addition to differ- 


ences in the work itself, there are large and 
consistent differences in causal and interven- 
ing variables showing results unfavorable to 
the factory. The only variables showing no 
differences involve the immediate supervisors 
—their rating by their subordinates, their rat- 
ing of their subordinates, and worker-super- 
visor communication. These similarities in 
items tapping aspects of worker-supervisor 
relations may be due to the common kibbutz 
milieu shared by members at different levels 
in the work organization. After hours they all 
remain a part of the same community, and 
nonwork interaction may serve to attenuate — 
whatever differential effects might otherwise 
have occurred in interpersonal relations, 

The causal and intervening variables showed 
overwhelming disadvantages for factory 
workers, and it is surprising that this did not 
produce differences in end results. Since the 
number of workers in the immediate work en- 


vironment is negatively associated with job 
satisfaction (Indik, 1963), the size difference 


is an additional reason to expect industrial 
workers to be less satisfied. One could argue 
that alienation and mental health are more 
influenced by nonwork, and since kibbutz 
farm and factory workers share the same 
communities and nonwork social environ- 
ment, they should be equally mentally healthy 
and alienated. This reasoning does not hold 
for job satisfaction, which should be influ- 
enced by job content and by causal and inter- 
vening variables. Why are the psychological 


returns to factory workers equivalent to those 


of farmers despite the evident disadvantages 
of factory work? On the basis of the data 
available one can only speculate as to the 
answer to this question. 

The most obvious and straightforward in- 
terpretation of the results is that industriali- — 


zation in and of itself has a measurable im- 


pact on the job itself and on the work en- 


vironment, but not on the worker's psycho- 


logical responses to his job. Among kibbutz > 
members, similarity in degree of urbanization 
and comparability of all other community 
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ronment factors may attenuate or indeed 
obliterate the negative i 


true, one should seek the sources of the ma- 
laise of modern industrial man somewhere 
nd the workplace, Future research could 


members from the relative disadvantages of 
their work. 
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The proble: employee turnover has con- 
tinued to pl: srganizations in recent years 
despite an ir se in investigations into fac- 
tors affecting behavior. With few excep- 
tons, attitu l studies of turnover have 

on ti nstruct of job satisfaction as 
2 predictor « nure. A moderate and con- 
sistent relati ip has generally been found 
across various samples between greater job 
Sttisfaction anc the propensity to remain with 


Organization (Brayfield & Crockett, 1955; 
Herzberg, Mausner, Peterson, & Capwell, 
1957; Porter & Steers, 1973; Vroom, 1964). 

Recent investigations of turnover have typi- 
cally attempted to replicate the findings of 
Previous studies and have generally failed to 
k beyond the job satisfaction construct for 
other potential correlates of turnover. In ad- 

tion, such research has been consistently 
“atic in nature, collecting attitudinal mea- 
“ures at one point in time and comparing such 
— 
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vestigation studied changes across time in measures of organizational 
tment and job satisfaction, as each related to subsequent turnover 
a sample of recently employed psychiatric technician trainees. A 
idinal study across a lol-month period was conducted, with attitude 
cs collected at four points in time. The results of a discriminant analy- 
idicate that significant relationships existed between certain attitudes held 
mployees and turnover. Relationships between attitudes and turnover 
‘ound in the last two time periods only, suggesting that such relation- 
ire strongest at points in time closest to when an individual leaves the 
zation. Organizational commitment was found to discriminate better 
n stayers and leavers than did the various components of job satis- 


data with either present or future turnover 
rates, As pointed out by Lefkowitz and Katz 
(1969), attitudinal measures of factors as- 
sociated with turnover do not remain con- 
stant over time. Because of this, it appears 
more appropriate to measure patterns of atti- 
tude changes over time as they relate to the 
propensity to leave than to rely on a single 
measurement at one point in time. Our knowl- 
edge of the turnover process could therefore 
be expanded, at least in part, by the follow- 
ing: (a) investigations into other potential 
constructs relating to turnover in addition to 
job satisfaction and (6) measures of attitudi- 
nal changes over time as they affect de- 
parture. The present study is designed to 
accomplish both of these objectives. 

This study focuses on the attitude construct 
of organizational commitment, in addition to 
job satisfaction, and compares their respective 
predictive powers in differentiating stayers 
from leavers among a sample of psychiatric 
technician trainees. The assessment takes the 
form of a longitudinal study over a 103 
month period. 

Based on previous findings, we would ex- 
pect overall job satisfaction measures to be 
inversely related to turnover (Hulin, 1966, 
1968; Waters & Roach, 1971; Wild, 1970). 
When individual components of job satisfac- 
tion satisfaction with pay, promotion, su- 


603 


pervision, co-workers, and the work itseli— 
have been studied, fairly consistent relation- 
ships also have been found with the propen- 
sity to remain (Farris, 1971; Friedlander & 
Walton, 1964; Hulin, 1968; Knowles, 1964; 
Ley, 1966). Such findings are not universal, 
however, as some studies found certain of 
these factors unrelated to withdrawal (Kil- 
bridge, 1961; Taylor & Weiss, 1969b; Telly, 
French, & Scott, 1971; Waters & Roach, 
1971). 
Organizational commitment may represent 
a related but more global evaluative linkage 
between the employee and the organization 
_ that includes job satisfaction among its spe- 
cific components. Relatively few studies have 
examined the relationship between commit- 
ment and turnover behavior (Porter, Cram- 
pon, & Smith, 1972). Organizational com- 
mitment is defined in the present context in 
terms of the strength of an individual’s iden- 
tification with and involvement in a particular 
organization. Such commitment can gen- 
erally be characterized by at least three fac- 
tors: (a) a strong belief in and acceptance 
of the organization’s goals and values; (b) 
_ 4 willingness to exert considerable effort on 
behalf of the organization: ( c) a definite de- 
sire to maintain organizational membership. 
Some evidence exists that a stated intention 
to remain with the organization, a component 
of commitment, is strongly and inversely 
related to turnover (Atchison & Lefferts, 
1072; Kraut, 1970). It is predicted, there- 
fore, that individuals highly committed to an 
organization’s goals and willing to devote a 
Steat deal of energy toward those ends would 
de inclined to remain with the organization 
in an effort to assist in the realization of 
such highly valued objectives. 

Under certain circumstances, measures of 
organizational commitment may be more ef- 
- fective predictors of turnover than job satis- 
faction. For example, while the individual 
may be dissatisfied with either his or her pay 
or supervisor, a high degree of commitment 
to the organization and its goals may serve to 
3 Override such dissatisfaction in the decision to 
continue participation in the organization. In 
other cases—for example, where money is 

very important to an employee and where he 
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Subjects and Research Seii 


The subjects in this study 
or groups of psychiatric technic: 
by a major west coast hospit 
retarded. The training course 
Was one year in duration, inclu 
supervised clinical experience an 
site classroom lectures. The clinic 
the trainees were quite varied, r 
and dressing patients to doing ph 
using such techniques as behavio 
erant conditioning. During the 
technicians received a modest s; x 

The demographic characteristics of the two 1 
ing groups were very similar. There were no dif- 
ferences between the groups with respect to = 
male-female ratio, or educational background. 18 
addition, selection procedures for both groups Wer 
identical, as were the training sequences, 
assignments, and instructors. The training group 
were begun two months apart. Because of the re 
larity of the two groups, it was felt combining 
into one study group was justified for purposes 
analysis. 5 

As ih most state hospitals, this organization 
a long history of relatively high turnover a 
Psychiatric workers, with rates ranging be 
30% and 50% during the first year of full- 35 
employment (i.e., after completion of W uals 
averaging 25% thereafter. A total of 84 individ 
began training in Groups 1 and 2. However, eu 
beginning of this study (approximately nine mo 
later), only 60 individuals remained. Tee 
individuals constituted the initial study group- 7 
ing the remainder of the study period, howevet. 0 
more individuals terminated, or 45% of the samp! 


two chasse 
ces employed 
the mentally 
se employes 
000 hours of 
hours of os- 
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TABLE 1 i 


n Leavers Actos Tue Prsioos 


Time Feriod 3 


ruments were used in the atti- 
Sach is discussed briefly. 
mmitment Questionnaire. This 
was designed to measure the 
ibjects feel committed to the 
on. Included in this instrument 
to the subject's perceptions 
iy toward the organization, his 
a great deal of effort to achieve 
and his acceptance of the 
All items represent statements 
ct responds on 7-point Likert- 
from “strongly disagree” to 
ne wording of six items is re- 
t to reduce response set bias. A 
vere!| commitment for each respondent 
z the mean score across all items. 
sistency of the instrument as mea- 
td by coefficient alpha (Cronbach, 1951) ranged 
“82 to .93 across the four time periods. 
Descriptive Index (JDI). The JDI was em- 
need to Measure satisfaction with five aspects of 
Subjects’ jobs: supervision, co-workers, work, 
b and promotion. In addition, the unweighted 
M of the five scales was used as a measure of 
“all satisfaction. Validity and reliability data, as 
es Scoring techniques, for the JDI are described 
Smith, Kendall, and Hulin (1969). 


tta Collection Procedures 


ments were administered to the subjects 
four sequential time periods by the research- 
“enfidentiality of responses was assured by both 
University researchers and hospital management. 
er, subjects were given the opportunity to 
Participation but none did so. 

frst administration (designated Time Period 
CS given 10 weeks prior to the completion of 
aie g program. Only Group 1 participated in 
“ministration since Group 2 had already passed 


me Period t Time Period 2 Tame 
10 weeks) {=} weeks) (+2 weeks) 1 N pha my 
2 — Í 
t Leaver Stayer Leaver Stayer Leaver Mayer bal 
16 (a= 1) | (a = 23) (e 2) | aem 1 = 21) i -2 | w. 
sb | M | SD] u |sp| u | SD) u |s0| «| SD) u | SD) u | sD 
10) asf soj as] stji BS) 01 45) s as) sef aof at 2 * 
E. | Í 
29.5 | 130.0 5.1 152.0 | 44.2 | 127.7 | 45.0 | 155.4 S15 |134| 468 |1469] 647 |1345 
118 | 240|37.0 | 32.9) 145) 264 |125| 32.3|154 | 278 12 | 35.6 | 10.9 | 2247 
10% 4 1. 38.7) 14% . . % % dee | solii) 12) 170 
$8) 302) 9.0) 32,9) 88| 275/125) 277/105 | 245) 122 f 
12.4 | 11. 108 | 20.2 | 10.5 | 19.21127 | 219/145) 25 4% JOS) 19) „ o 
4.9 | 26.9) 18.2 | . — 140/155 MA) 17.7 | 22.2 [10.5 | 235 1s | 174 44 
isa) EE EI E ! 
ptive Index. 


this point in their training program. The remaining 
three administrations (Time Periods 2, 3, and 4) 
were given to both groups two weeks prior to train- 
ing completion, two weeks after beginning full-time 
work (i.e., after training), and six weeks after be- 
ginning full-time work. Subjects who did not com- 
plete the training program were dropped from the 
investigation. The sample size fluctuated somewhat 
during the latter three administrations due primarily 
to incomplete questionnaire responses during all time 
periods and to a very small amount of turnover 
between the third and fourth administration periods. 
Thus, while a pure longitudinal design was at- 
tempted, it was not possible to achieve an entirely 
homogencous sample across time. Turnover data, 
based on hospital records, were collected up to eight 
months following the completion of the training 


program. 


RESULTS 


The sample was divided into stayers and 
leavers based on subsequent turnover data, 
and mean scores for each group were calcu- 
lated on the organizational commitment and 
job satisfaction measures. Group means, 
standard deviations and sample sizes for each 
time period are presented in Table 1. 

The homogeneity of stayers and leavers 
concerning demographic characteristics is re- 
flected in the absence of significant differ- 
ences between the two groups with respect to 
education, male-female ratio and income. 
However, the mean age for stayers (X = 31.9 
years) was significantly higher (% < 01) than 
the mean age for leavers (X = 23.9). Rela- 
tionships between age and turnover have been 
found elsewhere and were not surprising find- 
ings in the present study (Farris 1971; Fleish- 


man & Berniger, 1960; Ley, 1966; Minor, 
1958, Stone & Athelstan, 1969). This finding 
Suggests though, the possibility that any rela- 
tionships found between attitudes and turn- 
over may be attributable to differences in age. 
Consequently, age was controlled in the sub- 
_ sequent analysis by partialing its influence 
from the commitment and job satisfaction 
_ Scores using a method suggested by Nunnally 
(1967), 

A discriminant analysis (Overall & Klett, 
1972) was performed for each time period to 
_ find the linear combination of the commit- 
ment and job satisfaction scores (with the 
influence of age partialed out) that best dis- 
criminates between groups formed on the 
basis of turnover. The significance of the 
discriminant functions was tested using a test 
_ Statistic which is approximately distributed 
as a chi-square variable when computed by 
the procedure in Overall and Klett (1972). 
In the two group cases of discriminant analy- 
sis, only one discriminant function is found, 
and the results are functionally equivalent to 
those that would be found from a multiple 
Tegression using a dummy criterion variable 
(Tatsuoka, 1971). The total discriminatory 
power of the discriminant function was cal- 
_ culated using Tatsuoka's (1970) multivariate 
extension of the estimated omega-squared. The 
_ Omega-squared statistic can be interpreted as 
che percentage of variance in the discriminant 
Space that is relevant to group differentiation 
(G Tatsuoka, 1970). 


TABLE 2 


STANDARD DISCRIMINANT Weicnts, TEST 
STATISTIC, AND DISCRIMINATORY PowER 
Across TIME PERIODS 


Í anI = Job Descriptive Index, 
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examined the 
relationship between tu ind the follow. 
ing set of variables: c ent to the or 
ganization;  satisfactior supervision: 
satisfaction with co-wor} itisfaction with 
the work itself; satisfac ith pay: satim 
faction with opportuniti: sromotion, The 
measure of overall jol tion was not 
considered in the disc; analysis be 
cause it is derived from mation of the 
five subscales and thus i in conjunction 
with the subscales) wou! te the mathe 
matical requirements of hnique. 
Results of the discri analysis for 
each time period are rep: n Table 2, As 
the results indicate, no si it discrimina- 
tion was found between and leavers 
in either Time Period 1 ne Period 2. 
However, significant relat s were found 


eral attitude 
05) and in 


between turnover and 
measures in Time Period 
Time Period 4 (% < 05 

The fact that significan 
found in the latter two tin 
in Time Periods 1 and 2 
relationship between turnove 
attitudes is strongest at poin 
to when the individual te 5 
words, as an individual approaches the point 
of leaving the organization it appears that his 
or her attitudes are a more accurate predictor 
of such subsequent behavior. 

The results of the discriminant analysis for 
Time Period 3 indicate that 21° of the vari- 
ance in commitment and job satisfaction E 
relevant to the subsequent decision to stay Of 
leave the organization. Interpretation of the 
standardized discriminant weights (Table 2) 
allows conclusions to be drawn concerning the 
relative contribution of each variable in dis 
criminating between groups. C ommitment to 
the organization was clearly the most impor 
tant variable in differentiating between stay 
ers and leavers. Satisfaction with opportun: 
ties for promotion and satisfaction with 
work itself were next most important. | 
moderate size of the discriminant weights 
for these variables relative to the weight m 
commitment indicates a smaller contribution 
to the discrimination between groups- 


ionships were 
iods but not 
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ind the several 
n time closest 
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DISCUSSION 


purpose of this investigation 
terns in organizational com- 
b satisfaction across time as 
to turnover among a sample of 
‘chiatric technicians, in this 
‘ having high rates of turnover. 
*he results clearly indicate that the attitudes 
Meld by an individual are predictive of subse- 
int turnover behavior, with individuals who 
MUmately leave the organization having less 
Worable attitudes than individuals who stay. 
“terns of attitudes across time suggest that 
> inverse relationship between favorable 
Mütudes and turnover generally is stronger 
> individuals approach the point at which 
leave the organization. Based on these 
E lts, several points warrant emphasis and 
istussion, 

When group means on the original mea- 
des are viewed within a cross-sectional 
4 Work, it is apparent that the level of 
i des for leavers is consistently lower than 
© level of attitudes for stayers during all 
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four time periods. This finding is 
evident with respect to commitment to the 
organization. Although it is clear that leavers 
are characterized by lower levels of commit. 
ment and satisfaction, the data are not suf- 
ficient to demonstrate why such differences 
occur. ’ 
One possible explanation may be found in 
the extent to which the expectations of stay- $ 
ers and leavers had been met on the job 
(Porter & Steers, 1973). Each individual can 
be viewed as bringing a unique set of eee 
tations to the employment situation. The rela- 
tively common reward system (le., levels of 
pay, training procedures, task requirement, 
and instructors and supervisors were the same 
for all subjects) can be regarded as approxi- — 
mating more closely the level of expectations 
of those individuals who remained with the 
organization than those who left. Within such 
a framework, certain attitudes and turnover 
would be viewed as outcomes of a process in 
which individuals compare their level of ex- 
pectations with the perceived realities of the 
job environment. While the present investiga- — 
tion was not specifically designed to examine 
this issue, the importance of considering met 
expectations in the withdrawal process has 
been suggested elsewhere in the literature 
(Dunnette, Arvey, & Banas, 1973; Katzell, 
1968; Macedonia, 1969; Weitz, 1956; Young- 
berg, 1963). R 
When the results are viewed longitudinally, 
the relationship between attitudes and turn- 2 
over appears to grow generally stronger over 
time. Evidence that a general decline in the 
level of attitudes takes place prior to leaving 
the organization has been found elsewhere 
(Porter et al., 1972). Such a trend was es- 
pecially evident in this study with respect to 
organizational commitment, where the differ- 
ence between the stayer and leaver mean 
scores generally increased over time. How- 
ever, such a relationship was not as evident on E 
the various measures of satisfaction. In fact, 
the difference between the stayer and leaver f 
means on the subscales of the JDI tended to 
decrease over time, although such a pattern 
was not entirely consistent. In addition, the 
mean satisfaction scores for stayers and leav- 


ers fluctuate considerably across time with no 
apparent pattern. 
The differential relationships between turn- 
over and the two attitude constructs suggests 
that commitment and satisfaction are related 
yet distinguishable attitudes. Across the four 
time periods, the average intercorrelation 
between commitment and the five subscales 
of the JDI were .50, 42, 40, and .55, re- 
spectively (the highest correlations were gen- 
erally found between commitment and satis- 
faction with the work itself), Thus, on the 
average, the two constructs shared less than 
35% of the common variance. When partial 
correlations were calculated between commit- 
ment and turnover holding level of overall 
satisfaction (JDI total) constant, significant 
relationships were found in three of the four 
time periods (in Time Period 1, neither com- 
mitment nor overall satisfaction was related 
to turnover). These findings suggest that al- 
though we would expect commitment and sat- 
action to be related, each construct appears 
to contribute unique information about the 
individual’s relationship to the organization. 
Although this investigation was not designed 
to examine in any extensive manner the rela- 
tionship between the two constructs, it is pos- 
sible to speculate as to the nature of the dif- 
ferences between commitment and satisfac- 
tion. On the one hand, the development of 
organizational commitment appears to require 
an individual to think in fairly global terms 
about his or her relationship to the organiza- 
tion during the initial employment period. 
We would expect that a relatively greater 
amount of time would be required for an 
employee to determine his level of commit- 
ment to the organization than would be the 
case with his level of job satisfaction. This 
Process is likely to result in a stable and 
enduring level of commitment. On the other 
hand, the degree of one’s job satisfaction 
appears to be largely associated with specific 
and tangible aspects of the work environment / 
and may represent a more rapidly formed 
affective response than does commitment. 
One's level of job satisfaction may thus be 
more transitory in nature, which could ac- 
count for the shifts in the mean degrees of 
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satisfaction over time í n the presat 
study. 
The overall findings study suggest 
that in this type of or onal context, | 
general attitudes toward inization may 
be more important in the mn to remain 
than the more specific at toward one’s 
particular job. It must | ibered, how- 
ever, that such finding ed among a 
sample of psychiatric tec s working in 
a state mental hospital. ( e nature and 
purpose of such an orga it is com- 
ceivable that organization iitment may 
take on a special importar he participa- 
tion decision. It is possi it some em- 
ployees place a high value goals of the 
organization and such a tation may 
outweigh—to a degree, at their like or 
dislike of the particular equired to 
attain such goals. Whet! ilar results 
would be found among e: es of most 
profit-centered business org ns remains 


to be demonstrated. 
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JOHN E. NEWMAN 1 
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The present investigation studied the lab-to-field generalizability of | 


attitude-behavior model and examined the relative efficacy 


of the 


model and traditional job attitude measures as predictors of absente 1 


turnover. Predictor data were collected from 108 nursing 


home 


immediately preceding the two-month time period of interest. Crite 
were obtained at the end of the two-month period. Fishbein’s mode! 
some field support, particularly with respect to predicting turnove 
tional job attitude measures were more effective predictors of ab 


while Fishbein’s model was a more effective predictor of turnover 
concluded that neither approach seems superior especially 


in light 


amounts of criterion variance accounted for, 


Empirical studies of the attitude-behavior 
relationship in specific reference to job atti- 
tudes and job withdrawal behaviors have been 
reviewed recently by Porter and Steers 
(1973). They concluded that there is consid- 
erable evidence for consistent negative rela- 
tionships between job attitudes (ie, job 
satisfaction) and turnover. There appears to 
be a similar relationship of job attitudes to 
absenteeism, although there is much less in- 
formation available. Porter and Steers note 
that many of the more recent findings are 
based on reliable and valid measures of job 
attitudes such as the Job Descriptive Index, 
JDI, (Smith, Kendall, & Hulin, 1969), 

Although these observed attitude-behavior 
relationships are relatively consistent, they 
are seldom overly strong. Obtained correla- 
tions or multiple correlations as high as .50 
have been rare. Since the prediction of job 
withdrawal behavior is a critical research 
topic from both theoretical and applied per- 
spectives, it is important to further specify 
the precise nature of the relationship between 
attitudinal measures and withdrawal behavior 
and to test these models in applied settings, 

Fishbein (1967) has developed a model 
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for predicting overt beh from verbal 
predictors. Fishbein’s theo 1 adaptation 
of Dulany’s (1961, 1962) y of propo- 
sitional control and a of the ex. 
tended instrumentality the: of behavior 
(Mitchell & Biglan, 1971) : 
The theory identifies two 5 of variables 
that function as the basi erminants ol 
behavior: (a) attitudes to the behavior 
and (b) normative beliefs. The first com- 
ponent, attitudes toward the behavior, re- 
flects the instrumentality concept and de- 
pends upon the individual’s beliefs about the 
consequences of performing the particular 
behavior in a given situation and his evalua- 


tion of these consequences.“ second com- 
ponent, normative beliefs, depends upon the 
individual’s beliefs regarding others’ expecta- 
tions of his behavior and his motivation to 
comply with these expectations. Fishbein 
acknowledges that other variables may affect 
behavior, but suggests that they operate in- 
directly by influencing one of the above de- 
terminants. i be 

Fishbein’s theoretical formulation may 5 
expressed in symbolic form as a multiple re 
gression equation: 


B ~ BI = (Aae) wo + (NB. MC) Ws 
Where: 


B = overt behavior 
BI = behavioral intention 
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toward the behavior in 
a situation 

normative beliefs, i.e., 
pe ed expectations of others 
m ion to comply with the 
ni ive beliefs 

WoW = €r ally determined weights. 


It is impo to note that the criterion 
predicted by omponents of this model is 
the individu ehavioral intention (BI). 
Ajzen and Fi n (1969) have stated that 
BI serves as itervening variable between 
the attitudin: d normative predictors and 


the overt beh 
predictor of 
person’s inter 
According to 
dict BI, one 
slight attenua 
The result 
studies have 
lation betwee: 
haviors is a 
These same 
average multig 
components o 
ioral intentio: 


They believe that the best 
ven behavior should be the 
to engage in that behavior. 
theory then, if one can pre- 
also predict B with only 


more than nine laboratory 
:cated that the average corre- 
SI and a wide range of be- 

70 (Fishbein, in press). 
‘ies have indicated that the 
correlation between the two 
‘ishbein’s model and behav- 
is about .80 (Fishbein, in 
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press). The model, however, has not been 
tested in on-going work organizations, and the 
usefulness of the theory for organizational 
situations is unknown, 

The purpose of this study then was to test 
Fishbein’s model in a real organizational set- 


ting. In addition to exploring the generaliza- 


bility of the model from the laboratory to the 
field, the relative efficacy of the Fishbein 
model and traditional job satisfaction mea- 
sures as predictors of job withdrawal be- 
havior was examined. 


METHOD 
Subjects and Research Setting 


The subjects (N =108) were employees of a 
county nursing home and included nurse’s aides, 
licensed practical nurses, registered nurses, adminis- 
trators, staff, food service and housekeeping person- 
nel. Slightly over 88% were female. 


Questionnaire 


The questionnaire provided measures of the fol- 
lowing variables: 


1. BI with respect to unexcused absences was 
measured in the following manner: 

I am going to be absent (unexcused) from work 
on one or more days within the next two months 
(July and August). 


This scale was scored from 1 (extremely unlikely) 
through 7 (extremely likely). 

2. Attitude toward being absent (Adet) was mea- 
pu by. taking the sum over the following four 


Mantic differential scales with high loadings on 


foolish 


EXTREMELY QUITE SLIGHTLY NEITHER SLIGHTLY QUITE EXTREMELY 
LIKELY LIKELY LIKELY LIKELY UNLIKELY UNLIKELY UNLIKELY 
NOR 
UNLIKELY 


the evaluative factor (cf. Osgood, Suci, & Tannen- 
baum, 1957): 

Being absent (unexcused) from work on one or 
more days within the next two months (July and 
August) is: 


wise 


good 


bad 


harmful 


beneficial 


rewarding 


punishing 


= scale Was scored 1 through 7, 7 representing 
Positive end of the continuum. 

ae 1922 respect to unexcused absenteeism was 

sored f Y the following scale. Responses were 

enei tom 1 (extremely unlikely) through 7 (ex- 
e likely), Note that the phrase “whose opin- 


ions I respect” is an attempt to incorporate moti- 
vation-to-comply into this measure. 7 
Most of the people, whose opinions I respect, think 
I should be absent (unexcused) from work on on 
or more days within the next two months (July and 


August). 


8 
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EXTREMELY QUITE SLIGHTIY NEMHER SLIGHTLY qi 
LIKELY ELY LIKELY ag UNLIKELY UN! 
NO 
UNLIKELY 
4 BE, Aso, NB, with respect to voluntary resig- are presented in Table m Table t k 
nation were alo measured according to the above can be seen that BI co 1.10 and 4 
3 the job in general was mea- With absenteeism and resi respectively. 
sared by the General Motors Faces scale (Kunin, Although the 39 correl: is significant 
1955). This was scored from 1 (very dissatisfied) (p< 01), it was substa smaller thas 
through 5 (very satisfied). It incorporated five faces BI-B correlations (? = tained in the 
pyres pth it — 8 bas laboratory studies menti auler 
ployee's job situation (A.) were measured by the The Fishbein model w igned specifi 
five sales of the Job Descriptive Index (JDI) cally to predict behavio: tentions, The 
(Smith, Kendall, & Hulin, 1969). These scales multiple correlations bet» he respective 
— i the employee's satisfaction with the work, attitudinal and normative nents of the 
i pay, and co-workers. model and the intent to | nt (R= 45, 
Procedure p <.01) and with the in: resign (R= 
The predictor data were obtained from the par- :70, 5 < .01) indicate tha ein’s model 
in the field setting. In an effort to minimize is a relatively effective p r of behav- 
‘Me reactive nature of the study, the questionnaire ioral intention. Both of. multiple cor- 
Le in, however, ween Dea Ù 
ministration room, he signed a numbered attendance 80) obtained in the lab: stu 
roster and was given, unobtrusively as possible, a In summary, although th hbein mode 
Questionnaire ac number. This yielded several significant lations, the 
allowed matching of the questionnaire data with amounts of variance in over wiors and in 
. resignation dats. behavioral intentions accou: de by tbe 
The general nature of the study was explained 4 3 P . sub- 
briefly, the participants were guaranteed confidenti. Fishbein model in this fiel were 
informed it was not necessary to sign the stantially lower than those accountable by 


work shifts on the two days immedi- 
ately preceding the two-month time period of in- 
terest. 


Data relevant to the behavioral criteria, unexcused 
absenteeism (a single dichotomous act criterion) and 
4 voluntary resignation (also a single dichotomous act 


not (0) absent (un- 
during the two-month period. It is 
to understand that we were pre- 
Or not the employee would be ab- 
ent, not frequency of absences as a continuous 
Lernt en ay resignation was scored as O or 1 
dep on whe employee voluntarily 
resigned (1) or not (0) during the two-month 


importan 


RESULTS 

Fisnbeins Model: Lab versus Field 

The means, standard deviations, and inter- 
ations of all predictors and both criteria 


the model in laboratory situations (Table 1). 


Predictive Power: Comparison oj the 
Fishbein Model and the Traditional 
Job Attitude Measures 


Absenteeism. BI correlated .10 with ab- 
senteeism (Table 1). Ass and NB, had a 
multiple correlation of .12 with absenteeis™ 
(Table 2). Thus, Fishbein’s model accounted 
for (or predicted) about 1% of the variance 
in this withdrawal behavior. 

Examination of Table 1 indicates chat a 
traditional measure of satisfaction with the 
work itself (JDI Work scale) correlated sig- 
nificantly with absenteeism (r = —-19). This 
correlation coefficient was larger than DY 
achieved by Fishbein’s variables. However, 
the best single predictor of absenteeism was 
satisfaction with the job-in-general (Faces 
scale, r= —.31), accounting for roughly ° 
of the criterion variance. 
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Jt should be noted that scores on the JDI 
; were alo summed (unweighted) to 
d another index of overall job satisfaction. 
index correlated —.10 with absenteeism. 
The predictive power of combinations of 
traditional measures was also examined 
ble 2). The multiple correlation of the 
JDI scales with absenteeism was .27 (ns). 
ing the Faces scale to the preceding five 
increased the multiple correlation 
< .05). This combination of tradi- 
attitude measures turned out to be 
best predictor of absenteeism, accounting 
the variance. 

ta suggest that the traditional job 


-predictor-criterion correlations of this magni- 
tude are useful to the practitioner, this level 
of predictive power (i.e., amount of criterion 

ce accounted for) leaves much to be 


An EEN attempt to increase the pre- 
dictive power by combining the traditional 
measures and Fishbein’s components yielded 
4 multiple correlation of .37, not significantly 
better than the combination of just tradi- 
tional measures (Table 2). 

Resignation. BI correlated .39 (p < 01) 
with voluntary resignation (Table 1). Aset 
and NB, had a multiple correlation of .36 
p < .01) with resignation (Table 2). Thus, 
Fishbein's model accounted for (or predicted) 
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about 13% of the varia: 
drawal behavior (Table 
Table 1 indicates that 
tional job attitude measu 
satisfaction (Faces scale 
correlation (r= —.16, % 
tary resignation. The < 
overall job satisfaction (i 
of the five JDI scales) « 
resignation. The correlat 
tween satisfaction with p 
and resignation was not 
the .05 level (Table 1). 1 
ditional measures had nea 
with resignation (Table | 
single traditional measure 
proximately 3% of the c: 
Combinations of the t: 
were only slightly more pr: 
The multiple correlation oí 
with resignation was 21 
combination accounted for « 
of the criterion variance. 
Again, an exploratory att 
increase the predictive pow 
the traditional measures and 
sures. This resulted in a mu 
of .48 (p < .01) when BI w 
ble 2). This eclectic combii 


for 23% of the variance in vo! 


tion, 


These data suggest that Fis 


was relatively more effective 


voluntary resignation. Any conclusions about 


TABLE 2 
MULTIPLE CORRELATIONS BETWEEN PREDICTORS AND CRITERIA 


Criterion 
Measure Absenteeism | Resignation 
R R R R 

* 12 (050 0¹ (00) 363) 13010 

1 5 . — F 27 (.17) 07003) 21 (00) | .04(.00) 
311 and Faces 36*(.30) 13000) 26 (16) 07002) 
X B Faces, Ase, NB, 37 (28) 14008) 452 (38) 20(.14) 
S scales, Faces, Asm, NR., BI 37 (27) 1407) 48** (41) .23(.17) 


I Pay scale) 
ignificant at 
t of the tre 

correlations 
s, the best 
ted for ap 
variance, 
ul measures 
(Table 2). 
JDI scales 
This latter 
imately 7% 


vas made to 
combining 
ibein’s mea 
e correlation 
juded (Ta- 
n accounted 


tary resigna- 


bein’s model 
n predicting 


„Note. One 
sities tetas 


— fo} inkage”’ 
pas social normative beliefs, i.e., 1 ex 


terpretation of the 1 5 Rs since they are not cross-validated. For inferential pù 
r ‘shiri the Wherry formula. Ae = attitude toward the be 
pectations of others; BI = behavioral intention; JD 
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power of either approach, 
tempered by a considera- 
t of criterion variance ac- 


ISCUSSION 


idy indicated that neither 
behavioral intentions were 
ior in accounting for job 
iors. Also, Fishbein's model 
behavioral variance in this 
in in the prior laboratory 
more field tests are neces- 
findings reaffirm the view of 
behavior in real organiza- 
s a complex situation with 
d, unaccounted for contribu- 
variance. 
ts to come to grips with this 
d incorporate more facets of 
tuational information in a 
ut integrated, prediction sys- 
roach is outlined in Newman 


h should aim toward further 
he circumstances under which 
iships might be evident. For 
‘ree of control the person has 
behavior would appear to be 
at boundary condition. The 
ittitudes and intentions to 
a ly constrained to the extent 
‘the perso. lacks control over the be- 
or. This may be a partial explanation 
presen: results in that the study was 
ed during a time when the cost of liv- 
Vas high and the labor market tight. 
having no better alternatives, many 
es had no real choice concerning job 
al, perhaps. 
0sely related to the problem of lack of 
Tol in the field situation and to the de- 
Of control the person has over the be- 
tor of interest, is the fact that employees 
m a dynamic world. Thus, a most diffi- 
Phenomenon to cope with (let alone con- 
) in Predictive, field studies is the change 
re lle of the predictor variables during the 
Period following their measurement. 
visions of causal relationships dance 
Sf our heads when we interpret predic- 
@tutude—behavior correlations, then it is 
necessary to entertain the assumption 
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os 


that the attitude (or behavioral intention) 
remains constant during the intervening time 
period, Future investigators should test this 
assumption by closely monitoring the attitude 
of interest over several points in time prior to 


obtaining the criterion measure and even at 


the time of obtaining the criterion measure. 
This procedure would also allow examina- 
tion of the relationship between change (per 
e) in attitude and the criterion behavior and 
the relationships of magnitude, rate, and di- 
rection of attitudinal change to behavior. 
These relationships may be stronger and thus 
more predictive of absenteeism and turnover, 
for example, than the typical attitude-mea- 
sured-at-one-time-behavior relationship pur- 
sued today. 
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Greduate School of Business Administration, New York Uni: 


urban versus rural background, strong versus weak belief in the P 
Work Ethic, and high versus low strength for “higher order” needs 
three 


thew methods was used as a moderator of the relationship 
job characteristics on the one hand, and specific job facet satisfactic 
job satisfaction, and job behavior on the other hand. Based on 


$ 


need strength is the most useful 
measure this type of individual diference, followed by the Protestar 
Ethic, and lastly by urban versus rural background of the worker. Th 
no the three individual differences as moderator 
job characteristic: and job behavior relationship, 


Although certain job characteristics tend to emphasize either jobs or at the ex 
f more satisfying to employees than others, pense of the other. 
there are wide individual differences in reac- The Herzberg theory (H g, Mausner, 
lions to them. This research is an examination & Snyderman, 1959) as i by Ford 
of three different methods for measuring in- (1969) emphasizes job ch: stics rather 
livid differences relevant for employee than individual differences. theory sug- 
s to job characteristics, The three gests that jobs with o; -tunities for 
ls are (a) urban versus rural worker achievement, recognition. sibility, ad- 
ground (Hulin & Blood, 1968), (6) vancement, and growth in petence are 
Mirong versus weak belief in the Protestant those that enhance motivation: job satis- 
Work Ethic (Blood, 1969), and (c) high ver- faction. Another example of approach i 
5 low desire for “higher order need satis- the Turner and Lawrence (1905) six “requi 
action” (Hackman & Lawler, 1971). The site task attributes” (i.e., variety, autonomy, 
framework for comparison among the three required and optional interaction, knowledge 
individual differences concerns how well each and skill required, and responsibility). More 
moderates the relationship between the pres- recently, Hackman and Lawler (1971) sig- 
en certain job characteristics and reac- gested that intrinsically satisfying jobs 
to these characteristics, (a) allow the worker to feel personally rē- 
Redesign: Emphasis on Jobs and People sponsible for his work (autonomy), (5) pro- 


i 3 vide meaningful outcomes (variety and task 
Ide Present viewpoint is that both job identity), and (c) provide adequate knowl- 


eristics and employee individual dif- edge of results (feedback). Somewhat more 
fe i ces must be included in a thorough general treatments of work humanization 
Posener of work humanization. Historically, (Sheppard & Herrick, 1972; Work in Amer 
ica, 1973) have tended to stress a number ke 
on ch Shorter version of this article was presented factors in the total work environment, 
den J. annual meeting of the American Psycho- example, pay, supervision, working hours 
$ ET aeaa to thant. J. a and so forth. ; 2 
Edward E. Lawler III, Greg on. aid iene In contrast to an emphasis on job char 
acteristics, some have concentrated on various 
indivi differences 
nous, Graduate School of Business Admir ist types of employee individual 5 
New York University, 90 Trinity Place, New These efforts have proceeded along two 
New York 10006. The first of these has been to identify e. 
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demographically, for exam- 
or collar color (Sheppard 
Work in America, 1973), 
h, the concern of the pres- 
n to identify those employ- 
satisfaction from enriched 
u those who do not. 


I idual Differences: Three 


n middle-class work norms 
«ted as one way to understand 
em loyees are satisfied with and 
n enriched jobs (Blood & 
ilin & Blood, 1968). This view 
ferences has been operation- 
n two ways: demographically 
with a hand- pencil measure of work 


Mographically, Hulin and Blood (1968) 
‘two variables, white collar versus blue 
and urban versus rural work site loca- 
# concluded that the blue collar- 
of their 2 X 2 matrix represents 


st demoxraphic measure (Hulin, 1973; 
r, 1973. Shepard, 1973). The issue 
ms how to define urban/rural differ- 
that is, work site location or location 
Malizing experiences while growing up. 
Present study is not able to resolve this 
lar issue, because the urban/rural 
hee was based on personal background 
Socialization, not work site location 
th was exclusively urban). 
€ second approach to measurement of 
e-class work norms employed a short 
designed to measure belief in 
fotestant Work Ethic (Blood, 1969). 
recently a much longer scale has been 
ped (Wollack, Goodale, Wijting, & 
1 1971), but this was not used in the 
it study. 
‘third approach measures an individu- 
ences or desires for various amounts 
n job characteristics (Hackman & 
971). This shares a common ques- 
methodology with the Protestant 
Ethic, but differs conceptually in that 
€ desires for job characteristics are 
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eee as follows 
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split was used to separate employees into high and 
low “higher order need strength” groups. 3 

In addition to employees’ individual differences, the 
presence of certain task characteristics and reactions 
to the characteristics were measured. The degree to 
which certain task characteristics were present on 
this job was measured on the four core dimen- 
sions of feedback, variety, autonomy, and task 


r 


; 


“identity (Hackman & Lawler, 1971), and several 


other job characteristics, Each person was asked to 
describe his job by rating the amount of each job 


| characteristic present on the job. 


Three types of employee reactions were ob- 
tained: (a) satisfaction with a specific job char- 
acteristic, (b) global job satisfaction, and (ç) job 
behavior (i.c., absenteeism and performance). 

Satisfaction with specific job characteristics was 
measured on the Minnesota Satisfaction Question- 
naire, (MSQ) Short Form (Weiss, Dawis, England, 
& Lofquist, 1967). The MSQ is a different scale 
from that used to measure the presence of job char- 
acteristics, and was located in a different part of the 
questionnaire. This was intentionally done to reduce 
the amount of common method variance (Campbell 
& Fiske, 1959) that might spuriously inflate corre- 
lations between measures from the same type of 
scale. 


Item 
1. Description 


Item Content 


The number of different things done 
on the job. 

The chance to do different things 
from time to time. 


Satisfaction 


Description Dealing with co-workers as part of the 
job. 

Satisfaction Co-workers get along with each other. 

Description Dealing with customers as a part of 
the job. 

The chance to do things for other 


people. 


Satisfaction 


Description The opportunities to talk informally 
ey with other people on the job. 

Satisfaction Co-workers get along with each other, 
Description 


S Left on your own to do your work. 
Satisfaction 


The chance to work alone on the job. 


„Description 


scription Left on your own to do your work. 
Satisfaction 


The freedom to use my own judg- 
ment. 

Description Left on your own to do your own 
work. 

The chance to try my own methods 
of doing the job. 


Satisfaction 


Description Finding out from the work itself 
about your own performance. 

I vill get a feeling of accomplishment 
from the job. 


Satisfaction 


Description Finding out from management about 


your performance, 


Satisfaction I will get praise for doing a good job. 


Description 


S i The amount of challenge on the job. 
Satisfaction 


Being able to keep busy all the time, 
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How meaningfu 
I will get a feeli 
from the job. 


11. Description 
Satisfaction 


12. Description The use of impo 
ties at work. 
The chance to 


makes use of n 


Satisfaction 


Overall job satisfaction was 
ming the responses to two glol 
(a) Generally speaking, how 
your job? (b) How characteristi 
of you?: Taking everything ir 


very satisfied with my job. The 8 


these two items was r = .76. 

Job behavior was measured b, 
performance. Job performance 
supervisory ratings of an indii 
quantity of performance. Thes: 
lated .59 with each other and wi 
sum) into a composite measure 
In addition, absenteeism was us 
sure of job behavior. The nun 
senteeism in the first three mont 
used. Occasions absent rather 
were used to discount the efit 
periods of absence. 


RESULTS 


Table 1 shows the correlat 
presence of certain task ch: 
the specific satisfaction wi 
istic. If a particular indivi 
measure works as a modera 


expect to find significantly mor 


relations for high versus lo 
strong versus weak Protesta 
and for rural versus urban | 

The results in Table 1 
higher order need strength 


ences measure seems to work the I 


rk is, 
omplishment 


ills and abili. 


ething that 


ties, 


d by sum- 
t questions 
re you with 
s statement 
ount I am 
ion between 


nteeism and 
veasured by 
quality and 
itings corre- 
bined (linear 
performance. 
nother mea- 
cases of ab- 
the job was 
absent 
single long 


lays 


between the 
ristics and 
ı character- 
differences 
one would 


positive cor- 
need strength, 
Work Ethic, 


12 relationships shown, S are significant (one 
tailed tests) at the Os level or better, 3 show 
differences close to significance ($ values be- 
tween .05 and .10), and 1 is not significant, 


although it is in the predicted d 
the Protestant Work Ethic scale result 
fewer significant differences between 


irection. Using 


s in 
groups: 


Of the 12 relationships shown, 4 are ae 
cant at the .05 level or better, 1 is close 


significance, and 7 are in the pre 
tion but not significant. Using the ur 
rural backgrounds of these employee 


dicted direc- 


ban an 
s as al 


individual differences moderator shows 7 


fewest differences between groups. ~ 
relationships shown, only 1 is signi 


f the 1 
ficant at 


A cc 
I DUAL DIFFERENCES AND REACTIONS TO Jos Craractexistics 019 
TABLE 1 
Cornet WEEN Jon CHARACTERISTICS AND JOB SATISFACTION WiTH EACH CHARACTERISTIC 
r order need strength Protestant Work Ethic Urban-rural Differences 
Item need Low need Strong Pro- | Weak Pro- | Rural back- | Urban back- 
ngth strength testant Ethic | testant Ethic ground ground 
35-39) (n=35-39) (n= 35-39) (n=35-39) (n= 28-32) (n= 35-39) 
1 pas 20 AD 24 38* —.01 
2 4 | 25 39 17 520° 07 
3 s | —.12 -18* —.18 24° —.13 
4 5* -00 27 13 36 05 
5 oa 18 48 1 38 | 33 
6 SHS 30 ö 24 29 55 
7 gree —.18 38** | —.07 —.08 33 
8 1* 12 .24 | -20 20 34 
9 Maiga —.02 39 37 24 16 
10 43 —.14 32 08 —.0¹ —.24 
11 3757 23 50˙⁷ 23 40 32 
12 g 82 F 33 -68* Ad 
Note. All tests w-tailed ¢ tests for the difference between correlations. 
$ <.10. 
ot 
wep <.01, 


the .05 leve 
are in the p 
Wrong way. 

Separate items for each task characteristic 
Were used in «he present context rather than 
à single composite of all items. This was 
done because the average item intercorrela- 
tions are in the low-to-moderate range. Had 
these intercor:'ations been high, it would 
have indicate that the items were perhaps 
Rot referring to discriminable facets of a 
Particular job. In such a case, simply com- 


are close to significance, 5 
ted direction, and 3 are the 


would seem to be indicated. Presently, how- 
ever, the average item intercorrelation for the 


task descriptions was r= .18, and for the 


MSQ satisfaction items r= .25. Thus, using 
separate items seems reasonable. 
Table 2 shows the correlations between four 


core task dimensions (variety, autonomy, task 


identity, and feedback) and overall job satis- 
faction. The results using overall job satisfac- 
tion are not as strong as with specific job 
satisfaction measures, but the pattern of 
results is similar to that in Table 1. When 


bining all overlapping items into a composite using higher order need strength, two of the 
TABLE 2 
CORRELATIONS BETWEEN JOB CHARACTERISTICS AND GLOBAL JOB SATISFACTION 
Higher order need strength Protestant Work Ethic Urban-rural background 
Type of task 3 
characteristic High need Lew need Strong Pro- | Weak Pro- | Rural back- | Urban back- 
strength strength | testant Ethic | testant Ethic ground ground 
(n=35-39) (n=35-39) (n=35-39) (n=35-39) (n= 28-32) (n= 35-39) 
Variety a 50** 15 41* 10 -46 45 
Autonomy 50 — 09 38 21 .38* .02 
Task identity 30* 07 .33* —.05 34 .09 
ask feedback .41* 10 24 22 34 Az 


Noi 8 
27 An J05 are one-tailed / tests for the difference between correlations. 


3 
b < ios. 
p S01, 


le 


4 83 2 


y 
- 
4 
: 
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four relationships shown are significant at the 
05 level or better, and the other two are 
close (between .05 and .10) to significance. 
When using the Protestant Work Ethic mea- 
sure, none of the four relationships are sig- 
nificant, two are close to it, and two are in 
the direction predicted but nonsignificant. 
When using urban/rural differences, none of 
the four relationships are significant, only 
one is close to significance, and three are 
nonsignificant but in the predicted direction. 

The individual differences were also used 
as moderators of the relationship between the 
four core task characteristics and job be- 
havior. The results were disappointing. There 
were virtually no differences among the three 
individual differences for their usefulness as 
moderators—none of them worked any bet- 
ter than any others. In fact many of the 
correlations themselves were not significantly 
different from zero. 

Thus far these three individual differences 
have been compared to each other as to their 
effectiveness in moderating human reactions 
to job characteristics. The three individual 
differences can also be compared by inter- 
correlating all three measures directly. The 
inputs to this analysis are the composite total 
for those eight items measuring higher order 
need strength, the composite total for the four 
items from the Protestant Ethic scale, and 
urban/rural differences expressed as a 4-point 
scale (1 = large city, 2 = medium size city, 
3 = small city, 4 = rural area). The results 
show that urban/rural background is not 
closely related to either of the other two, that 
is, correlating +.12 with the Protestant Ethic 
and +.17 with higher order need strength. In 
contrast, the Protestant Ethic and the higher 


order need strength correlate +.41 (p < .01). 


Discussion 


; The present correlational study was de- 
signed to compare three different methods of 
measuring individual differences relevant for 
future job redesign experiments. The results 
indicate that the use of higher order need 
strength is the best of these three individual 
differences, followed by the Protestant Work 
Ethic which showed moderate effectiveness as 
a moderator variable, and last by the urban/ 
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rural difference which was g lly ineffec- 
tive in the analyses performe his study, 

In considering the ineffecti of urban 
rural background differences least two 
points must be kept in mind t, only one 
occupational group, telephon rators, Was 
used in the present study. juestion is 
how to classify this group collar or 
white collar. On the one ha is occupa- 
tion is a low paid, entry-leve tion—simi- 
lar to a number of blue-co!! s. On the 
other hand, this job is typic erformed in 
an office-type situation, not factory or 
plant environment, and th somewhat 
similar to a white-collar l position. 
How one classifies telephone oj»: ators is im- 
portant, because Hulin and 's (1968) 
suggestion was that urban, differences 
are important only for blu: r jobs not 
white-collar ones. The fact t yis particu- 
lar job seems to contain el ts of both 
may in itself be one reason rban/ rural 
differences turned out as did in this 
study. 

The second point to ren regarding 
the ineffectiveness of urban ifferences is 
that location when being so: d or grow- 
ing up was used in this stud location of 
work site, which was urban man and 
Lawler (1971) had some suess with this 
method, but Shepard (1970) not. ; 

As moderators of the job characteristic- 
job satisfaction relationship, ike three indi- 
vidual differences differed in ir usefulness. 


cteristic-job 
not differ. All 
quite simi- 


As moderators of the job 
behavior relationship, they di: 
were generally ineffective. Th 
lar to what Hackman and Lawler (1971, P. 
279) found in their analysis using higher order 
need strength. The only significant differences 


they found between high- and low-need- 
strength groups concerned the relationship 


between task characteristics and specific 0 
facet satisfaction or other attitude variables 
such as job involvement. One interpretation 
of this outcome for their study and the Pres- 
ent one is that job behavior was measured vä 
different methods (supervisory performan i 
ratings and occasions of absence) than ara 
task characteristics or attitude v 
(self-report questionnaire). The genere 


lower correla found for job behavior 


could be due common method variance. 
A second int ition is that the correla- 
tions were lov cause the measures of job 
performance present context are less 
reliable (test t) than the satisfaction 
measures. 

Job satisfa yas more reliably measured 
than job be! and shared a self-report 
method with neasures of job character- 


idual differences. This un- 
the general level of rela- 
in Tables 1 and 2 in com- 
behavior data. However, it 
imagine this affecting the 
n correlations as reported in 
ore reliable job satisfaction 
better test of individual dif- 
sults for job behavior are 
ially low due to less reliable 
variables. 
scognized that the subjects 
nale and all telephone op- 
neralizations to other groups 
arefully. Interestingly, the 
one job raised a question 
sis, that is, how these three 
“aces measures worked in re- 
job satisfaction. 
rediction is that those indi- 
high need strength, or 
Protestant Work Ethic, or 
Who are from ral backgrounds should have 
lower average job satisfaction than the cor- 
"sponding groups. This is because the nature 
of the operator’s job is such that a person 
With strong desires for an enriched job will 
‘ely be dissatisfied as a telephone operator. 
It is quite plausible to expect those with high 
strength for higher order needs to have lower 
Mean job satisfaction, but still report positive 
bons between enriching task character- 
Ea and their job satisfaction. The same 
ould be said for those who strongly believe 
i the Protestant Work Ethic or who grew up 
M tural areas, 
a test this hypothesis, each of the three 
ee differences measures was used as a 
‘oh aor for comparisons of mean global 
A clio and mean job satisfaction 
each of the 20 MSQ items. No signifi- 


istics and the 
doubtedly ra 
tionships rep 
parison to the 
is much hard 
differences be 
this study. T) 
measures allo 
ferences. Th. 
most likely a 
measures of 

It should 
here were al! 
erators. Thus 
must be mac 
limitation to 
for further ar 
individual diff 
lation to avera 

A reasonab! 
viduals who | 
Strong belief ir 
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cant differences occurred for either the global 
measure of satisfaction or for any specific 
facets of it as measured on the MSQ. How- 
ever, the pattern of differences was highly sig- 
nificant. For example, when high- versus low- 
need-strength groups were compared, satis- 
faction was lower on all 20 items of the 
MSQ for the high-need-strength group. When 
strong versus weak belief in the Protestant 
Work Ethic groups were compared, satisfac- 
tion was lower for the strong belief group in 
17 of 20 items. Using the sign test (Siegel, 
1956) reveals that both patterns are signifi- 
cant beyond the .001 level, using a one-tailed 
test. There were virtually no differences be- 
tween rural and urban groups. Thus, the 
results in this study roughly parallel earlier 
findings and strengthen them. 


A Developmental View of Individual 
Differences 


One way to interpret the role of individual 
differences in the present study is to view the 
concept of an individual difference as the re- 
sult of a developmental sequence. One of the 
earliest determinants of an individual’s work 
needs is the environment of his socialization. 
In the context of a rural or urban white-collar 
environment, an individual may be more 
likely to adopt a set of work values similar 
to what has been called the Protestant Work 
Ethic or middle-class work values. As a 
result of such an upbringing, an individual 
could develop a general value orientation 
toward work which emphasizes the importance 
of work in one’s total self-esteem and rein- 
forces the belief that work can hold intrinsic 
satisfaction. 

The consequence of such values is that an 
individual (when asked) is likely to state his 
own particular desires for job characteristics 
in terms similar to the higher order need 
concept. That is, an individual will prefer or 
desire a job to have large amounts of such 
characteristics as autonomy, variety, chal- 
lenge, and meaningfulness. A job will be 
desired that has task identity and provides 
the person with feedback. It is likely that such 
an individual will respond favorably when 
holding a job with such characteristics. 


~~ 


4 


rr 
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The rank ordering of individual differences 
can now be interpreted in a theoretical con- 
text. The fact that higher order need strength 
_ yielded the clearest results seems reasonable 
because it is closest to the employee reactions 
it moderates. Thus, it is also not surprising 
that the Protestant Work Ethic showed mod- 
erate usefulness as a moderator and urban/ 
rural background the least because each is 
increasingly farther removed from on-the-job 
attitudes and behavior. 

For the future, those interested in applying 
behavioral science knowledge to the practical 
problems involved in job’ redesign should be 
careful not to overlook differences among the 
very individuals whose jobs we are trying to 


change. 
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ELATIONSHIPS BETWEEN INDIVIDUALS 
AND ORGANIZATIONS 


H. RUSSELL 
coc Graduate School of M: 


JOHNSTON, Jr. 


anagement, Wake Forest University 


the Thematic Apperception Test and nondirective interviews with 39 
ionals employed by a consulting firm, the study investigates the link 


n three personality dimensions 


and the individual’s perception of his/her 


nship with the organization, Results show that individuals with active 


h task orientation perceive a m 


ore positive relationship with the organi- 


generally and with specific features of the organization than do passive 
task peers. Differences along the interpersonal dimension were not 


ant. The study raises question 
iduals and organizations. 


Much of urrent theory about organi- 
tational funci ing alludes to, but deals only 
summarily w the effects of different mo- 
dalities of ir dual personality orientation 
on behavior organizations. Specifically, 
there has be ittle attempt to determine 
Whether parti r attributes of the individu- 
als personaliiy have significant effects on the 
Nature of the r-'ationship of the individual to 
purposive orgs» izations and therefore on his 
or her behay in organizational settings. 


s concerning more effective matching of 


Sharing the conventional assumption that 


basic personality orientation influences the 
relationship between the individual and the 
organization, the present study investigated 
the nature and strength of these influences, 

The model upon which this study is based 
is that basic aspects of the individual's per- 
sonality importantly affect the way that the 
individual perceives his/her environmental 
situation, and that perception in turn influ- 
ences behavior. 


Personality 


The research reported in this article is 
attempting to test the hypothesis that indi- 
viduals faced with a common environment 
Perceive it differently, and that these differ- 
Mies m perception are strongly influenced by 
Particular dimensions of the individuals per- 
sonality, 
ae personality dimensions were chosen 
ori udy. These were activity—passivity, task 

“tation, and interpersonal orientation. 


11 


1 
Tone ests for reprints should be sent to Russ 
aston, Babcock Graduate School of Manage- 


Mi 
an Wake Forest University, Winston-Salem, North 
tolina, 27109, 


jointly 2 
determine determine 
We Perceptions | —————* | Behavior 


Activity—passivity. This dimension relates 
directly to the capacity of the individual to 
establish and maintain a satisfying and pro- 
ductive relationship with his environment. 
Robert White (1963) has stressed the im- 
portance of the individual’s capacity to take 
effective action (as opposed to feeling acted 
upon) in achieving a sense of competence or 


feelings of effectance. White develops the the- 


sis that the need to take effective action to 
achieve a sense of efficacy is an independent 
ego drive similar in strength and importance 
to the sexual and aggressive id drives. If 
White’s theory is correct, one would expect 
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that the individual's capacity for effective and 
sustained action would have an important in- 
fluence on the quality of his relationship to 
purposive organizations. 

Tesk end interpersonal orientation. These 
two dimensions are concerned with the means 
by which the individual attempts to establish 
and maintain relationships with his environ- 
The person with strong task orienta- 
tion secks to establish his sense of compe- 
tence through manipulation of the object en- 
vironment, while the person with a high in- 
terpersonal orientation relies to a greater de- 


proach capacity. 

The present study attempts to determine 
variations among individuals along 
three personality dimensions relate to 
variations in perception of the quality of the 
| relationship between the individual and the 

organization (1/O relationship). Thus mea- 
sures of the strength of the individual’s activ- 
ity and task and interpersonal orientation 
become the independent variables, and mea- 
_ sures in perception of the quality of the 1/0 
relationship are taken as dependent variables. 


MetHop 


Subjects for the study were all professional em- 
ployees of a five-year-old, rapidly growing consult- 
é ing company. About half of the subjects (19) had 

been with the organization for over three years, 

while 20 subjects had been employed for terms of 
six months to two years. 
To develop a measure of the independent varia- 
bes (scores on personality dimensions), the The- 
matic Apperception Test (TAT) was used. In se- 
_ kecting this test, the author was sharing the as- 
: sumption/conclusion of others who have used it 
_ extensively (e.g., Henry, 1956; McClelland, 1958; 
Murray, 1938) that the test enabled one to identify 
basic and enduring aspects of a subject’s personality 
as revealed in the projective fantasy responses to 
the cards. Henry (1956) described the TAT stories 
as portraying the subject’s “usual definition of and 
usual manner of responding to (such) reality events 

Ip. 10J.“ Six cards (numbers 1, 2, 7BM, 13B, 14, 

and 17BM) were selected from the set developed 

by Murray (1943). Selection was based on an at- 
tempt to balance the stimulus properties of the cards 
among the three personality dimensions of interest. 
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Subjects were given the standa: 
tell an imaginative story based on 
card. All stories were recorded 


analysis. 

A scoring manual was develope: 
lines for counting and classifying 
in the stories which related to t! 
dimensions of interest. Each 
classified (scored) on a scale oí 
the nature of activity—passivity 
The average of the scores on 
used as the measure of the 
toward activity or passivity. A 
sented a relatively active orienta! 
task and interpersonal dimensi: 
the presence and elaboration o! 
sonal imagery. Each story wa 
The sum of the task and interp. 
six stories was used as the m 
interpersonal orientation. 

TAT protocols were scored in 
trained scorers. Reliability corre! 
the task dimension, .84 for int 
for the activity-passivity dime: 

Data on the quality of the I 
gathered by conducting a re! 
interview with each of the 39 su 
ginning of the interview the su 
the study involved the relations 
to an organization and was ash: 
own relationship to the host or 
requested to particularly discus 
had occurred in the I/O relat’ 
and effects, and the adaptions 
had made necessary or desirable 
sophistication of the subjects an 
the host organization was impor! 
of the study. The subjects we: 
articulate and well educated (mo 
ate work, many in the social scien 
the atmosphere and norms witi: 
were highly supportive of open dis 
such as the individual’s feelings to 
zation. Thus all subjects were abl 
talk freely for a period of one-heli 
and a half without substantive intervention b. 
interviewer. These interviews were tape-t 

The protocols of the unstructured interview į 
subjected to a two-pass content analysis. The í 
pass was for the purpose of obtaining an exh 
list of all the types of comments made by th 
jects about the organization and their relati 
to it. This procedure resulted in a list of 264 
ments which were then grouped into 25 © 
categories. (A comment category included all 
ments by the subject about the effect of a p 
feature of the organization on his/her 1/0 
ship.) x 
A four-page scoring sheet was developed W 
included all comment types grouped within ax 
along with provisions for indicating positiv 
negative affect associated with a stat 


ny given statement a subject 
ositive and/or negative affect 
organization and/or its effect 

Descriptive statements made 
smments did not receive an 
they might have been scored 


> tape-recorded protocols was 


$, marital status, job level, and educational 
level yielded no relationships even approach- 
Mg significance. 
Nonparametric correlations were calculated 
to test the relationship between personality 
ion scores and the total number of 
®currences of active, passive, task, and inter- 


TABLE 1 
THEMATIC ÅPPERCEPTION TEST SCORES 


ReLanoxuup Berwzex Tumne 
Test axo N Ixaczey axo Consens 
OS SS EE Ee 


ich interview, At the comple- Imagery 
tie of this pa net number of positive minus Engage 
wegutive comm nade by the subject in each Withdraw 
femment cates sas used as a measure of the Task 
sabject's attitu ard that aspect of the organi- Interpersonal 
tition or I/O ship to which the comment Comments 
Gtegory relate addition, the number of occur- No. positive 
news of tas! terpersonal, active, or passive No, negative 
Geagery was 2 lated for each imagery class. Net (positive — 
negative) 
RESULTS 
The scorin TAT protocols resulted in ase for the Social Sciences (SPS 
the data (Ts 1) for the three personality — “* # <0! (one-tailed test). 
dimensions. high intercorrelations be- 
tween the ac’ passive dimension and both personal imagery, and positive and negative 
the task an erpersonal dimensions were comments within the unstructured interview. 
expected sinc. ne former dimension is con- Table 2 shows these results. 
ceived as a -ure of the individual’s capac- These results suggest that imagery used by 
ity for actin. on his environment, while the subjects in their fantasy responses to the 
the two latter ensions are viewed as chan- TAT cards is rather strongly related (in the 
dels or mean: interacting or acting upon active-passive and task classes) to the im- 
ality, A che. for relationship between the agery with which they perceive and react to- 
Sores on the three dimensions and demo- their organizational environment aad * 
phic factors such as age, company tenure, own relationship to it. Such a relationship is, 


of course, the raison d’étre for the TAT, but, 
frankly, the investigator was startled by the 
strength of the relationships on activity and 
task dimensions. These results lead one to 
suspect that the weaker relationship on the 
interpersonal dimension is attributable to 
problems in that section of the scoring man- 


Intercorrelations 


Personality dimensions Range M SD Activity aie 128 
X passivity personal 
sr r 
Activity-passivit 1.2-8.5 4.2 2.4 — 31 
ME .2-8.. f S 
ask orientation 1.022 9.4 6.0 48 5 2 
terpersonal orientation 0.017 6.2 4.1 : 
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ual or to the weak interpersonal stimuli in 
the TAT cards used in the study. 
Tue results in Table 2 also suggst that at 
de aggregate level, those subjects with strong 
activity and/or task orientations feel more 
positively about the host organization and 
their relationship to it than do their peers 
with a more passive and/or lesser task orien- 
tation. 


N (active) = 18 
N (high task) = 19 
N (high interpersonal) = 9 


The disparity in Ns on the interpersonal di- 
mension is, of course, statistically unfortu- 
hate, but there was a natural break in the 
scores from 10 to 8 which, upon investiga- 
tion, seemed to provide the best line of de- 
marcation between high and low interpersonal 
TAT protocols. 

In order to gain a finer grain picture of the 
nature of the I/O relationship for members 
of high and low scoring groups on the TAT 
dimension, each of the 25 comment categories 
Was examined. The dependent variable used 
was the average number of positive minus 
negative comments made in each category by 


* 
È (num- 


121 

bers of positive comments about “participa- 
tion in decision making” minus the number 
of negative comments about participation) / 
N; where N equals the number of subjects in 
the TAT scoring group. This statistic is re- 
ferred to as the mean positive balance of com- 
ments. Results for the 25 comment categories 
with a breakdown by personality grouping are 
shown in Table 3. (No attempt was made to 
lest for significance of difference between the 
means for high and low interpersonal scorers 
because of the small sample size of high scores 
: and the disparity between the two Ns.) 

: The data in Table 3 tend to support the 
general conclusion from the aggregate results 
Presented in Table 2 in that those subjects 
With relatively active or high task or inter- 
personal orientations showed a more positive 
: perception of specific features of the organiza- 


members of the group, for example, 
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Subjects were then divid: 
on each of the three dimer 
a score of 5.0 or greater 
passivity dimension were 
tive; lower than 5.0, Passi 
and interpersonal scores oi 
sulted in a subject being cl 
that dimension. This proce: 
following breakdown: 


tuo groapa 
Those with 


ze activity. 
ied as Ac 
milarly task 
r higher ree 
as high og 
ulted in the 


N (passive) 
N (low task) 
N (low interpersonal) 


n did theif 
ıterpersonal 
positive on 
f the differs 
wnd to be 


tion and their relation to 
more passive, or low tas! 
peers. Active subjects wer: 
19 of the 25 categories 
ences between the means 
significant at the .025 leve! 
task subjects were more pos’ 
25 categories (eight were f significant ab 
the .05 level or higher) ; h interper 
sonal subjects were higher í of 25 cates 
gories (none significant). Th ure of these 
perceptual differences is ex; in follows 
ing sections. 


Organization Climate 


Active versus passive subj: 
jects saw the organization a 
tured, open, and egalitarian. = 
experience substantial support for innovative 
ideas, recounted no unreasonable difficulty in 
pursuing projects in which they became Im 
terested, and found the atmosphere prof 
sional and intellectually stimulating. 

Passive subjects made a number of com 
ments about the pervasiveness of office po 
tics. They perceived a wide gulf betw 
managers and other professionals which P 
vented a free flow of ideas, particularly t 
flow up the hierarchy. A number of passe 
subjects objected to the chaos in the —— 
zation and complained that they were uns 
to work in such an atmosphere. b 

High task versus low task subjects. Su f 
jects who scored high on task orientation a 
pressed substantial satisfaction with the 
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TABLE 3 


ore A 


Personality grouping 


fee to which the organization’s climate fa- 
tated accomplishment of their project goals 
Permitting access to the necessary re- 

a Low task scorers generally expressed 
“isms of the climate in much the same 
mer as passive subjects. 


Evaluation of Own Performance 


Active versus passive subjects. Active sub- 
8enerally felt reasonably satisfied with 
tality of their work, with their personal 
tae development since joining the 
Yards Y, and with the recognition and re- 

which they perceived as related to 


the q 


tegory i Task | 
| Active | Pasive | wear 
N = Ne 12 
een e Low | High 
ae | | N=19 % Va 
Commitment t vation 3.7 47 R 87 “a | . 
Organizations ä 
Evaluation of « formance 3.30 | 0.2 | 2.8 06 32 | 
Personal goals : itegies 3. -03 | 23 | 06 3 
Organizational! re, systems, | ; : : 
practices -29 —4.1 3.0 — — 
Organizations] nd incentives 7 ~04 te | nos | = a4 j 
Quality of orga n’s product —12 3 -19 „ 
Fee le =3.0 -17 13 =27 | 61.) =12 
jor-subord:' relations E E hal EET 242 | -28 | igs 
Organizational © xpectations is | r0 1 09 y 
Isdividual’s ro tations 20 -04 17 03 e 
2 s 2.5 —0.9 82 —1.7 3.1 i 
or ini ao? 1.1 0 S 
Reward systen On|: =ao~ prira ey td set) omne 
tion ir ion-making —0.7 —0,2 —08 0.1 0.1 —0.6 
Kon re on th —0.3** -27 —0.6* —26 —16 —16 
atmosp! 0.1 0. 04 0.2 03 | ' 
8 orientatic —0.2 —04 —0.1 —0.5 —0.2 
engruence of i iual and organiza- 
pn expectat —0.2 -19 0.0˙ —2.1 —1.1 =li 
8 System rés 0.2 —0.1 0.1 0.0 0.1 . 
e change 01 —0.1 —0.2 GAS AEA S —0.1 
“ap dence on ior 0.1 1.1 0.3 0.9 0.9 z 
8 aith in 1/0 :elationship —0.3 —1.1 —0.6 —0.9 —07 —08 
ommitment to r re of work 
Performed 1.2 09 | 13 08 09 
lor promotion 0.3 03 04 —0.5 Cte = 
ay 2 i 
te. Tests for si nificance between the means used the ! statistic for small sample sizes with appropriate reduction in degrees 
h os aequat Ne. A one-tailed test was used. 
2 < -025, 
$ 01. 


their work. Many active subjects commented 
that their work had had substantial impact 
upon the company and/or its clients. Perhaps 
most importantly, many reported instances in 
which they had altered the work situation to 
facilitate achievement of their objectives. Pas- 
sive subjects were not only less satisfied with 
their own performance, but frequently ex- 
pressed strong resentment toward perceived 

management blunders which had prevented 
them from achieving success, resulting in 
frustration and resentment. They also ex- 
pressed far more concern over how others 
evaluated their work than did active subjects. 


À, 
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Personal Goals and Strategies 


Active versus passive subjects. This cate- 
gory proved particularly valuable in under- 
standing the differences between active and 
passive subjects. Subjects near the extreme 
activity range reported (almost unanimously ) 
a clear and well-articulated set of personal 
goals and realistic strategies for achieving 
these goals. Some envisioned a long associa- 
tion with their current employer; others saw 
their current job as a temporary relationship 
during which they planned to achieve specific 
advantages and progressions. The more pas- 
sive subjects, on the other hand, were not 
only imprecise or unclear in goal definitions, 
but many stated that their lack of goal direc- 
tion was intentional and advantageous to 
them. 


Organizational Goals and Incentives and 
Organizational Changes 


High task versus low-task subjects. These 
two comment categories are closely related in 
that the rapid growth of the host organization 
had resulted in a series of related changes in 
the organization’s structure and internal sys- 
tems. The most prominent of these changes 
involved attempts by management to gain 
sufficient control over the organization to 
insure an orderly and reasonably efficient 
pursuit of the organization’s goals as con- 
ceived (primarily) by the senior members of 
the organization. The changes (by their na- 
ture and effects) had two important results. 
First, they tended to clarify, objectify, and 
make more internally consistent the com- 
pany’s goals. Second, they tended to intro- 
duce order into the project work situation 
(task). The increased clarity of goals and the 
more systematic and methodical work pro- 
cedures were greeted with enthusiasm by the 
high task scorers who reported that they fa- 
cilitated improved task performance. Low task 
scorers tended to ignore or denegrate the fa- 
cilitative aspect of the changes and to char- 
acterize them as “bureaucratic red tape.” Low 
task subjects also perceived that organization 
goals were too narrowly conceived and re- 
lated almost exclusively to profit maximiza- 
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tion while ignoring such 


as personal 

development and socially or 1 objectives, 
Superior-Subordinate Relati ps 

Active versus passive si . While all 


negative bal- 
haracter of 
active sub- 
one impor- 
isive feeling 
ympany often 
pared to do 
tive subjects 


subject groups tended towa: 
ance of comments about 
superior-subordinate relatio 
jects were markedly differe 
tant respect. They shared a | -r 
that middle managers in t 
made mistakes or were pox 
managerial jobs, but man, 


disclosed that they had fo: r developed 
ways of compensating for king around 
these problems in order to e their own 


goals. Few passive subject ı alluded to 


constructive responses. 


Organizational Role Expe: 


Both active and high subjects ex- 


pressed feelings that the nization ex- 
pected a high degree of com nt, initiative 
and innovativeness, and ac! e to profes- 
sional standards of quality performance. 


s reasonable 
and intrinsic 
„rganizational 


They saw these expectati: 
and regarded both the extri 
rewards available within 
environment as being commer urate with the 
company’s expectations. Passive and low task 
scorers shared the same ignition of the 
manifest demands of the coi as the ac- 
tive and high task subjects. They were more 
likely, however, to regard tl expectations 
as either phony (“They say they're looking 
for initiative and innovation, 
aren't“) or unwarranted ( 
here are excessive”). 


Support for Initiative 


All subject groups showed ; 
ment balance on the category related 11 
organization’s support for initiative, T 
was, however, a very significant 6 
between active and passive subjects. Al > 
tive subjects shared the view that the n 
ponderance of informal attitudes and for ae 
procedures were generally favorable to 85 
individual exercise of initiative. Most ae 
subjects saw the situation as actively oe ; 
for, even demanding, initiative. Active ? 


a positive com- 


Per 


reported incidences in 
ercised initiative and expe- 
sponses, 


jects also fre 
which they h: 
rienced favor: 


The passiv ects were mixed in their 
responses with category. Some expressed 
recognition o rt for initiative but re- 
ported that ti id not felt a need or wish 
to take initia )thers expressed a variety 


of feelings th 
tiative was a 
tentional. A 
nouncing the 
fraud and re 
ences of them 
charge. 


apparent support of ini- 
either intentional or unin- 
were quite bitter in de- 
est support as a willful 
ed the frustrating experi- 
and others to support the 


Pressure on t h 
In this cat 
ment among 
about the ob 
high level of 
ment. Howey: 
Subjects’ citec 
ternally gener 
High task sco: 


ry there was again agree- 
abers of all subject groups 
e existence of a relatively 
sure in the work environ- 
he remarks of the active 
ch of the pressure as in- 
| and at times functional. 
agreed to some extent with 
the functional sessment, but saw the pres- 
Sure arising fror: <he nature of the work being 
A performed (project work with stringent dead- 
lines under cond:tions of high uncertainty and 
å lack of contiol over important factors). 
Passive subjects, on the other hand, attributed 
the pressure to management failures and a 
tendency to manipulate the junior profes- 
onals through inducing guilt. 


The Emerging Picture 

A pattern of responses emerges from the 

ata which takes the following form: 
Active subjects. Subjects with a relatively 
active Personality orientation perceive fewer 
Bt less Severe problems and constraints 
thin the organizational environment than 
a Passive co-workers, Perhaps more im- 
ieee they regard these problems and 
i aints as matters which called for adapt- 
a y and increased effort in order to deal 
cessfully with them. Many reported, along 
their recitation of the existence of 
ae Proactive attempts at solution. 
Ba not all reported solution attempts 
described as successful, failures were 
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generally discussed with some equanimity and 
often with intent to try again or continue 
trying. Equally notable to their relative per- 
sistence was the clarity, preciseness, and 
realistic nature of the goals and goal strate- 
gies expressed by the active subjects. 

Passive subjects. Passive subjects displayed 
quite clearly the familiar constellation of 
passive-aggressive orientation. Not only did 
they see many of the problems as more se- 
vere than did their more active counterparts 
but in dwelling on the problems, they ex- 
pressed far more bitterness and anger. Senior 
members of the firm (authority figures) were 
cited as the usual cause of problems and 
there was a greater tendency to impute ca- 
pricious and manipulative motives to mana- 
gerial acts. Goals, if mentioned, were less 
immediate and less reality oriented. The dis- 
satisfaction of -these subjects extended to 
their own performance as well as to the or- 
ganizational situation. 

High task scorers. These subjects were 
positive and explicit about aspects of the en- 
vironment which facilitated achievement of 
work objectives. They made more comments 
about specific projects, and their criticisms 
were often tied to problems related to their 
work within the company. 

Low task scorers. Low task scorers were 
not only more negative toward the organiza- 
tion and their relationship to it than were 
high task subjects, but also their criticisms 
were more diffuse and general in nature. They 
expressed a more pervasive and elusive sense 
of uneasiness and frustration with the climate, 
goals, systems and practices, superior—sub- 
ordinate relationships, expectations, etc. They 
cited fewer specific incidents which caused 
dissatisfaction and spent less time discussing 
their own project assignments. In perception 
terms, they discussed the ground rather than 


the figure. 


CONCLUSIONS 


First, it is necessary to look again at the 
nature of each of the two sets of data. The 
TATs were administered prior to disclosure 
of the subject of the study in order to pro- 
duce a set of data which revealed character- 
istic ways in which the individual’s person- 
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ality defined and dealt with reality. The at- 
tempt was to minimize the influence of stim- 
uli from the organizational situation on this 
set of data. 

The second set of data from the interview 
reflects the introduction of organizational 
stimuli by the investigator. However, the un- 
structured nature of the interview was an at- 
tempt to prevent the investigator’s own per- 
ceptions or biases concerning the objective 
reality of the setting from tainting the inter- 
view data. Thus this set of data represents 
the subject’s perceptual field of his/her par- 
ticular organizational setting as determined by 
the interaction of stimuli from both individual 
personality and “organizational reality” (as 
per the model early in this article). 

While the data presented do suffer from the 
imprecision of classification and the inability 
to isolate causation commonly found in ex- 
ploratory field studies, the results do suggest 
that the personality dimensions studied (par- 
ticularly the active/passive and task dimen- 
sions) as measured by the TAT bear a strong 
relationship to the perceptual field data 
gathered during the interview. Put another 
way, the personality variables appear to ac- 
count for (in the regression sense) some of 
the variation in affective perception of the 
organization and the I/O relationship. If, as 
phenomenological psychologists suggest, the 
perceptual field determines behavior, then the 
logical extension of this finding is that these 
aspects of personality would also account for 
variation in behavior in organizational set- 
tings. Research currently underway is at- 
tempting to test, in a variety of settings, the 
validity of this extension. Given the findings 
of the present study, it is now possible to ex- 
amine these relationships in a more directed 
and precise manner. 

The present results seem to fit the White 
(1963) “sense of competence” model quite 
neatly. The consistent reports by active sub- 
jects that they were able to have an appreci- 
able and positive (at least from their point of 
view) effect on the environment within the 
organization appear to be quite closely related 
to the greater satisfactions which they express 
with the organization and their own situation 
within it. There is qualitative and quantita- 
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tive support for the assertio! t these rela- 
tively active people are d important 
satisfactions from a sense o tance. Asa _ 
consequence they had a h regard not 
only for their own perforn but for the 
organization which provide: etting for 
perhaps even facilitated successful 
maintenance of feelings of « y. 

Passive subjects expressed ngs of being 


ier forces in 
feelings are 


acted on by the managers ai 
the organizational milieu.“ 


the antithesis of feelings of y and lead 
directly to the passive—ag e attitudes 
against the organization and omena over 


which these subjects felt r pability for 


control. 


As was mentioned earlier, e task and 
interpersonal dimensions ha do with the 
means by which the indivic lates to his 
environment in search of sa tion (where- 
as activity-passivity corre ids to the 
strength and direction of rive). High 


heir goals to 
favorably to 
which tend 
rtainty sur- 


task scorers regularly relat: 
task performance and respon 
those organizational mecha! 
to reduce ambiguity and 


rounding their tasks. They yonded favor- 
ably to assignments within Thompson 
(1967) calls the “technologies! core” of the 
organization and sought buffering from both 
external and internal sources of uncertainty. 


Existence of Multiple Climat 


Perhaps one of the most in 
admittedly serendipitous—f 
two rather distinct organ 
existed within the small organization where 
the study was conducted. One climate was 
primarily the realm of those employees who 
had joined the organization in its early, forma 
tive period. This group, despite being dis 
tributed among all hierarchical levels p l 
the firm, had maintained an organic type A 
climate which operated primarily karg 
reliance on informal systems. The newer E 
ployees who had joined the organization n 
a time when its size had forced top as 
ment to install and rely on more formal A 
ganizational mechanisms for control, © 
countered a more rigid, formal system: 


PERSO Y 


two rather distinct cli- 
nization made it possible 
elationship between the 
is and I/O perception 
\izational or situational 
purpose, the 19 subjects 
the company for over 
ssified as “first genera- 
he 20 subjects who had 
veriods ranging from six 
wo years were grouped as 
A check revealed that 
TAT scores were quite 
nge, and variance for the 
ups. Thus the personality 
dent variables) were con- 
‘lated to the generational 
‘hile generational differences in 
perception may `e attributable to shifts in 
Slection criteri:, both personality tests and 
information suggest that 
efforts at maintaining con- 
el selection had been rea- 


The discover 
mates within the 
w% investigate 
personality din 
‘oder varying 
monditions. For 
who had been 
three years we! 
tion employees 
teen employed 
months to alm 
"scond “genera 
the distribution 
Smilar in mear 
two generatione 


tes to I/O Perception 


independent variable (per- 
ationship) revealed an in- 
On the one hand, the dif- 
verception between active 


bo 


Smeration subjects displayed more positive 
"ceptions than did their more passive gen- 
“ational peers, and they did so to similar 
tees. Inference—the relationship between 
e personality dimensions and I/O percep- 
1 abpears to hold over at least some range 
Variation in situational variables. 
n the other hand, first generation active 
(or high task subjects expressed con- 
a More positive perceptions than sec- 
ien active/high task employees 
en active employees were more 
imila] on 21 of 25 comment categories). 
Alerts first-generation passive/low task 
10 were more positively disposed than 
Second-generation passive/low task em- 
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ployees on 14 of 25 categories. Second in- 
ference—organizational climate variables also 
relate to the dependent variable (perception 
of the I/O relationship). 

These findings lead one toward the tenta- 
tive formulation that, at least over some 
range of variation in organizational mecha- 
nisms (stimuli), the personality dimensions 
studied bear a steady relationship to percep- 
tion of the 1/0 relationship. However, 1/0 
perception does vary with variations in or- 
ganization climate. I/O perception is indeed 
found to vary as a joint function of both the 
personality variables studied and organiza- 
tional climate variables. 

A number of questions still remain in the 
intriguing and interesting category: 


1. Would varying the organizational setting 
over a wider range of variation narrow or 
even reverse the results? Are there organiza- 
tions in which passive subjects would express 
more positive perceptions of the I/O rela- 
tionship than would their more active peers? 
Intuitively it would seem that the answers 
would be “yes,” at least in the extremes of 
organizational inflexibility. However, the range 
of variation in organization climate in this 
study did not produce such a result. While 
increased rigidity related to less positive atti- 
tudes for active subjects, they remained more 
positive than for passive peers under the same 
conditions. 

2. To what degree do differences in per- 
sonality dimensions among subjects lead them 
to create the situations which they perceive 
(eig., Does the passive individual behave in 
his/her interactions with others in the en- 
vironment in such a way that he/she creates 
an environment, the reactions of others, that 
is objectively different in important ways 
than that faced by more active peers?) or are 
the differences reported in this study entirely 
a matter of perception? One of the current 
criticisms by Argyris (1972) of organiza- 
tional sociology is that man is treated as “a 
mechanistic closed-system. Man is not con- 
ceptualized as proactive but as passive with 
little influence on the organization [p. 72].” 
As the investigator listened to active sub- 
jects describe their 1/0 relationship, his con- 
sistent impression was that they were de- 
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scribing processes in which they altered and 
shaped their objective environment in a pro- 
active manner. The converse also appeared to 
be true. Though the processes are far more 
subtle, it appeared that passive employees, 
through their behavior, did create a more 
controlling, unyielding environment. A more 
precisely calibrated study is necessary to 
determine whether these effects of the indi- 
vidual personality on its environment do exist 
and if so what forms they take. 

3. If there are types of organizational cli- 
mates which are more suited to particular 
types of individuals, how can social science 
methods aid in better matching of individuals 
to organizations? 

4. Are there positive means to affect or 
alter the activity-passivity, task, interper- 
sonal, or other dimensions of individuals to 
aid in their adjustment to purposive organi- 
zations? Or alternatively, what dimensions of 
the organization can the manager adjust in 
order to accommodate the differences in per- 
sonality dimensions among employees? 
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Each of these questio: 
great deal of thoughtíi 
study before convincing < 
However, both our soci: 
meaning and satisfaction 
angst experienced by th 
an organization deman 
havior call for answers. 
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licy-capturing method of job evaluation, which involved the multiple 
ssion weighting of numerical job analysis data obtained using the Posi 
\nalysis Questionnaire (PAQ), was used to predict compensation rates 
o studies. In Study 1, compensation rates were derived and compared 
municipal government jobs using the policy capturing and four conven- 
methods., The rates derived using the five methods intercorrelated 
den .82 and .95. In Study 2, PAQ data and job evaluation points previ- 
y determined using a point method were obtained for 131 jobs in a utility 


any. 


A stepwise regression analysis, used to select and weight PAQ 


scores to predict the job evaluation points, yielded a multiple correla- 
of .90, Because of comparable results and lower cost, the policy-capturing 
hod was adopted in both organizations. 
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as criteria. Several investi- 


Bators (eg. Champagne & McCormick, 
1964; Finn & Lee, 1972; Madden, 1963; 
McCormick, {+anneret, & Mecham, 1972) 


have applied this approach to a range of job 
valuation probiems in which job, incumbent, 
ot judge characteristics expressed in quanti- 
lative form as predictors are analyzed to sta- 
tistically capture the policy which led to rates 
of compensation established by some job 
‘Valuation process. The predictor weights, in 
“uation form, resulting from a regression 


_imalysis presumably reflect this policy and 
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could be used thereafter to predict compensa- 
tion rates if predictor data could be obtained. 

Following the development of the Position 
Analysis Questionnaire (PAQ)* (McCor- 
mick, Jeanneret, & Mecham, 1969), a struc- 
tured job analysis instrument, data about 
the characteristics of a wide variety of jobs 
were obtained. Multiple correlations between 
various combinations of job characteristics 
and wage and salary rates ranged between .82 
and .90 for a sample of 340 jobs (McCor- 
mick et al., 1972; Mecham & McCormick, 
1969). Later, in a cross-validation study, L. 
R. Taylor (personal communication, 1970) 
obtained a correlation of .93 between existing 
salary rates and predicted rates for 79 insur- 
ance company jobs analyzed with the PAQ; 
and Harris and McCormick (1973) success- 
fully applied a policy capturing approach in 
the analysis of rates of compensation for U.S. 
Navy jobs. 

The present report extends the use of the 
policy-capturing approach to an operational 
setting involying a medium-sized municipal 
government (Study 1) and a utility company 
(Study 2), and compares the results ob- 
tained with those of several traditional meth- 
ods of job evaluation, 

The Position Analysis Questionnaire is available 
from the University Bookstore, West Lafayette, 
Indiana 47906. 
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Stupy 1 
Method 


Five methods of job evaluation were employed in 
deriving compensation rates for 19 benchmark jobs 
from the city of Boise, Idaho (population 74,990). 
The benchmark jobs were selected on the basis of 
representation throughout the compensation range, 
clarity of function, and likely occurrence in other 
municipal governments. 

The methods of evaluation included the following: 
(a) a ranking method in which the jobs were 
ordered in terms of overall worth, (b) a factor com- 
parison method in which rates of a sample of 
equitably compensated jobs were divided among 
compensable factors (e.g., knowledge required, skill, 
hazards, etc.) which resulted in a reference scale 
against which other jobs were compared and their 
rates established, (c) a point method in which the 
job characteristics were matched with various de- 
grees of compensable factors having designated 
point values from which total points and compensa- 
tion rates were determined, (d) a policy capturing 
method which predicted rates by use of a regression 
equation to weight PAQ data, and (e) a wage and 
salary survey of similar municipalities to determine 
the median compensation rate for each job. The 
survey data were viewed as a criterion against which 
the results of the other methods could be compared 
especially if the methods yielded dissimilar results. 

As a first step in the evaluation procedures, job 
descriptions were prepared or revised and coded 
using the Dictionary of Occupational Titles (United 
States Department of Labor, 1965, 1967). 

Secondly, mayors of all cities in the 48 contiguous 
states having populations within + 10,000 of that of 
Boise and which were located 50 miles or more 

from a large metropolitan area were asked to par- 
_ ticipate in the wage and salary survey. The survey 

asked for current compensation rates for 17 jobs 

and area wage survey data (United States Depart- 

ment of Labor, 1970) were available on two addi- 
c tional benchmark jobs (account clerk and secre- 
_ tary) which were adjusted to the current compensa- 
tion level by adding an 8% cost of living factor, Of 
the 31 cities contacted, 21 returned the requested 
data. 

As a third step, job analyses of almost all of the 
city’s 168 full-time jobs were performed using the 
PA by one or two incumbents, a supervisor, and 
one or two upper-division college students who 
_ worked with both incumbents and supervisors in the 
field. These analyses involved the rating of the de- 
_ gree to which each of the 194 job elements in the 
PAQ was part of the job. Interpretational assistance 
‘of job element meaning was provided at times but 
not to the extent that independence of observation 
Was violated. The PAQ data for each job were 
then averaged and job dimension (factor) scores 
were calculated from the job element scores using 
computer program located at Utah State Uni- 
_ Yersity. Wage and salary estimates were then made 
by weighting the job dimension scores using a 
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general nonlinear regressior on previously 
derived from a sample of Írom approxi- 
mately 50 different orga (McCormick, 
Mecham, & Jeanneret, 1973, 

As the final step three either five or 
six senior college students in el were asked 
to evaluate the 19 benchmark th each group 
using either the ranking, fact rison, or point 
method according to the pro outlined in a 
wage salary text (Dunn & Rac 1). The group 
involved with the factor con method was 
provided with wage and salary data on five 
jobs of their choice from amon nchmark jobs. 


Results 


The results from applica‘ 
job evaluation methods ar- 
1. 


the various 
ted in Table 


thods were 
from .82 to 
89. Thus it 
ids, includ- 
ielded simi- 


When the results from 
intercorrelated, coefficients 
95 with a median coefficien 
was generally found that al 
ing the policy capturing met 
lar results. 


STUDY 2 
Method 


In 1973 the job evaluation com 
utility in the Intermountain Wes 


e at a major 
because of 


union agreement, the systematic rt 131 classi- 
fied jobs. Many of the jobs wer: ical, craft, oF 
operative and were located in the four geo- 


though some 


graphical divisions of the com 
sions. In the 


were found in only one or two 


judgment of the committee the point method bie 
had been in use yielded generally actory results 
time, and in 


but required considerable comm 
some cases sufficiently objective j 
difficult to obtain. The committee 
costly method which would perpet 
phy of the committee but which wo 
of possible inequity. 

It was decided to conduct job an: 
PAQ for the jobs with a minimum o ; 
and job analyst (an upper-division college student 
or company personnel officer) independently oF 
pleting each analysis. Additionally, some 1 
were conducted by job incumbents. In cases wher 
the same job was found in more than one 4 
at least one analysis of a randomly selected a 
for each job in each division was undertaken. S 
jobs on which there were substantial numbers 
employees, analyses were secured from up to 
persons. ies 

Once the analyses were completed, the mean Y 10 
on the PAQ ratings were determined for each 210 
job dimension (factor) scores were computed, 4 
the mean job dimension scores were compared 175 
the dimension scores from individual analyses 0t 5 
job following instructions found elsewhere 


ormation was 
wanted a less 
the philoso- 
disclose areas 


es with the 
a supervisor 


L 


— “we al i 
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Cormick et a! ). This resulted in difference Following this reliability and screening process an 
«ores (sums © re error) across all dimensions attempt was made to “capture” the company job 
between each dual analysis and the mean evaluation policy. Job dimension scores from the 
mah sis. Also and individual scores were mean analyses for the jobs were used as predictors 
plotted as th ‘ed to each other. Whenever (in a buildup stepwise regression procedure) of the 
difference scor: ‘ded seven, the individual di- job evaluation points that had been previously 
mension scores to have had a strong relation- established by the job evaluation committee using 
ship with con tion rates in previous studies a point method. Additionally the predictions were 
(dimensions: B making decisions; D-17, com- made using the general nonlinear equation (used in 
municating ju s, decisions, and information; Study 1) and were correlated with the company 
and, 0-28, ha ‘ecision making, communication, job evaluation points. 

and social res ility) were visually inspected. 

Analyses by ir nts in general proved to be less Results 


satisfactory t! ose done by the personnel de- š z A Š : 
vort ly dor blueco oA N Table 2 contains the job dimension scores 


scores on the imensions were judged diverse, statistically selected as predictors with their 
additional PAC a were collected to determine respective weights in the order in which they 


whether the po cns within the job were actually entered the equation. As indicated in the 
different or v er differences arose because of 28 


analyst error. ional data were collected on 17 table, panes involving decision making, 
jobs by the nnel department and indicated communication, and responsibility were posi- 
corrections we: de. tively correlated with compensation rates 1 
* 
TABLE 1 
RESULTS FROM Five Jorn EVALUATION METHODS 
Job evaluation methods 
Wage and f 
Job title salary sur- Policy cap- : Factor com- Ranking 
vey (median turing Point method parison TEA 
yearly rate method (points) method (rank) 
for 21 cities) {points)* (yearly rate) 
($) 
Parking ticket clo-\ 5,296 6,881 193 6,872 5 
Meter maid 5,328 7422 180 5,3284 6 
Account clerk 1 5,616° 6,663 N/A 7,328 7 
Bibrary asst., technical processes! 5,644 6,334 213 6,477 1 
Custodian 1 5,828 4,783 120 5,958 2 
Park worker 6,004 5'306 100 6,094 3 
Caretaker, park dept, 6,418 5,786 113 5,834 4 
l keeper, head 6,672 10,667 348 11,418 13 
dann 6,683" 5870 N/A 6,809 8 
laboratory tech. /clerk, sewer dept. 8,161. 7,147 225 8,241 9 
atrolman ; 8210 9,582 330 8,2103 10 
Park foreman 8,805 9,734 285 10,131 11 
Plumbing inspector 9,374 9713 330 13,078 12 
Safety director 0617 11,584 330 13,203 14 
“enior accountant 10,004 11,580 433 14,843 1y 
a cleric 10,680 10,604 320 10,6804 18 
erintendent, sew: r 
plant sewage treatment foes 12822 333 11,958 16 
5 of police 15.580 N/A 403 15,5801 15 
Ector of financial & administra- d 
We management 16,224 15,719 493 17,148 19 


Ni 
ene not available is indicated with N/A. All data were collected in 1971. 
ý t Converted 15015 monit to yearly base. 
P m hoi te 80. ay 5 
leck wage survey Oaa Department of Labor, 1970) with an upward cost of living adjustment of 8%. 


T 
4, 
eae 
$ Polied from wage and salary survey data. 
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TABLE 2 
Jon DIMENSION SCORES AND WEIGHTS FOUND By STEPWISE REGRESSIO) 
TO Prepicr Company Jos EVALUATION POINTS 
Regressio: s 

2 Sin 

Dimension scores selected as predictors Rateachstep | predi 
of the comy B weight 

equation | eva 

| 50 
B-8 Making decisions 82 41.34 
F-27 Working on a variable vs. regular schedule 84 | = —7.70 
C-14 Using hands and arms to move/position things 86 — 9.40 
0-32 Processing information (an “overall” dimension) 87 —17.90 
B-9 Processing information (a division“ dimension) 89 10.04 
A-5 Evaluating information from things 89 | 6.30 
C-11 Using hands and arms to control/modify 90 —3.29 
C-13 Performing activities requiring general body movement 90 —4.69 
0-28 Having decision making/communication/social responsi- 
bilities 90 8.30 
Nole. The mean of the company job evaluation points for the 131 jobs studied was 227 with a standa ition of 43 which 


corresponded to monthly compensation values of $812 and $141, respectively. 
a Weights given are for equation at nine steps, constant at that step was 227. 


while those involving the use of the hands 


especially if the PAQ data w 


also used for 


tended to be negatively correlated. The cor- establishing selection proc: , Classifying 

relation of predictions using the general non- jobs, etc., and the relatively <>jective nature 

linear equation with company job evaluation of the data and the evalua nethodology. 
points was .81. 
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COMPARISONS OF SELF- AND SUPERIOR 


OF MANAGERIAL PERFORMANCE 


HERBERT G. HENEMAN IIT * 


University of Wisconsin-Madison 


Self- and superior ratings of performance on nine dimensions we: d 


from an interorganizational sample of managers by a mail questi 
was found that self-ratings possessed less leniency, restriction of d 


halo error than did superior ratings. The ratings only partially m: 
quirements of convergent and discriminant validity. Possible expla 
the results and implications for future research on managerial p 


were suggested. 


Meaningful generalizations about the nature 
and prediction of managerial performance 
(Campbell, Dunnette, Lawler, & Weick, 
1970; Korman, 1968) are difficult to make. 
This is due, in part, to the organization and/ 
or criterion-specific nature of most studies. 

5 One research strategy to overcome this defi- 
= ciency would be to identify performance di- 
"mensions common to most managerial jobs, 
approach managers themselves in different 
organizations, and obtain self-ratings on the 
_ identified performance dimensions. The self- 
_ ratings could serve as a substitute for the 
more difficult to obtain superior ratings. Such 
Procedures would facilitate the design of 
broad concurrent and predictive validation 
investigations. 

4 Questions may be raised regarding the po- 
tential usefulness of self-ratings, however, es- 
pecially relative to traditionally obtained 
superior ratings. For example, a number of 
studies (Parker, Taylor, Barrett, & Martens, 
1959; Prien & Liske, 1962; Thornton, 1968) 
have found that self-ratings tend to have 
higher mean values (leniency error) and less 
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' 
variability (restriction error) than 
do superior ratings. Such t be the case, 


tained under 
be used only 


however, if self-ratings \ 
conditions in which the; 


for research purposes (Mi 1968). 
Another important is rtains to the 

relative incidence of ha! among self- 

and superior ratings. WI might be as- 


nong perform. 
sistently higher 
atings, the evi- 
Lawler, 1967; 

Liske, 1962; 


sumed that intercorrelati: 
ance dimensions would be 
for self-ratings than super 
dence suggests the opposi 
Parker et al., 1959; Pri 
Williams & Seiler, 1973). 
Finally, the convergent 
validity (Campbell & Fis! 
ratings has been questions 
vergent validity, Miner 
that “while various levels of 
to agree, superior and self 
p. 84].” The evidence 
Nealy & Owen, 1970; Pz 
Prien & Liske, 1962; Thor: k! 
liams & Seiler, 1973), however, is more mis 
than this criticism implies. h 
Little evidence has been obtained on 15 
convergent and discriminant validity. Wa 
liams and Seiler (1973) found favorable con 
vergent and discriminant validity for self- an 
superior ratings. Lawler (1967) and 192 
and Owen (1970) found little convergen 90 
thus discriminant, validity for self- an | 
perior ratings. A 
In short, the evidence suggests that A 
ratings of managerial performance hold 85 9 
ise as a means for expanding the scop 5 
research on managerial performance. 


purpose of the present study was t? 


| discriminant 
1959) of seli- 
Regarding con- 
168) concluded 


pegit 
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F- AND SUPERIOR RATINGS OF MANAGERIAL PERFORMANCE 


work in ti ection by (a) identifying 
broad, yet ngful, dimensions of mana- 
gerial perf: e, (b) obtaining self- and 
superior pe ince ratings for managers 
employed ferent organizations under 
conditions ch it was apparent that the 
ratings cot used only for research pur- 
poses by ti estigator, and (c) assessing 
self- and sı ratings in terms of leniency, 
restriction nge, halo, and convergent- 
discriminan lity. 
METHOD 
Performance asure 
A basic p isite of managerial performance 
measurement 2e specification of significant di- 
mensions of erial activity or behavior to be 
evaluated. Ir resent study it was necessary to 
select dimens vhich were specific enough to be 
behaviorally + ingful, yet general enough to be 
applicable to agerial jobs across organizations. 
Given this ce nt, it was decided that the eight 
functional di: ‘ons of managerial performance 
identified by voney, Jerdee, and Carroll (1963, 
1964) would iseful (see below). 
This decisi vas based upon several considera- 
tions. First, dimensions theoretically represent 
activities enga in, in varying degrees, by mana- 
grs regardless specific job or position in the 
P ganization. d, research by Mahoney et al. 
(1963, 1964) is supportive of the reliability and 
validity of this xonomy. For example, a group of 
Industrial supervisors had no difficulty classifying 
100 specific performance incidents into the eight 


dimensions, and there was a median agreement of 
10% between t classifications and classifications 
by the research staff. Mahoney et al. also found 
igh agreement between self-reported estimates of 
time spent in h activity and time estimates 
derived from sampling procedures for two 
Simples of managers. Finally, it was felt that the 
Mensions, and their descriptions, would possess 
adequate face validity for the managers involved in 
© present study. 
ter specification of the dimensions, a perform- 
âne evaluation form was developed. An abbrevi- 
ated behavioral description of each dimension was 
ou from Mahoney et al. and placed on the form, 
Ba with a 7-point rating scale (1 low perform- 
‘nd to 7=high performance). A ninth dimension 
i. tating scale, pertaining to overall effectiveness, 
a t included. Performance was thus measured 
Reet of nine dimensions: (A) Planning, (B) 
(E) gating, (C) Coordinating, (D) Evaluating, 
(H) 11 nne (F) Staffing, (G) Negotiating, 


Presenting, and (I) Overall Effectiveness. 


Data Collection and Sample 
The 


rate Performance evaluation form was incorpo- 
Ito two questionnaires that were sent to a 
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sample of managers as part of a larger study. In the 
first questionnaire (self-rating) each manager was 
asked to rate his own performance on each dimen- 
sion in his present job. The second questionnaire 
(superior rating) contained identical performance 
scales. Each manager was asked to give the ques- 
tionnaire to his immediate supervisor and request 


that the supervisor evaluate the manager's perlorm- 


ance. Attached to both questionnaires was a letter 
indicating that responses would be used only for 
research purposes by the investigator and that indi- 
vidual responses would remain confidential. A self- 
addressed, stamped envelope was included with each 
questionnaire so that the manager and the superior 
could independently return their questionnaires 
directly to the author. 

The two questionnaires were sent to all master 
of business administration (MBA) graduates of Indi- 
ana University during the years 1964-1967 for whom 
current addresses were known (u = 218). A follow- 
up postcard was sent to those managers who had not 
returned the questionnaire within two weeks. A 
telephone call was then made to those managers who 
had not responded to either the initial or follow-up 
mail request. 

Usable questionnaires were received from 163 
MBAs (response rate of 74.7%) and from 121 
immediate superiors (response rate of 55.5%). 
Among these, there were 102 usable MBA-superior 
pairs of performance ratings. This later group com- 
prised the final sample in the present study. Due to 
missing data, however, final intercorrelation sample 
sizes ranged from 45 to 102. 

The managers’ average salary and length of 
service in present job were $18,386 and 3.7 years, 
respectively. The managers had held an average of 
1.6 jobs in different organizations since graduation. 


RESULTS 


To assess relative leniency and restriction 
of range errors, means and standard devia- 
tions of self- and superior ratings were com- 
puted and compared (see Table 1). While six 
mean self-ratings were lower than the cor- 
responding superior ratings, suggesting greater 
leniency error for superior ratings, a sign 
test ® suggested this was probably due to 


chance (p < .25). However, three of the self- — 


rating means were significantly lower than 


the corresponding superior rating means, using 
a mean differences test for correlated samples 


(Ferguson, 1959, pp. 138-140). 


8The sign test (Siegel, 1956; pp. 68-75) focuses 


on the direction of the differences between pairs of 
observations. Under the null hypothesis, half of the 
differences are expected to be positive and half to 
be negative, All subsequent references to the sign 
test are from the above source. 


Six of the nine standard deviations were 
larger for self-ratings than for superior rat- 
ines (p < .25, sign test). The self-rating vari- 
ances were significantly greater than the cor- 
responding superior rating variances in three 
of four instances, however, using a variance 
differences test for correlated samples (Fer- 
guson, 1959, pp. 142-142). 
Io assess the relative incidence of halo 
error, it is necessary to compare intercorrela- 
tions among performance dimensions for self- 
and superior ratings (monomethod-hetero- 
trait triangles in Table 2). There are 36 
intercorrelations in each triangle and thus 36 
possible comparisons of self-ratings versus 
_ superior ratings. The intercorrelation for self- 
ratings was less than the corresponding inter- 
correlation for superior ratings in 31 of the 36 
comparisons (p < .001, sign test) indicating 
relatively greater halo error for superior rat- 
ings. Further evidence of this is provided by 
comparing the correlations of the eight func- 
tional performance dimensions with the over- 
all effectiveness dimension for self- and su- 
perior ratings. In all eight instances, the 
correlation for superior ratings was greater 
_ than the corresponding correlation for self- 
ratings (p < .008, sign test). Evidently the 
superiors were evaluating the managers on a 
more “global” basis than the managers were 
evaluating themselves. 
Procedures suggested by Campbell and 
Fiske (1959) were used to assess the conver- 
gent and discriminant validity of the ratings. 
4 The basic requirement of convergent validity 


TABLE 1 


MEANS AND STANDARD DEVIATIONS OF SELF- 
AND SUPERIOR RATINGS 


Toni Self-ratings | Superior ratings 
dimension 
Mean | SD | Mean | SD 
Planning (A) 5.38 1.10 5.60 -96 
‘a Investigating (B) 5.49 1.16 | 5.95** | _g6** 
_ Coordinating (C) 5.56 1.10 | 5.56 1.12 
Evaluating (D) 5.62 99 | 5.60 -93 
E Supervising (E) 5.34 1.19 | 5.30 1.15 
_ Staffing (F) 4.74 1.26 | 5.58** | .o1* 
Negotiating (G) 485 | 1.43 5.49% 1.035 
Representing (H) 5.30 | 117 5.43 | 1.28 
Overall effectiveness 
E (5) 5.53 79 | 5.69 -98* 


$ Difference signcificant at p <.05, 
** Difference signifiant at 7 A 01, 


5 


Hersert G. HENEMAN III 1 


is that correlations bets elf- and superior 
ratings on each perfo > dimension be 
Statistically significant correlations are 
circled in the validity zal in Table 2. 


The validity coefficient 
Dimensions A, E, G, H 
fulfilling the convergen 
The first requirement 
ity is that the converger 
be larger than their ci 


significant for 
|, thus partially 
ty requirement, 
riminant valid. 
lity coefficients 
nding row and 


column correlations in heteromethod- 
heterotrait triangles. T} ere 144 such 
comparisons possible in and the con- 
vergent validity coeffici: ere higher in 
101 of these comparisons dition, a sign < 
test for each performan nension indi- 
cated that the number o the validity 
coefficient exceeded the iate row-col- 
umn correlations was st: y significant 
(p < .05) for Dimension E, G, H, and 
I. The first requirement s fairly well | 
met in general, and in pa r for the di- 


mensions that met the gent validity 
requirement. 
The second requiremen 


validity is that the conve: 


discriminant 
validity 00- 


efficients be larger than th: responding 
row and column correlatior the mono- 
method-heterotrait triangles. {1 essence, this 


involves comparisons of convergent validity 
to halo error. For self-ratings the validity co. 
efficients were larger in 25 of the 72 possible 
comparisons; a sign test indicated that only 
for Dimension H was the comparison signifi- 
cant (5 < .05). For superior ratings the Va 
lidity coefficients were larger in only 9 of the 
72 possible comparisons; a sign test indicated 
that the comparisons were not significant for 
any performance dimension. The data thus 
did not adequately meet the second require- 
ment, especially in the case of superior rat 
ings. 

The third requirement of discriminant Va- 
lidity is that the same pattern of intercorre- 
lations should exist in the heterotrait triangles 
of both the monomethod and heterometh ; 
blocks. This is a very stringent wan 
(Lawler, 1967), and it was not met in 
Present study. 


DISCUSSION 


Contrary to previous evidence, there 19 
a tendency for self-ratings to be less leni 
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em 


and more variable than superior ratings in 
the present study. For three of the perform- 
ance dimensions (B, F, G), these differences 
were statistically significant. These results 
may have been due to the fact that the rat- 
ings were obtained under conditions in which 
the ratings could be used only for research, as 
opposed to evaluation and decision-making 
purposes. This explanation should be consid- 
ered tentative, however, pending research 
which explicitly investigates the impact of 
different “use of ratings” conditions on self- 
and superior ratings, Sharon and Bartlett 
(1969) found that such conditions had a 
differential impact on the leniency of subordi- 
nate ratings, and it is reasonable to assume 
that such also would be the case for self- and 
superior ratings. 

Consistent with previous evidence, self- 
ratings contained less halo error than did 
superior ratings, which is another factor in 
favor of self-ratings. The evidence on halo 
error suggests a possible explanation for 
Miner’s (1968) conclusion that superior- 

_ Superior ratings have higher convergent valid- 
_ ities than do self-superior ratings. If supe- 
riors’ global assessments of subordinates agree 
with one another, and these global assess- 
ments dominate their evaluations of subordi- 
nates on specific performance dimensions, it 
is likely that the superiors will tend to agree 
with each other on each performance dimen- 
sion. On the other hand, since subordinates’ 
= self-ratings are more discriminating across 
dimensions, they are less likely to exhibit 
high agreement with their superiors’ ratings. 

The evidence was only partially supportive 
of the convergent and discriminant validity 
of the ratings. In part, large halo error in the 
superior ratings may have inhibited high con- 
vergent validity and, in turn, the ability of the 
superior ratings to meet the second require- 
ment of discriminant validity. 

It is also possible that the performance di- 
= mensions and descriptions were not very 
meaningful to the raters. Unfortunately, there 
is no way of directly assessing this possibil- 
ity. There are many other possible specifica- 
tions of broad managerial performance dimen- 
sions (see Campbell et al., 1970), however, 
Wich may warrant serious consideration in 
kuture research. 
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results in the 
> possibility of 
serial perform. 
ld be seriously 
on managerial 
esearch is con- 
m may apply 
clear that the 
arch purposes 


In summary, the pa 
present study suggests 
obtaining self-ratings of 
ance across organization 
considered in future re 
performance. Until addit 
ducted, however, this s 
only in situations where 
ratings will be used fo 
only. 
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[S OF HEAD RESTRICTION ON DRIVERS’ EYE 


)VEMENTS AND ERRORS IN SIMULATED 


correlated with driving errors. 


In their s 
havior, Smith 
the automobi 
fon of the di 
the systems c 
Wehicle be c 
Seems to be 
uate driver 


ms analysis of driver be- 
plan, and Kao (1970) view 

s a rigid, wheeled exoskele- 
Such a view implies that 
cteristics of both driver and 
uent, and, in general, this 
(Schroeder, 1971). Ade- 
‘formance greatly depends 
»tual-motor coordination and 
the processing of informa- 
-ving environment. Synchrony 
between eye, he and foot are critical varia- 
bles for proper operator performance. Smith 
and Putz (1970), for example, have shown 
that coordinat eye and head movements 
irony for accurate tracking 
avior, Feedback delays as 


system were found to 
atply decrease subsequent steering adjust- 


Ments, 


Research by Bartz (1966) on humans and 


lore recently Bizzi, Kalil, and Tagliasco 
— 


= 
oe research was conducted as part of a larger 
ct entitled, Driver License Road Testing (DI- 


1 (002)) contracted with the North Carolina 
n Dartment 


of Motor Vehicles and sponsored by 
dle North 
ce 


Carolina Governor's Highway Safety 


ice, 
= Requests for reprints should be sent to John 
ay 875 Jr., University of North Carolina High- 

oe Research Center, Chapel Hill, North 
lina 27514 
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oups of 15 subjects tracked a segment of the Aetna Life Insurance 
ny training film, Trafic Strategy, by manipulating the controls of an 
Drivo-trainer station, One group had their heads restricted during the 
the other did not. Restriction of the head resulted in more driving 
ind fewer (a) long eye movements, (b) driving adjustments to critical 
to the left of the driver’s field of view, and (c) fixation errors, espe- 
overshoots. Eye movement frequency was positively correlated with 
adjustments and negatively related to driving errors. It was concluded 
a) driving performance is better when the eye-head compensatory sys- 
able to move freely, (b) head restriction may bias the distribution of 
yn locations, and (c) the frequency of shifting fixations may be nega- 


(1971) on monkeys has begun to elucidate 
the complex relationships between eye and 
head movements during visual scanning. When 
a stimulus appears in the periphery, the 
normal sequence of events is a centrating 
saccade, a following movement of the head, 
and then a compensatory eye movement 
which adjusts for traversal of the head so 
that the stimulus is fixated foveally. Bartz 
(1966) demonstrated that the initial saccade 
begins before the head movement and that 
the compensatory saccades are completed be- 
fore completion of the head movement. 
Furthermore, the angle of rotation of the 
head and that of the compensatory move- 
ment always sum to yield the appropriate 
foveal fixation. This reciprocal relationship 
holds for a wide range of angular displace- 
ments of a stimulus in the visual field. 

Bizzi et al. (1971) have replicated Bartz’s 
results by measuring EQGs (electroculo- 
grams) from the eye and EMGs (electromy- 
ograms) from neck muscles. They observed 
that during eye-head turning, neural activity 
reaches the neck muscles before the eye 
muscles, and that all agonist neck muscles 
are activated simultaneously regardless of ini- 
tial head position. Since overt movement of 
the eyes precedes head movement, the com- 
mand for the eye-head sequence must be a 
central neural one, even though it does not 


specify the serial order. 
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In another experiment (Bizzi et al., 1971), 
‘a head restraint with a ſast-action brake was 
used, which allowed the head to be stopped 
alter a saccade was begun but before it was 
s completed. Compensatory eye movements 
were not necessary in this case and in fact 
did not occur, It was concluded that com- 
pensatory eye movements are not initiated by 
a central neural command, but are reflex re- 
_ sponses resulting from movement of the head, 
perhaps vestibular in origin. 

In driving, a substantial proportion of eye 
movements are compensatory in nature, yet 
most eye movement recording systems have 
= restricted the head to some degree (e.g., 
Mourant & Rockwell, 1970; Whalen, 
_ Rockwell, & Mourant, 1968; Zell, 1969). 
It is possible that in such situations the 
oculomotor system may display a different 
search pattern than in those situations where 
_ the head moves freely and where the usual 
_eye-head sequence can occur without inter- 
ference. In situations where only the analysis 
_ of the content of fixated material is of in- 
terest, head restriction may not be a critical 
Variable. However, head restriction may have 
important effects when criterion performance 
involves the relationship of eye movements 
to vehicular adjustments. The purpose of this 
study was to examine the effects of head 
restriction on eye movements and the result- 
ing driving performance. 
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Ficure 1. Schematic drawing of eye movement laboratory. 
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a vehicular adjust: 


RisreicrioN Errects on D Eve Movements axp Kuno 


awot was defined ¿s one in which failure to 
went would have resulted in 
accident in the real driving situation. 


the expected vehicular 


The criterion for an acceptable adjust- 

was simply that appropriate to the dan- 
situation. For «x ric, if the situation called 
the driver to com a complete stop, decel- 
a low speed without stopping would be 

red inappropria the situation called for 
pulling to tb cht and the subject re- 
by lazily tur the car to the right, the 
was consider propriate, although in a 
situation it may have prevented an acci- 
. Errors were deñ ıs failures to make steer- 
accelerating, dece! ing, or braking adjust- 
within the allot: me. Recording lasted for 
nds for each da us event since all events 
15 seconds or | Thirty-nine expected ve- 
adjustments v distributed among nine 

ous events. Driv crors and total vehicular 
mts per sessior luded those of steering, 
tion, decelerati« ind braking movements. 
add realism, sounds from suburban 
were tape- res und played through the 

s speaker. S volume was controlled 


Movement res; 
Mequency of long 

Short (les than < 
shifts in fixation 
n errors (both over 
ch event. 


When a subject ent: 

Ml at the Drivo-T: 
ed on the screen 
s followed: 


the recorder. 


measures included the 
visual angle or more) 
«ree visual angle) ampli- 
the frequency of fixa- 
its and undershoots) for 


d the testing room he was 


ver such that his eyes were 
\ set of tape-recorded in- 


We are going to show you a short film on this 

i Your job is to track the film as best 
can by adjusting the controls of your car. 
St drive as you normally would on the high- 
First I will calibrate the eye movement 
» then I will run the film, then recali- 


fr adjustment and calibration of the eye move- 
recorder, the experimenter left the room. The 
Was darkened during the film. Immediately 
Owing the film, recalibration was performed, and 
Session was ended. 

t of the subjects were run with a head and 
fest, that is, the head restriction group. The 
Were run with no head and chin rest, that 
no head restriction group. The subjects were 
mly assigned to one of the two groups. 


RESULTS 

g Adjustments 

analysis of variance was used to com- 
the total numbers of right and left turns, 
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let ern — 
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Ficurn 2. Mean driving adjustments as a function of 
head restriction and type of driving adjustment. 


accelerations, and brake actions of the head 
restriction and no head restriction groups. 
There was a significant effect of both head 
restriction (F = 11.18, df = 1/28, p < .005) 
and type of vehicular adjustment (F = 17.64, 
df = 3/84, p < .001). Their interaction was 
not significant however, As can be seen in 
Figure 2, the primary significant effects of 
head restriction were on left turns and accel- 
erations plus decelerations. 


Driving Errors 

An analysis of variance of driving errors 
revealed that errors differed significantly as 
a function of the type of vehicular adjust- 
ment (F = 40.64, df = 3/84, p < .001) but 
not as a function of head restriction. Once 
again the interaction between the two varia- 
bles was not significant. 


Eye Movements 


An analysis of variance comparing total i 
numbers of eye movements per subject for 


the head restriction versus no head restriction 
groups yielded no significant difference. 


(Mean eye movement frequency per subject 


was 342 and 353, respectively.) 


=e. = 


J. A. uu, Je 


= 


Heod 
Restriction 


No Heed 
Reitration 


M Fieve 3. Mean percentage of jong excursion eye 
Movements as a function of head restriction. 


Since head restriction appeared to have a 
Strong effect on turning left, long left and 
right excursions of the eye (ie, 5-degree 
visual angle or more) were totaled for each 
F group and subjected to an analysis of vari- 
» ance. The main effects of head versus no head 
| restriction and looking right versus looking 
leit were not significant, but their interaction 
Was (F = 6.5, dj = 1/28, p < .025). In Fig- 
ure 3 it can be seen that with head restric- 
on, subjects made proportionally fewer long 
De movements to the left than to the right. 
This difference was highly significant for the 
head restriction group (F = 10.30, df = 1/ 
d 28, P < 001) but not for the no head re- 
striction group. 

Fixation errors (i.e., undershoots and over- 
hoots) of the two groups were compared by 
s of an analysis of variance. Although 
main effects of head restriction and type 
T Of fixation error were not significant, their 
interaction was (F= 5.54, dj 1/78, p< 
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05). Head restriction ule fewer 
overshoots (F = 5.47 p< Of 
Relationship of Eye M Driving 
Adjustments, and Error 

To examine the relat tween eve 
movements, driving adj nd driving 
errors, several correlat omputed 
from the data. For bot combined 
the rank order correlatior total eye 
movements and total dr onses for 
each subject was +.45 between 
total eye movements ar errors it 
was —.75 (p < .001) 1 fixation 
errors and driving error TE 
001). Since the correlat ave cum 
tained artifacts due to the two 
groups, the same correlati erformed 
on each group separate!) ions be- 
tween total eye movemen driving 
responses were significant groups 
(head restriction, p = .56 no head 
restriction, p= .47, p< the head 
restriction group the c between 
total eye movements and d rors was 
—.05, whereas for the n stricuion 
group it was —.75 (p< vilar re- 
sult was found for driving fixation 
errors (ie,, head restricti 9; no 
head restriction, p = —.71, p 

Discussion 
In the present experiment iction of 


the head was associated with icrease in 
driving errors and a signific ecrease in 
vehicular adjustments (especially accelera- 
tions, decelerations, and left turns). bie 
result was not due to a disproportionately 
small number of critical events occurring t0 
the left of center in the film, since an exami- 
nation of the film revealed that slightly * 
dangerous events developed to the left ol 3 
driver’s line of sight. With respect to pee 
adjustments, an examination of the wee 
indicated that subjects with no head restrain 
made many more adjustments than were nec 
essary to successfully negotiate a given pas 
while subjects with their heads restricte 
usually made only the minimum necessa": 
and sometimes less. 


c movements to the left 
ight. This finding comple- 
t subjects with their heads 
ver left turning adjustments 
without restriction, It is 
result was due to differ- 
cal restraint, since head re- 
was the same as that to the 


conditions of the present 
different, the results for the 
subjects agreed with those 
periment by Whalen, Rock- 
int (1968). Using a head- 
n an instrumented car, they 
estriction subjects tended to 
fixations to the right of the 
le driving on the open road 
traffic. The data for the no 
zroup did not follow this 
sible that when the head is 
ts tend to concentrate on 
related to information on 
f the road (eg. lane posi- 
ng, road sign information, 
ense of scanning a broader 
perhaps is more typical of 
inrestricted head movements. 
gs with fixation errors support 
of Bartz (1966) and Bizzi et al. 
S71). Under conditions of no head restric- 
fixation errors following long excursions 
“e equally distributed between undershoots 
=t Overshoots. On the other hand, restric- 
n of the head greatly decreased such move- 


might arise as to why any fixation 
s5 occurred at all. In this regard, Bizzi 
(1971) had noted that complete elimi- 


meg oth, Kaplan, and Metlay (1964) have 
d that eye movement rate is an index of 
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nation of compensiory eye movemmais ox 
curred only when the location of the atlamdes 


The fact that eye movements were found to — 
relate significantly to both driving adjust. 
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10 
2 
gany 
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found that subjects who shifted their fixations 

more frequently detected many more 

In the present study, the fact 1 
i errors 


LF 
eft 


to the left of the driver's field of view. These 
results have several implications for automo- 
bile driving and general highway safety. First, 
the optimal scanning strategy during driving 
should be one in which the head remains fixed 
such that peripheral targets have to be de- 
tected by long eye movements alone. Second, 
eye movements data gathered from recording 
systems which restrict normal head move- 
ments should be interpreted with caution. The 
present evidence suggests that head restric- 
tion may bias the distribution of fixation lo- 
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berate consideration-satisfaction relationships. 
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two research and development departments, 
found influence to be a moderator of the leader 
behavior-employee satisfaction relationship in 
only one of them. The lack of effect in the second 
group was hypothesized to be a function of 
Waters (1965) concept of onganizationally mar- 
ginal status, arguing that people were of high 
enough status so as not to be dependent on their 
superiors for need satisfaction, 

The hypothesis suggested by some of the con- 
tradictory influence findings is that subordinate — 


haviors of influential leaders on 
faction, In other words, the facilitation of hier- 
archical influence on the consideration-employee 
satisfaction relationship (Pelz, 1952) is a fune- 
tion of subordinate independence (House et al., 
1971; Wager, 1965). Based on suggestions of 
past research, we predict a stronger relationship 
between considerate leader behavior and subordi- 
nate satisfaction for leaders who are seen as 
being influential and subordinates who are de- 
pendent, than the relationship between leaders 
who are seen as being influential and subordi- 
nates who are independent, For the low. influence 
leader no predictions are made other than testing 
for the effects of different subordinate character- 
istics. 

“Independence,” as used in the present study, 
refers to an individual's reported “need for inde- 
pendence”—an individual characteristic. Neither 
the Wager (1965) nor the House et al. (1971) 
study suggested subordinate independence spe- 
cifically as a personality trait; in fact, they seem 


A 


io suggest that it is more situation specific than 

the more pervasive variable investigated in this 

study. However, some researchers have argued 

that independence as a personality variable is a 
relevant subordinate characteristic for purposes 

of understanding superior-subordinate relation- 
ships (e.g., Argyris, 1957; Vroom, 1960). It was, 
therefore, thought productive to investigate inde- 
pendence in the present context of studying 
subordinate personality as a moderator of the 
leader behavior-employee satisfaction relation- 
ship. 


METHOD 


Questionnaire data were collected from 91 
nonsupervisory employees of an electronic de- 
vices manufacturing company. Questionnaires 
were administered in the company cafeteria on 
company time. 

Leader consideration scores were obtained using 
10 items from Form XII of the Ohio State 
Leader Behavior Description Questionnaire (Stog- 
dill, 1965). Leader hierarchical influence was 
measured using a scale of items developed by 
Comrey, Pfiffner, and High (1954) and Wigdor 
(1969). The scale measures upward influence of 
the leader only with respect to matters concern- 

ing the personal management of the subordinates 
and participation in policy decisions. The items 
used to measure the need for independence 
(Vroom, 1960) were meant to reflect the indi- 
_ yidual’s predisposition to strive for self-reliance 
and to do things alone without help. Satisfaction 
measures were obtained for three facets of satis- 
faction which were related to areas affected by 


8 
TABLE 1 
_ RELATIONSHIP BETWEEN SUPERVISOR CONSIDERATION 
AND THREE Facets or EMPLOYEE SATISFACTION 
FOR GROUPS WITH HIGH- AND 


| Low-INFLUENCE LEADERS 

3 Employee satisfaction categories 

Supervisor influ- 

2 ence level Advance- A 
Mett Pay Security 

Low hierarchical 

A influence 25 (39) | .30* (37) | .30* (39) 

5 High hierarchical 

influence 315 (45) | .30* (45) 41% (45) 


Vote. Numbers in ses i 
> + Nu parentheses represent ns on which correla- 
ier coefficients are based. Respondents for whom data enter- 
= ae 90 5 l were manr, were dropped from the 
"4 analysis accounting for diff 8 
(Fronps and tow Greene erences between sub. 
„ $05, one-tailed test. 
2 $ <.01, one-tailed test. 
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TABLE 


RELATIONSHIPS BETWEEN Sur) 


CONSIDERATION 
AND THREE Facets or Eu SATISFACTION 
FOR DIFFERENT COMBINA r LEADER 
INFLUE. TE 
| Employee n categories 
Levels of | 
independence |—— 
5 | Advance- 
influence | Mien Security 
Low influence | .54** (20) 46* (20) 
Low independence 
Low influence —.06 (14) 4) 16 (14) 
High independence 
High influence —.03 (11 11) % (11) 
Low independence 
High influence 33* (28) 8) | .62** (28) 


High independence 


Note, Numbers in parentheses repr 
tion coefficients are based. Respond 


1 which correla- 
om data enter 


ing the correlations were mis: 1 from the 
particular analysis accounting between sube 
groups and total sample size. 

*p <.05 one-tailed test. 

p <01. 
the hierarchical influence of (a) pay, 


These scales 


(b) advancement, and (c) 
House, and 


had been previously used by 
Lirtzman (1970). 


tn 


RESULTS 
The sample was split at the an to form 
high and low supervisor influence groups, and 
correlations were computed between the super- 
visor consideration scores and è of the three 


facets of employee satisfaction. 


As can be seen from Table 1, the moderating 
effect of influence found by past researchers was 


not supported in the present Relation- 
ships between consideration and satisfaction ye: 
approximately the same for both high an 


low hierarchical influence groups. Most of these 
relationships were significant. i 

To test the effects of different subordinate- 
superior groupings on the aforementioned rela- 
tionships, the sample was dichotomized on the 
influence and independence variables simultane- 
ously (the variables were tested and found to be 
relatively independent of each other 80 that 
placement on one variable did not determine a? 
individual’s placement on the other). Table 
reports the correlations between consideration 
and satisfaction for the resulting groups. 

As shown in Table 2, when high and low influ- 
ence are moderated by subordinates’ need m 
independence, there is a drastic change or qua? 
fication of the findings in Table 1. We find 7 
relationships between consideration and satisfa 


SHort Nores 


tion for two \ iferent groups (low influence- 


low indepen high influence-high indepen- 
dence), and lificant relationships between 
tonsideratior itisfaction for the other two 
groups. 

DISCUSSION 

The pred made for the high-influence- 
low-independ group, suggested by previous 
research, w onfirmed. In fact, the predic- 
tion was re with the high-influence-high- 
independen up exhibiting strong positive 
relationship le the high-influence-low-inde- 
pendence gi xhibited no significant relation- 
ships. These its should alert us to the possi- 

| bility that r ed findings of weak or moderate 
relationship y be masking the existence of 
two or mor ulation subgroups which exhibit 
Widely diffe elationships. 

The findi f strong consideration-satisfac- 
tion relation s for the low-influence—low-inde- 
pendence he high-influence-high-indepen- 
dence group m to suggest a need for further 
research to er understand the processes which 
account for ‘nese subgroups’ congruency, One 
possible exp tion involves a further concep- 
tual develop t of the independence and influ- 
ence measur:: Perhaps the need for independence 


also expresses need to influence one’s own life 
by affecting es outcomes in a situation. The 
considerate, =o wardly influential, supervisor can 
act as a facilitator of this self-determination 
Process and ‘hus satisfy his employees. Con- 
versely, the supervisor who lacks upward influ- 
ence may attempt to exercise greater control or 
influence downward. If he does this in a consid- 
erate manner may fill a satisfying, paternal- 
Stic role for dependent subordinates, while sti- 
fling the attempts at self-determination on the 
part of independent subordinates. While such 
explanations are obviously post hoc, the findings 
Suggest that such interactions of leader and 
Subordinate characteristics should be more 
thoroughly investigated and perhaps even replace 
the unidimensional studies of leader behavior 
that are then generalized across populations. 

Further, contradictory findings in the area of 
leadership should not necessarily be viewed as 
the Weakening or failure of the construct being 
Studied, but rather, they could be viewed as a 
failure to sufficiently elaborate the construct or 

Cory to account for different behavior on the 
Part of different population subgroups. 

Using the often researched relationship between 
“nsideration and satisfaction, we showed this 
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relationship to vary for groups of subordinates 
having differing levels of need for independence 
and supervisors described as having different de- 
grees of hierarchical influence. These findings 
have the potential for resolving the previous 
contradictory findings pertaining to the leader's 
influence by demonstrating that the same level of 
influence on the part of a leader can be accom- 
panied by different relationships between leader 


behavior and subordinate satisfaction, depending 


on the characteristics of the subordinates. To the 


extent that Pelz (1952), Wager (1965), and the 


House et al. (1971) studies dealt with popula- 


tions of differing characteristics, they could well 


have produced seemingly contradictory findings. 
The elaboration or qualification of some of the 


present leadership concepts or constructs may be 


helpful in consolidating this area of abundant but 
inconsistent findings. 


REFERENCES 


Argyris, C. Personality and organization. New Vork: 


Harper, 1957. 

Comrey, A. L., Pfifiner, J., & High, W. S. Factors 
influencing organisational effectiveness. Los Ange- 
les: University of Southern California, 1954. 

House, R. J., Filley, A. C, & Gujarati, D. M. 
Leadership style, hierarchical influence, and the 
satisfaction of subordinate role expectations: A 
test of Likert’s influence proposition. Journal of 
Applied Psychology, 1971, 55, 422-432. 

Pelz, D. C. Influence: A key to effective leadership 
in the first-line supervisor. Personnel, 1952, 29, 
209-217. 

Rizzo, J. R., House, R. J., & Lirtzman, S. I. Role 
conflict and ambiguity in complex organizations. 
Administrative Science Quarterly, 1970, 15, 150- 
163. 

Sanford, F. H. Authoritarianism and leadership. 
Philadelphia: Institute for Research in Human 
Relations, 1950. ; 

Stogdill, R. M. Manual for the Leader Behavior 
Description Questionnaire, Form XII. Columbus, 
Ohio: Ohio State University, Bureau of Business 
Research, 1965. 

Vroom, V. H. Some personality determinants of the 
effects of participation. Englewood Cliffs, N.J.: 
Prentice-Hall, 1960. 

Wager, L. W. Leadership style, hierarchical influ- 
ence, and supervisory role obligations. Adminis- 
trative Science Quarterly, 1965, 9, 391-420. 

Wigdor, L. Effectiveness of various management and 
organization characteristics on employee satisfac- 


tion and performance as a function of employees 3 
need for independence, Unpublished doctoral dis- 


sertation, Bernard M. Baruch College, City Uni- 
versity of New York, 1969. 


(Received September 17, 1973) 


= menters, 


Journal of Applied Psychology 
1974, Vol. 59, No. 5, 652-655 


DISPARAGEMENT BY A SUBORDINATE AS AN INI 


hi 


NCE 


ON THE USE OF POWER 


EUGENE M. FODOR 2 


Department of Social Sciences, Clarkson College of Technol 


The purpose of the present investigation was to determine whether 

tion of rewards by student supervisors in an industrial simulation 1 
enced by a situation in which a member of the work crew repeat is- 
paraged their competence. Disparagement did not appear to affect t er- 
visor’s allocation of rewards to an ingratiator, but it did result in a y 
to favor a compliant worker who was not an ingratiator. Under 

stances where he found himself the object of denigration by onc is 
subordinates, the supervisor gave the compliant worker higher y 
ratings on Tinker Toy models designed during the simulation ( p< n 
he did in the control condition of the experiment. 


The present investigation was the most recent 
in a series of experiments by the author dealing 
with use of power by appointed student super- 
visors (Fodor, 1973a, 1973b). The general pro- 
cedure involved an industrial simulation which 
was suggested in part by studies done by Good- 
stadt and Kipnis (1970) and Kipnis and Vander- 
veer (1971). 

Under conditions of disparagement by a sub- 
ordinate, student supervisors were expected in 
their evaluations to favor more highly both an 
ingratiating worker and an otherwise compliant 
worker who was not an ingratiator than they 
would in the absence of these conditions. 


METHOD 
Subjects 


Experimental participants were men recruited 
from the general psychology course. All subjects 
were given course credit for taking part in the 
experiment, and no one refused to cooperate. Of 
the subjects, 14% did not believe in the existence 
of workers in the next room and were conse- 
quently disregarded for purposes of data analy- 
sis. Sixty subjects remained—30 in the experi- 
mental and 30 in the control group. 


Procedure 


When subjects arrived at the laboratory, they 


were told that they would act as Supervisors in 


The research reported herein was supported by 
Grant MH 21568-01 from the National Institute of 


Mental Health. Grateful acknowledgement is ex- 
pressed to Dean 


Bailey, Thomas Ingram, Francis 
Sharp, and John Quintana who served as experi- 

2 Requests for reprints should be sent to Eugene 
M. Fodor, Department of Social Sciences, Clarkson 


College of Technology, Potsdam, New York 13676. 


cting and at- 
of three high 
whose task it 
from Tinker 
consructed on 
tudent super- 
orkers had al- 
models they 
ore the begin- 
ze supervisors 
the industrial 
them that a 

t was to assess 
It was under- 
receive a base 
e sign posted in 
him to grant 
crease in pay 
e workers were 


an industrial simulation 
tempting to facilitate the 
school students in the nest 
was to create new mode!“ 
Toys. A different model was 
each of four, five-minute t 
visors were informed that 
ready been thinking abou! 
would design for 15 minute 
ning of the first trial. To m 


simulation, the experiment 
major purpose of the exper 
their level of managerial abilii 
stood that the workers would e 
pay of $1 for their work. A 1: 
front of the supervisor authorized 
each worker a 10¢ increase or 
at the end of any trial. The th 
entirely fictitious. 

In communicating with workers, the super. 
visor could say anything he wished; he could 
give directions, advice on construction, feedback 
on awards of pay increase or decrease, encour- 
agement, or reprimands. 3 

Supervisors communicated with the workers 
by means of a one-way communication device 
comprised of a microphone and also of a switch 
and light signal for contacting individual work- 
ers. Communications allegedly coming from 5 
various workers to the supervisor consisted o 
preprogrammed, hand-written notes. The ae 
menter explained that the supervisor could not 5 
allowed to see the workers or to receive voice 
communication from them because the sensory 
impressions of the various workers thereby 
acquired might bias the supervisor's ovale 

The model designs attributed to the woe 
were independently assessed for level of creativ 
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> judges prior to the experi- 
re distributed across workers 
he mean and exact distribu- 


ity by each 
ment, and mo 
in such a was 


tion of creat scores for each worker, as 
determined by idges’ ratings, were identical. 
Immediate each trial, the experimenter 


in the initial instructions as 
stant) brought to the super- 
structed by each worker on 


(who was de 
the supervis« 
visor the m 


that trial a id him what structures the 
models were ned to represent, The assistant 
at that time gave the supervisor any notes 
allegedly wri by the workers. There were 
two conditio: the experiment, In the experi- 
mental condi Man A wrote a series of com- 
ments which e highly critical of the super- 
visor. On Tr he said, “These models are 
tough to put her and you're not very much 
help.” On T he declared, “You’re a lousy 
supervisor.” í rial 4, “I hope you’re not plan- 
ning to be a ervisor when you finish college. 
You're not v sood at it.” 

The comn of Man B were designed to 
make him an `ngratiator in the sense in which 
that concept been treated by Jones (1964). 
On Trial 1, 1 B wrote, “I hope you’re happy 
with my wor m doing as well as I can.” On 
Trial 3 he commented, “I like the way you 
supervise—I ught you’d boss me around, but 
you haven't. n Trial 4, “You’ve been a good 
Supervisor,” 

Man C wrote notes which were essentially 


neutral in ch: 
You tell us ey 


cter. On Trial 2 he asked, “Can 
y so often how much time we 


have left?” Tial 3 produced the comment, 
“These tinker toys are always falling on the 
floor.” On Tria! 4 he requested permission to 
smoke, 


Comments were so designed that each worker 
Wrote approximately the same total number of 
Words throughout the experiment. Also, the same 
Person always wrote notes for Man A, another 
for Man B, and another for Man C, 

The control condition had Man A, B, and C 
constructing the same models in the same order 
‘Sin the experimental condition. Man B made 
the same ingratiating comments and Man C the 

ame neutral comments. Instead of disparaging 
the Supervisor, however, Man A, like Man C, 
Wrote a series of neutral statements. On Trial 1 
Eed “Pye never done anything like this 
before. It's kind of different.” On Trial 2 he 
commented, “You've got to be pretty careful 
bout these models, They keep falling apart.” On 
the last trial he asked, “This is the last trial, 
‘nt it?” The control condition differed from 


Suort Nores 


TABLE 1 


MEAN Pay INCREASES, PERFORMANCE EVALUATIONS, 
AND CREATIVITY RATINGS GIVEN COMPLIANT 
Workers 


| Evaluation dimension 


Condition Pay Performance | Creativity 
Man | Man | Man | Man | Man | Man 
0 1 c 
Disparagement | | f 3 = 
M 14.0 | 19,3 | 24.1» | 27.78 | 24.8% | 28.000 
SD 9.52 | 9.80 | 4.94 | 3.19 | 3.72 | 2.91 
Control | f | | 
M | 15.3 | 15.0 | 25.2 | 25.0 | 23.1 25.3 
SD | 10.59 | 10.09 | 5.68 | 5.58 | 5.08 | 4.06 
I I I I 
* These means are significantly different from each other, 


p <.05. 
Bb means are significantly different from each other, 
p <.01, 
e Thee means are significantly different from each other, 
p < 0i. 


the experimental only in the absence of any 
disparagement of the supervisor. 


Dependent Measures 


The supervisor’s response to the three workers 
in the two conditions of the experiment was 
measured in three different ways, First, he was 
asked at the end of each trial to rate all models 
constructed on the degree of creativity he saw 
reflected in their design on a scale ranging from 
1 (low creativity) to 9 (high creativity). Sec- 
ond, he was allowed to give pay increases at the 
end of each trial. And lastly, at the conclusion of 
the four work trials the supervisor was led to 
another room and requested to evaluate the per- 
formance of each worker on four 9-point scales: 
(a) worker's ability, (b) worker’s overall worth 
to the company, (c) willingness to rehire worker 
for a second experiment, and (d) recommenda- 
tion that the worker be promoted to supervisor 
jn a future experiment. For purposes of analy- 
sis, these four scale scores were summated to 
obtain a single overall score. 

Upon completion of the experiment, all sub- 
jects were told the true purpose of the experi- 
ment, carefully interviewed to determine whether 
they suspected the nonexistence of workers in 
the next room, and then pledged to absolute 
secrecy. 


RESULTS 


Table 1 shows the mean reward level ac- 
corded Man B, the ingratiator, and Man C, the 


noningratiating compliant worker, in each condi- 


tion of the experiment. The higher the mean for 
any of the three dependent measures in the dis- 


4 
p 


TABLE 2 
MULTIVARIATE AND UNIVARIATE ANALYSES OF 


VARIANCE 


Suort Nores 


Univariate analyses 


Multi- F values 
Source variate | 
F value | 
| [af | Gr 
. | Perform- — 
| Pay ‘ance 1 
Disparagement vs E | 
trol dition 
A) 3 8.70% | 1 66 85 8.33 E 
Difference pores 
mpliant workers 
<B) z | 14.61 | 1 | 1.83 3.38 14.04% 
A XB | -2.25 12.37 4.15* AS 
I 
sp <.05. 
“p 201. 


paragement as compared with the neutral condi- 
tion, the greater was the impact of experimental 
manipulation in influencing supervisory distribu- 
tion of rewards. : 
Table 2 presents a multivariate analysis of 
variance for identifying significant effects of 
treatment and worker on the three dependent 
measures taken together. In accordance with the 
recommendation of Hummel and Sligo (1971), 
this multivariate analysis is followed by uni- 
_ variate analyses on each separate dependent 
measure to trace the source of significant multi- 
variate effects. The multivariate analysis dis- 
closes significant effects for both treatment (F 
= 8.70, df = 1/116, p<.01) and worker (F = 
14.61, df = 1/116, p < .01). As Table 2 indicates, 
both main effects can be traced through uni- 
variate analyses to the creativity ratings. Table 1 
shows that it was Man C, the noningratiating 
compliant worker, who received a significantly 
higher creativity rating as a consequence of the 
disparagement manipulation. 
Further inspection of Table 1 reveals a Treat- 
ment X Worker interaction on the variable of 
_ performance evaluation. The evaluation of Man 
-C in the disparagement condition, as computed 
by Tukey’s HSD (honestly significant difference) 
test, was not significantly greater than the eval- 
uation he received in the control (p < 10), 
although it was significantly higher than the 
E evaluation given Man B in the disparagement 
2 condition (p <.05). Table 1 Suggests that the 
latter result arose in part because Man B was 
evaluated slightly less favorably in the experi- 
mental (X = 24.1) than in the control condition 
252). 
In no instance did Man B, the ingratiator 
keceive significantly greater rewards in the dis- 


condition, Indeed, 
and performance 
y less in the dis- 
s there any ten- 
he supervisor to 
ingratiator than 
liant worker, 


paragement than in the c 

on the variables of pay 
evaluation, he was given 
paragement condition. N 
dency across conditions 
bestow greater rewards u 
upon the noningratiating 
DISCUSSION ANI LUSION 

A conclusion seemingly 

ent investigation is that 
paragement by a subordir 
visor rates as more creati 
by Man C, a complian 
worker. This finding har) s well with the 
results of the Goodstadt < pnis (1970) and 
the Kipnis and Vanderve« 1) studies, both 
of which showed an incre supervisory dis- 
pensation of rewards to co it workers when 
a member of the work c splayed a poor 
attitude. There was a tenc for Man C also 
to receive more pay and cher performance 
evaluation in the disparagen than in the neu- 
tral condition, but these did not attain 


ated by the pres- 
xperiencing dis- 
he student super- 
models designed 
noningratiating 


significance. Taken together, se data tenta- 
tively suggest that when = clouds gather 
about the supervisor it m t be so much 
the ingratiator who can ex to benefit from 


erson who unob- 
nds placed upon 


the adverse situation but tl 
trusively complies with the < 
him. 

Kipnis and Vanderveer (197 
cant tendency across conditions 
visors to favor an ingratiator 
pliant workers who were noi 
sible explanation for failure of present study 
to replicate that ‘finding may have been the 
nature of the ingratiating comments contained in 
the notes attributed to Man B. In an-endeavor 
to fully approximate the desired experimental 
effect they were made ingratiating to a rather 
blatant degree, substantially more so than the 
notes used in the Kipnis and Vanderveer 5 
ment. For example, one of the comments in the 
present study unabashedly stated, “You've 1115 
a good supervisor.” Ingratiation, if it is highly 
visible to the target person, may constitute more 
a source of annoyance and suspicion’ than 0 
gratification (Jones, 1964). 


found a signifi- 
for student super- 
er otherwise com- 
‘iators. A pos- 
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PERSONAL CHARACTERISTICS DIFFERENTIATIN( 
EXECUTIVE FROM FEMALE NONEXECUTIVE PER 


ALE 
EL* 


ROBERT F. MORRISON ? A MARIA-LUISE SEBALI 


University of Toronto, Canada 


A study of personal characteristics differentiating employed exc 
nonexecutive women was conducted in a large metropolitan area. T 
pairs of subjects, matched on age, education, work site, length 
ment, etc., were differentiated on level achieved (organization 
policy contribution). The executive group was significantly higher i 
self-esteem component of need for achievement, (b) the need for po 


(c) mental ability. Significant differences between the pairs were ni 
in variables designed to contrast early socialization processes, marital 


ment, and the need for affiliation. 


To provide the opportunity for equal employ- 
ment to appropriate candidates, it is important 
for industry to identify and select those women 
who have the potential for successful executive 
careers. To aid in implementing this strategy, 
there is a need for research comparing character- 
istics of female executives with those of female 
nonexecutives. The purpose of the present study 
was to provide such comparisons on character- 
istics involving early socialization, current en- 
vironmental factors, and needs. 

Hennig (1971) and Epstein (1971), in their 
discussions of women in business, have empha- 
sized the early socialization process as a factor 


influencing career aspirations and behavior. On 


the basis of such earlier studies, it was hypothe- 
sized that female executives have significantly 
different early socialization processes than do 
female nonexecutives. In particular, it was hy- 
pothesized that the executives (a) are the only 


a child or eldest in an all girl family; (b) perceive 


themselves more often as the parent’s favorite 
child; (c) establish a closer, personal relation- 
ship with the father; (d) have a father in a pro- 
fessional, business, or executive occupation; (e) 
have parent(s) with a higher educational level; 


and (f) come from first generation, northern 
_ European immigrant families. 


Bailyn (1964) and Bowman, Worthy, and Grey- 
ser (1965) hypothesized that concurrent environ- 


mental components such as marital relationships 


would be important factors. It was hypothesized 


The authors wish to thank Bell Canada for 
assisting in the sponsorship of the study. 
2 Requests for reprints should be sent to Robert 


„ Morrison, Faculty of Management Studies, Uni- 
_ versity of Toronto, Toronto, Ontario, Canada, M5S 


1A1. 
® Now at the Civil Service Commission, Province 


of Ontario. 
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its of the two 
more female 
hen married, 
husbands in 


that the current marital ens 
groups differ significantly | 
executives (a) are single; a 
receive more support fron 
achieving executive status. Z 

A review of pertinent lil lealing with 


male executives aided in d he primary 
factors which differentiated secutive role 
from that of the nonexecut hiselli, 1966; 
Laurent, 1968; McClelland Vinter, 1969). 
From this review it was hy ed that fe- 


male executives have (a er need 5 
achievement; (b) a higher m power; . 
a lower need for affiliation; ai a higher leve 


of mental ability, than do the sale nonexecu- 


tives. 


METHOD 
Subjects 


Information was obtained fr 
jects working in a Canadian m 5 
Each pair of females consi an executive 
and a nonexecutive matched ing to K 
education, years of employment, and type o. 
organization, Sampling techniques were adop 
to minimize systematic bias so the sample cou 
be considered representative. ble | 

Executive subjects had to (a) be accountal = 
for a profit center or major functional pan 
of the organization; (b) be responsible for 115 
development and/or approval of organizatici i 
wide policy; and (c) supervise supervisors. a | 
differences between the executive and noe 
tive groups on the above criteria were sge 
at or beyond the .01 level using a one-taile 
test. 


ypolitan area. 


Measures of Characteristics 


. jh- 
The first measure was a three-part 19990 
ical questionnaire. Part 1 (11 items) de 


agement Pote 
was develope 
hypotheses 

information 

The other 
tause they h 
tive from n 
The Wesman 
C) was usec 
merical men 
wards Pers: 
was used to 
or motives. 

from the ar 
bject withi 
ol eight or | 


sample charac 
the biographi: 
Corporation < 
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s Part 2 (55 items) adapted 
sstionnaires used in the Exxon 
Early Identification of Man- 
(Laurent, 1968); and Part 3 
ve authors to further test the 
ms) and to provide unique 
ms). 
instruments were chosen be- 
reviously discriminated execu- 
utive males (Ghiselli, 1966). 
nel Classification Test (Form 
reasure total, verbal, and nu- 
bility factors, while the Ed- 
Preference Schedule (EPPS) 
five pertinent individual needs 
of subjects were eliminated 


is of the EPPS data when a 
iat pair had a consistency score 


To reduce the data and test the judgmental 
clustering of ordinal and interval items used to 
assess the hypotheses, a principal-components 


analysis with an orthogonal rotation to a vari- 


max solution was done. Thirty nondemographic 
items from the biographical inventory were in- 
cluded in the factor analysis. 


RESULTS 


Of the 84 variables used to test the hypothe- ; 
ses, 25 yielded significant differences at the alpha 


level of Os or better. Only 10 would be expected 
to occur by chance with probability equal to 01 


(Sakoda, Cohen, & Beall, 1954). The factor 
analysis of biographical items yielded six factors 


(see Table 1), accounting for 42.5% of the vari- 


ance, of which five were consistent with ration- 


ally developed clusters previously designed to 


test the hypotheses, These six factors were titled — 


TABLE 1 


BIOGRAPHICAL QUESTIONNAIRE (BQ) Facrors DESCRIBING Executive 
AND NonExecutive WOMEN 


Less sociable 


or Representative BQ Items * 
mo m F 
Undergraduate organizations — 1.41 6 
More undergraduate leadership jobs —2.16* 7 
More high school leadership jobs =2.11* : 
More high school organizations —2,04* f: 
Rates self high as a supervisor 3.82. q 
Has exploited opportunities 2.18* 
Most effective in administrative job 2.30** i 
Sways group of associates 2.85˙ 
ry Role Desires to work at top levels Li baad 4 
5 Gives subordinates more chances e 
(5.5.% of variance) Supervises more employees -3.87% 
Fewer lunches with subordinates -3.16 
Early Career Many teen physical activities —0.10 
Socialization Process Exclusive early home z 1.58 
(5% of variance) Parents encouraged college —0.23 
Discussed career with parents —1.75 
Social Skills Others contact her with problems — 142. 
(4.9% of variance) Expresses views in groups 2.85 
Gets along well with others n 
Turns down subordinates lunches —3.16 
Independence Higher level of education 005 
4.2% of variance) Paid more college costs —0. 
Too much contact with boss pes 


idf = 38 
eh? Sol. 


b <.001, 


be 105 one-tailed for hypothesized variables. 


‘Early Career Socialization Process, Self-esteem, 
pendence, Leadership, Supervisory Role Per- 
ons, and Social Skills. Individual jtems 
within a factor were analyzed for signifi- 
between the two groups without the de- 
velopmemnt of factor scores. 
Ide Early Career Socialization Process factor 
i composed of 10 items primarily assessing 
early family life and socioeconomic status, Rep- 
| fesentative items are shown in Table 1. No 
items on this factor differentiated between the 
two groups. Other related nominal items which 
could not be included in the factor analysis also 
did not discriminate between the two groups. 
Additional items evaluating early family life in- 
‘cluded information about the number and ordinal 
Position of siblings, relationships with parents, 
aud intra- and extra-family telationships. Socio- 
economic status items included contrasts such as 
2 nt status, ethnic background, parental 
education and occupation, and family income. 
Atkinson (1958) has stated that high self- 
“esteem and independence are characteristics of 
ose individuals with need for achievement. A 
econd factor, Self-esteem, discriminated between 
two groups, In addition to the items in Table 
I, executives felt they had achieved more than 
their educational peers, had more good breaks, 
and desired administrative work. 
Independence was another characteristic which 
‘Was present in the factor analysis. None of 10 
items similar to those in Table 1 discriminated 


{ 


immig 


between the groups. 
Oft the additional nine biographical question- 
naire items used to measure need for achieve- 
Ment (need to complete work, take median level 
risks, plan for the future, etc.), only “the desire 
to assume responsibility” demonstrated a signifi- 
cant difference between the two groups with 
executives requesting more. While the Achieve- 
ment scale on the EPPS supported C778: 
P< .05) the hypothesis that female executives 
have a higher need for achievement, it appears 
that only the characteristic of self-esteem within 
the overall need for achievement measure dis- 
eriminated between the two groups. No real 
“Support was found for the hypothesis about need 
for achievement. 
The need for power (Atkinson, 1958) was 
‘Measured through the EPPS Dominance and 
ence scores as well as 19 biographical data 
As hypothesized, the Dominance score on 
EPPS was significantly (¢=—3.14, p< 
001) higher for executives, but the Deference 
e was not higher for the nonexecutives, 


jon- 
conducted to adequately assess such a conclusi 


Two of the factors 
visory Role Perceptior 
representing a need for 
major loadings on the 
were leadership positi 


titled 


Leadership 
memberikis 


oup 

items (Table 1). Five i ided onto Sc 
visory Role Perceptio scriminated gig. 
nificantly between the t (Table 1), 

Thus, it appeared th > was indeed a 
difference between fen tives and me- 
executives regarding th: or power, with 
executives having a hig than did nom 


executives, 


Two scales from the Affiliation and 


Nurturance, appeared to e the characters 
istics of need for affiljati ‘fined by Allis 
son (1958). Using a one-t t, both showed, 
Significant differences (p n the hypothe 
sized direction with execu er on each. OF 
the 19 biographical data ypothesized te 
discriminate between the ips, only four 
did so. Though all items the hypothe: 
sized direction, such resul! ot meaningfal 
(Sakoda et al., 1954), and pothesis is dot 
accepted. ] 

1 cluster representi: need for afili- 
ation did not come out of actor analysis 
which had included six ratio: jeveloped mea 
sures of need for affiliation > Social Skills 
factor contained elements of need for affiliation 
combined with social influer ems from the 


indicates that 
relate to 
power rather 


need for power (Table 1). 7 
social skills reflect the ability t 
as required to fulfill the need 
than the need for affiliation. 
Using a one-tailed “ test, 2 
man test, verbal, quantitative, 
substantiated (5 < .01) the hy; 
that the female executives illus 
tellectual capabilities. r 
The husband’s role behavior (5 items) in sup- 
porting his wife’s career, career aspirations, 
ness problems, and potential geographic raea 
tion did not differentiate significantly between 
two groups. 


rts of the Wee 
total scores, 
sis indicating 
trated more . 


Discussion 


Since the hypotheses involving the — 
power, mental ability, and a major element 0 
esteem) within the need for achievement bes, 
supported in the research and had been dev 
oped from research on male executives, it 
be possible that the female executive is similar 
motivation and ability to the male a 15 
Cross-validation and further research should e 


ume that the socialization 
important factor in female 
ture research might consider 
on may have a major impact 
voice between a career in an 
a home, Later experiences 

cation (Berlew & Hall, 1966) 
vidual’s choice about, as well 
an executive position, 
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Differences in black student and white student perceptions of unive te 

were examined at an integrated university. Factor analyses oi tem 
* university climate questionnaire were performed in racial subgr: ve 
È factors were extracted for whites and six for blacks. Two rac ors 
fs appeared in the black sample, while only one racism factor appea the 
white sample. There were consistent and significant differences on tor 
scales between the two races, with blacks perceiving the univers rte 


more negatively. 


In recent years much effort has been expended 
in the measurement of university climate (Feld- 
man, 1971). Previous studies attempting to de- 
scribe college environments have been oriented 
to interinstitution comparisons (Astin & Hol- 
land, 1961; Stern, 1970). Such comparisons are 
meaningful in evaluating overall institutional 
effects on college students as a group. However, 
it seems equally important to understand the 
perceptions of university climate in different sub- 
= groups within the university. 

The little research that has been performed 
comparing climate-type variables for blacks and 
_ whites suggests that people in different groups at 
_ the same university may function under different 
Conditions and perceive the climate differently, 
Hiedegard and Brown (1969) found differences in 
perceived academic competition, friendliness of 

personnel, participation in activities, and satis- 

faction with college. Willie and Levy (1972) 
noted a separatism between blacks and whites on 
integrated campuses that could lead to different 
university-related experiences. The purpose of 
the present research was to investigate intra- 
institution differences in perceived university 
climate for black undergraduate and white under- 
graduate students. 


E STE, 


METHOD 


Forty-three black students and 73 white stu- 
dents at the University of Maryland were asked 


5 The authors wish to thank Duncan L. Dieterly, 
C. J. Bartlett and Dorothy Evans for their com- 
ments and suggestions. Computer time was provided 
7 by the Computer Science Center, University of 

Maryland. 
. Reprint requests should be sent to C. Michael 
= Pfeifer; Jr., who is now associated with Westing- 
house Behavioral Safety Center, Box 948, American 
8 City Building, Columbia, Maryland 21044. 


UNIVERSITY CLIMATE PERCEPTIONS BY BLAC: 
WHITE STUDENTS * 


C. MICHAEL PFEIFER, Ja. ano BENJAMIN SCHNEI! 


University of Maryland 


ND 


iniversity. Four 
e used to intro 
f of the essay 
e experimenter 
r. These essay 
black students 
generate item 
> questionnaire 
tems describing 
for the final 
| of 730 white 
were asked t 
rent with eadi 
scale.“ Subjects 
tory psychology 
courses, Afro 
s, and an It 


to write essays describin 
different sets of instructio: 
duce the essay-writing ta 
sessions were proctored by 
and half by a black exper 
and follow-up interviews wi 
and 17 white students were 
content for the university 
From an initial pool of 
university life, 115 were 
form of the questionnaire. 
students and 138 black stud 
indicate the extent of their 
of the 115 items on a 5-p 
were recruited from an inti 
course, higher level psycho 
American Studies Program cour 


tensive Educational Developm<:: program. A 
The items of the questionnzire were 1 
to a factor analysis in separate racial samples 


All 138 black respondents were used in the 
black factor sample, In the white factor sample 
550 white respondents were used leaving 180 
respondents for a white holdout sample. A po 
pal-factor solution was used (BMDX72) Wil 
maximum row correlations as communality et: 
mates, An orthogonal rotation was perform 

using the varimax method (Kaiser, 1956). 

Five factors were rotated in the white ad 
ple, and six factors were rotated in the 1 
sample. An eigenvalue cutoff of 2.0 was adop! a 
when deciding the number of factors to xa 
tated, although fewer factors were rotate m 
the black sample because of the presence ot 
factors which accounted for very little acs: 
An item was retained for scoring if its 10% 
was .35 or greater in absolute value on & & 


ilable 
3 Copies of the climate questionnaire are a 
from the first author. 
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factor and : ver than .25 in absolute value 
s any ot! :ctor. However, items which 
leaded .50 ster in absolute value were re- 
tained regar of their loadings on other 
factors.* 


Factor s were computed by summing 
responses t s loading on each factor and 
dividing by umber of items responded to. 

RESULTS 

Five fac which emerged in the white 
sample, alc th item examples, were as fol- 
lows (fact lings are in parentheses): 
Impersonal emic Atmosphere—a view of the 
academic | s as inadequate, for example, 
(e) “Cour it the University are designed 
more to fi people out than to teach them” 
(58); and ‘Faculty members at Maryland 
expect too h from their students” (.54). 
Administr Neglect—a lack of concern for 
Students o © part of the administration, for 
example, The administration has been re- 
šponsive t idents’ needs” (—.60); and (b) 
"The Unis administration is behind the 
times” (. 80 


“on—an atmosphere conducive to 
king with other students, for ex- 
he University of Maryland teaches 
ng with other people” (.55); and 
n has the opportunity to meet 
is own ideals at the University” 


z directly with racial matters, for 
“Maryland's new admissions policy 
keep blacks out” (.59); and (b) 
y is the only thing blacks are 
he University” (.52). 
na lack of interaction between 
Maces at the group level, for example, (a) “Whites 
at the University feel uneasy when blacks get 
together in groups” (.57); and (b) “Black stu- 
ts at the University do not trust white stu- 
dents” (40). 


In factors identified in the black sample were 
follows: 


Institutional Racism—concerning matters such 
A admissions standards and the motives of the 
“ministration in admitting blacks, for example, 
Q “Maryland’s new admissions policy was de- 


“Tabulations of the items on each rotated factor 

90 fir factor loadings are available upon re- 

mie of intercorrelations of factors are also 
le. 


vised to keep blacks out” (4); and (6) “The 
only reason the University admits blacks is to 
get Federal money” (.56). 

Nonacademic¢ Atmosphere—concerning the leam 
ing process, for example, (a) "The faculty at the 
University create an atmosphere for learnin 
(—.54); and (b) “There is too much red tape at 
the University” (83). * 
Social Isolation —indicating difficulty in meeting 
and talking with people, for example, (a) “I 
difficult to make new friends at the Un 
(54); and (b) “Maryland's campus is so form: 
that you feel you can't relax” (46). 
Personal Racism—dealing with interactions be- 
tween black and white students at the individual — 
or group level, for example, (a) “Whites at the 
University feel superior to blacks” ($6); and 
(b) “Blacks are unwanted in fraternities or 
sororities at Maryland” (.54). 
Nonclassroom Related  Activities—concerning _ 
such varied things as sports, remedial programs, — 
geographic location, extracurricular activities, for 
example, (a) “Remedial programs on campus 
help a lot of students” (46); and (b) Sports 
are good for the University climate (48). . 
Attempts at Communication—concerning com- 
munication between students generally, between 
black and white students specifically, and between 
students and the administration, for example, (a) 
“Students at the University create an easy-going 
atmosphere for each other“ (41); and (5 
“Whites at Maryland force themselves to be 
nice to blacks even if they are prejudiced” (41). 


Table 1 presents means, standard deviations, 
and internal consistency reliability estimates for 
each factor. Black students perceived the Uni- 
versity more negatively than did white students 
(p <.01) on all but one of the climate scales, 
Attempt at Communication. 


Discussion 


One of the main differences in climate face 


tors between the black sample and white sample 
was the emergence of two racism factors in the 


appeared in the white sample contained items 
from both of the factors of the black sample.) 
An important implication of these results is that 
black students perceive differences between what 
happens to blacks on a person-to-person level 
(Personal Racism) and what the institution does 
to blacks as a class of people (Institutional Rac- $ A 
ism). Gurin, Gurin, Lao, and Beattie (1969) 


y 


found a similar distinction among blacks between 


TABLE 1 


Sraxpagp Deviations, AND SIGNIFICANCE OF 
MEAN DIFFERENCES 


Internal 

„ | Consistency 

È Reliability 

(KR-20) 
180 71 
138 67 
180 72 
138 60 
180 77 
138 68 
180 73 
* 138 71 
Racial Separation 180 61 
= 138 25 

_ Black sample factors 

Institu Rac- 180 68 
138 77 
Nonacademic At- 180 51 
mosphere 138 59 
Social Isolation 180 70 
2 138 72 
Personal Raciem 180 49 
138 71 
180 41 
138 54 
180 09 
138 47 


Note. For each pair differences between the means of whites 
= significant (p < 01), except for Attempt at 


personal items and ideological items when factor- 
analyzing items measuring Rotter's concept of 
internal-external control (Rotter, 1966). 
_ Another difference which emerged between the 
vo groups involved the perception of the social 
climate. Although a factor dealing with social 
climate emerged in both groups, there were dif- 
ferences in the items loading on the factor. Items 
loading on the Social Interaction factor of the 
hite sample were, for the most part, positively 
ded, while items on the Social Isolation scale 
the black sample were negatively worded, 
In addition to differences in dimensionality or 
actor structure of university climate for blacks 
and whites, a comparison of the rank order of 
means indicated that blacks and whites perceive 
e behavioral conditions these factors represent 
existing in different degrees. The rank-order 
lation of the means for blacks and whites 
Was —.85. This difference in rank order might 
| e a function of the salience of the dimensions 


for the two groups. 


* 29 n 
Snort Notes 


Future research might 
the ways in which differ 
use the cues available 


iseful to examine 


groups of people 
environment te 


structure their perceptior e world around 
them. The present data su iat when the set 
of cues may be of partic: nce to a group 
they will make finer discri ins among them 
(eg. racism cues for bl than when the 
cues are from a set of eve so meaningful 
or important (e.g., racism r whites), 

To summarize, dimensio; versity climate 
appear to differ for blach nts and white 
students. In addition, bla isistently per- 
ceived the university in a negative way 
Since some research sugg link between 
climate and academic per! e, future te- 
search investigating univers ate may help 
to insure the equality of | educational < 
experience, including acaden formance, for 
different races. By identify nditions and 
situations within the univer which are of 
particular salience for subg of students, 
changes in university policie vrocedures can 
be effected. 
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writers or, 


ists on any criteria even when the 
ause one-third or less of the total 


juter systems. 


writer keyboard is a common in- 
in communication between people 
y the only widely used medium of 
between men and computers. Ac- 
Michaels (1971), there are approx- 
5 million typewriters in the United 
tates, e reason the typewriter is so popu- 
ris taat it can be used by people with a 
fide range of skill levels. A secretary with 
raininy and experience may reach a typing 
ate of 80, 100, or even more words per min- 
e. On the other hand, a person with no 
aining can sit at a keyboard and do a re- 
pectable job of pecking out a personal letter 
Or even an article such as this. 
Interactive keyboard terminals in time- 
hared computer systems are already common 
in a wide variety of business, industrial, and 
educational settings. The continued prolifera- 
ion of these systems and their possible future 
volution into computer utilities (see, for ex- 
ample, Irwin, 1966) are certain to result in 
an increase in the diversity of both their ap- 
Police ions and the kinds of people who use 
them 
The high costs associated with the use of 
computer terminals and the value of a user’s 


This research was done under Research Grant 
GN35023 from the National Science Foundation, 
0 sce of Science Information Service. 

Requests for reprints should be addressed to 
Abhonse Chapanis, Department of Psychology, Johns 
Hopkins University, Baltimore, Maryland 21218. 


8 IES IN INTERACTIVE COMMUNICATION: 


\ \PERATIVE PROBLEM SOLVING BY SKILLED AND 
ILLED TYPISTS IN A TELETYPEWRITER MODE 


LD D. WEEKS, MICHAEL J. KELLY, axo ALPHONSE CHAPANIS * 


Johns Hopkins University 


erson teams of skilled and of unskilled typists (N = 16) cooperated to 
credible, real-world problems by communicating 
as a control, by conversing face-to-face. Performance was 
-sed by (a) the time taken to solve a problem, (b) measures of behavioral 
vity, and (c) several measures of verbal 
« differences between face-to-face conversation and 
writer. There were, however, no differences 
y communicated by typewriter, in part 
time was spent typing, and in part be- 
vase subjects composed their own messages. These findings have implications 
wr the use of keyboard terminals in telecommunications and interactive com- 


through interconnected 


output. All three criteria showed 
communication via 
between the two groups of 


own time have prompted wide-scale efforts 
to increase the efficiency of man-computer 
interactions. Since users of both typewriters 
and terminals come to their tasks with a wide 
range of typing skills, one plausible way of 
increasing the efficiency of this interaction 
might be to require would-be users of a sys- 
tem to demonstrate first a given level of typ- 
ing skill. Secretaries who type business corre- 
spondence must, of course, have a high level 
of proficiency. On the other hand, proficiency 
in typing is required in some occupations 
where the need for it is much less apparent. 
For example, Federal Aviation Administra- 
tion flight service specialists use keyboards 
only to teletype weather reports and flight 
plans. Yet they must demonstrate the ability 
to type at least 35 words a minute. 

The usual method of evaluating typing per- 
formance is to measure a typist’s keying speed 
and accuracy in copying text. Simple copying 
skills are, however, only a minor factor in 
most tasks involving communication via key- 
board. Jurgensen (1942) reports that tests of 
simple copying skills have little validity in 
predicting typing success in stenographic and 
secretarial positions. By concentrating on the 
psychomotor aspects of key stroking sucl 
tests neglect a host of other important skills 
This is strong evidence for what West (1967 
1969b) has referred to as the “substantia 
irrelevance of straight copy skills to prof 
ciency at realistic typing tasks.” 
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In a realistic task involving interactive 
communication with a computer system, 
Morrill, Goodwin, and Smith (1968) found 
that subjects without typing skills performed 
their task about as fast as skilled typists and 
with no more errors. Unfortunately, their de- 
sign confounded typing skill with sex and 
educational level. The typists were all female 
secretaries with high school educations, while 
the nontypists were males who had attended 
at least one year of college. 

In a similar kind of study, Chapanis, 
Ochsman, Parrish, and Weeks (1972) had 
two-man teams of subjects communicate 
through typewriters to solve problems such 
as might be used in interactive computer sys- 
tems. Some of the teams were made up of 
inexperienced typists, others of subjects who 
had completed a formal course in typewriting. 
While the linguistic transmission rate of the 
typists was greater than that of their un- 
trained counterparts (18 versus 10 words per 
minute), the differences between the two 
groups were trivial in all other major re- 
spects—total solution time, behavioral activ- 
ity, and verbal output. The investigators 
readily admit that this unexpected result may 
have been caused by unavoidable confounding 
in the execution of the experiment. Since the 
skilled typists were selected and tested later 
than the unskilled typists, the former may 
have come from a different population than 
the latter. In particular, no attempt was made 
to control for possible differences in verbal 
ability between the two groups. This may be 
a significant factor in view of the considerable 
evidence, beginning with the classic work of 
Bryan and Harter (1899), for a large verbal 
component in any text keying task. 

The major purpose of this study was to 
provide an unconfounded test of the effect 
of differences in typing skill on dyadic com- 
munication via teletypewriter such as might 
characterize interactive man-computer dia- 


logue. Another purpose was to collect verbal 
and behavioral data that might help explain 
why skilled typists do or do not enjoy an ad- 
vantage over unskilled typists. The final pur- 
pose was to achieve much better control over 
the equivalence of the two groups of subjects 
than had characterized previous studies in the 


G. D. Weeks, M. J. KELLY, AND A. CHAPANIS 


field. This equivalence we 
the skilled and unskille 
municate in a mode simu 
to-face conversation. 


natural face- 


METH 

Subjects 

Sixteen female students, sen parochial high 
school in Baltimore ranging from 17 year 
5 months to 18 years 3 mo hired as sub- 
jects. Subjects were paid a of $2.50 per 
session, a bonus of $. g at a correct 
solution, and, as an incentiv: iplete the four 
days of test, an additional $3.00 on the 


last day of test. 

The eight subjects in th: 
were selected from secretarial 
fied by their instructor to h 
cess of 40 words per mi 
typists were selected from 
riculum of the school. Our 
that the unskilled typists sh 
of less than 25 words px 
recognize that it would ha‘: 
this cutoff score lower, in < 
impossible to find high scho 
never used a typewriter and 
to skilled typists in all other 

To control for intellectual 
subjects were given the verbai 
School and College Ability Te 
sections 1 and 3 (Educationa 
The subjects were selected s 
unskilled typist groups were 
(M=300.2, SD=10.6; M 
spectively). 


* typist group 
und were certi- 
g speeds in et- 
ight unskilled 
oriented cur 
criterion was 
a typing speed 
Although we 
jesirable to st 
it is virtually 
nts who have 
ire comparable 


all prospective 
portion of the 
AT), Form 24 
z Service, 1957). 
the skilled and 


1, SD=135, k. 


Apparatus 


municated from 
ms, The wall be- 
pertures. One was 
ss window (1,06 * 
to be sound insulat- 
ed across a seconi 


The two members of a team © 
two adjacent, sound insulated r 
tween the rooms contained thr 
filled with a double-paned p 
1.39 m.) especially constructed 
ing. Speaker grill cloth was si 4 
aperture (75 X.20 m.) to allow virtually unt 
stricted voice communication. Finally, the third open 
covered by two interconnect 

5 or in one 
the other. 
d be 


This arrangement allowed objects 
tween rooms but prevented direc 
through the passage. When all three api 
left open, the subjects were able to converse om- 
face, a condition that we have been calling 2 15 et 
munication-rich mode (Chapanis, 1973; K 
al., 1972). When visually opaque, sound inst 
parels were placed over the three a i 
subjects were effectively isolated from each 1 ir for 

Each room contained a work table and «output 
the subject, and an IBM Model B inp 


ked by having 
sts also com 


1 on this variable 


mal. (1972) 


odes of C 


Two diffe 


in norn 
iction we 


m, identified 
ome of the 
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he corresponding machine in the 
sage typed on one machine ap- 
sly on the other. Each subject 
vith which she could at any time 
ner and take control of the type- 
n channel or give control of it 
nore detailed description of the 
sements may be found in Chapanis 


unication 


odes of communication, typewrit- 
atien-rich, were tested in this ex- 
typewriting mode, subjects were 
nunicate only through the slaved 
ich room. The window and grill 
covered so that they could neither 
nother. 
inication-rich mode, the subjects 
d hear each other. They could talk 
other their parts of the problem; 
zrimace, or use any other form of 
val communication that they might 
ice-to-face conversation. The only 
iat the window and wall between 
subject from touching her partner 
oblem-related materials. 


ı a team had two functionally dif- 
subject we refer to as the seeker 
person with the problem to solve. 


folio defined the nature of the prob- 


yy constraints on it, and contained 
mation needed for its solution. 
efer to as the source served as an 
seeker, providing the rest of the 
‘ion from her information folio, 
tve efforts of both team members 
ive at the solution. 


> Tasks 


d so often exploited in problem-solving studies, 
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the source's folio were similar to the seeker’s socket 
in some respect, only one was a perfect match: 

The information retrieval problem. The secker was 
asked to prepare a bibliography of five different 
newspaper articles relevant to the topic: The threat 
to individual privacy that would be posed by a 
United States Data Center based on a computerized 
information retrieval system. The source was given 
the New York Times Index for 1967, from which 
the citations were to be drawn. 

The jaultfinding problem. The secker had to per- 
form three diagnostic tests on a mockup of an auto- 
mobile ignition system, using a commercially avail- 


able automobile engine analyzer. She was told that 


there were one or more defects (there were actually 
two). The source was given a detailed set of in- 
structions for the tests. 


The class-scheduling problem. The seeker was given > 


a list of four college courses which had to be 


arranged into a workable schedule within specified 


time constraints, such as commuting schedules. The 


source had a 97-page booklet listing the complete 
time schedule for courses at the University of Mary- 


land. The courses and constraints were such that there 
was only a single correct solution. 


Experimental Design 


The design used two identical Graeco-Latin squares 
with skill levels as the between-squares factor (see 
Table 1). A between- and within-modes format was 
superimposed on one variable of the Graeco-Latin 
square. This design gave us a straightforward test of 
the two sources of variation in which we were most 
interested, the main effect of skill levels and the Skill 
Level X Mode interaction. We included the communi- 
cation-rich mode as an additional check on the 
equivalence of our two subject groups. If our selec- 
tion procedures were successful, the two groups of 


TABLE 1 
EXPERIMENTAL DESIGN 


Me attempted to model ours after everyday, real- 
Orld situations. They were tasks on which two 
People might naturally work together, and for which 
© aid of present or future computer systems might 
De useful, 
We selected four problems: (a) an object identi- 
3 and (b) an information retrieval problem, 
( eveloped earlier by Chapanis and Overbey 
aay and (c) a faultfinding and (d) a class- 
D uling problem, both developed especially for the 
Mesent study. 
ven 9 tect identification problem. The secker was 
og ee electric pilot light socket and was asked 
bartner a identical replacement socket from her 
5 differ e source was given a large tray containing 
divid ent Leecraft pilot light sockets arranged in 
ual receptacles. Although all 65 sockets in 


Communication modes 

505 e T 10 
tion-rich n 
Ti Pab, De Pa De P, Da P Di 
Skilled Te PD! Pe, Ds | Pa, Do P D. 
typists Ts PI, De PS, Ds | P, DI PDs 
Ti Ps, D; P, Dı | PDs P, D 
Ts Pa Da | Pa, Do | PD | Ps, Di 
Unskilled | Te Pa Di Pe, Ds | Pa Do | Pi,Da 
typists | Tr | Pi,D2 PD. PD | Po Ds 
Ts PD: PD | PDs | Pa, Da 


a Each team, T, was made up of two girls, designated as seeker 


and source, 


b Pi (object 
nding), anı 
a Debs Ds, D4 are the 


of test. 


identification), Pa (class scheduling), Ps (fault- 
Px (information retrieval) are the four problems. 
first, second, third, and fourth days 
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TABLE 2 


Turre FORMS OF THE ANALYSIS OF VARIANCE 


df 
Source of variation INEPT 
A B C 
— a eet 
Between teams (T) 7 7 7 
Skill levels (SKL) 1 1 1 
Between teams within skill 
levels (T/SKL) 6 6 6 
Subjects within teams (S/T) 8 8 
Job roles (JR) 1 1 
JR X SKL 1 1 
JR X T/SKL 6 6 
Within subjects 48 16 
Modes (M) 1 1 
MXT 72 7 
M X SKL 1 1 
M X T/SKL ó 6 
M X S/T 8 
MXJR k 
M X JR X SKL 1 
M X JR X T/SKL 6 
Residual 16 32 
Total 31 63 31 


subjects should solve problems about equally well in 
the communication-rich mode. In that case, the main 
effect of skill levels would be one test of any differ- 
ential effect of typing skill. On the other hand, if our 
selection procedures did not succeed in perfectly 
matching the two groups of subjects, the lack of 
equivalence would be revealed by differential per- 
formance when conversing face-to-face. In that case, 
the effect of typing skill could still be evaluated 
through the Skill Level * Mode interaction. 

Because modes were superimposed hierarchically on 
one variable of the Graeco-Latin squares, the test 
for modes may be confounded with a possible Prob- 
lem X Day or Mode X Problem X Day interaction. 
However, we could see no reason to expect these 
interactions to occur. A previous study (Chapanis & 
Overbey, 1974) had found no interactions involving 
days. In any case, these two modes of communica- 
tion have already been extensively compared in 
earlier studies (Chapanis et al., 1972; Chapanis, 1973; 
Ochsman & Chapanis; in press), so that they were not 
variables of primary interest here. 

Three forms of analysis of variance used for the 
data are shown in Table 2. The rationale for the 


three different analyses is explained in the section 
on results and discussion. 


Procedure 


The eight subjects within each skill level were 
randomly paired to make four teams of skilled typists 
and four teams of unskilled typists. Each of the 
team members was then assigned one of the two 
job roles, that is, source or seeker. When they arrived 
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for an experimental sessio! 
mained together while the 
the general instructions fc 
scheduled to solve that da) 
sized the importance of 
participants. The subjects 
and to do or say anything 
tion of the problem. They 
possible, and would be paid 
was correct. 

The subjects were then 
the two experimental rooms 
her problem-related materi 
with the communication ap; 
gan when the experimenter 
subjects were familiar wi 
cedures, and task. 


am members fe 
nters read to them 
oblem they were 
structions empha- 
ion between the 
i to act naturally 
ht expedite solu- 
vork as quickly as 
z if their solution 


and taken into 
ibject was given 
was familiarized 
The problem be- 
itisfied that the 
quipment, pro- 


Data Collection 


Three kinds of data we 
tion, observable behavior 
teams reached a solution 
few incorrect solutions ( 
evenly between skill leve! 
incorrect solutions have beer 

Behavioral measures. Wi“ 
method of recording behav! 
used by Chapanis et al. ) 
recorded the behavior of the 
of the source. Each experim«: 
tone in an earphone every fiv 
corded what the subject Ras 
instant of the signal. 

The subject’s behavior was 
nine categories. The activity © 
defined as engaging exclusively 
of a message either by typing, 
using some form of nonverb: 
activity was scored as “other s 
was simultaneously sending a 
in some other activity. Ther 
receiving activities receiving 
ceiving.” The remaining five cate 
ing only,” “making notes,” “pa 
ing,” and “other.” 

Two composite behavior cat 
from the measures above. “All ding. 
of times spent in sending only and in other sen 55 
“All receiving“ was the sum of times spent in 
ceiving only and in other receiving. 

Following the procedure used by © 
(1972), an independent observer sat in on on pedi 
sion with each of the two regular observers to ites 
interobserver reliability. The obtained relia 1 
(87 and 96) are of the same magnitude as foun 
the earlier study. 

Time to solution. The total time to solvi 
was calculated as the sum of the times @ 1 fot 
spent in the nine behavior categories, correcte 
any observations missed by the experimenter. ot 

Verbal output. Audio tape recordings Were Thee 
of all sessions in the communication-rich mode. *, y 


5 S against 
recordings were transcribed and then verified 48 


time to solu- 
al output. All 
gems. Since the $ 
vere distributed 
xong problems, 
in our analyses. 
revisions, out 
imilar to that 
de experimenter 
r, the other that 
id an auditory 
ds. He then rè- 
doing at the 7 


ded in one ú 
ding only” was 
the transmission 
speaking, or by 
„munication. AB 
if the subject 
e and engaging 
two analogous 
and “other Te 
were: Search. 


ing’ was the sum 


Chapanis et al. 
e ses- 


~~ 
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pon interest, however, was in the main effect 
of skill levels and in the interaction of Skill 
Level x Mode. Neither of these two terms 
Was significant for any dependent measure. 

oo with the statistical conundrum of 
zi ae to prove the null hypothesis, we de- 
with} to show data for the two skill levels 
oo each mode whenever we showed sig- 
ae mode effects. While the figures and 
it, thus follow the conventional format for 
pode 1 significant interactions, our pur- 
the 555 bresenting the data this way is exactly 
1 Our aim is to show how unim 
=e typing skill is for interactive com- 

cation via typewriters. 
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TABLE 3 
Sratıs SIGNIFICANT SOURCES OF VARIATION IDENTIFIED BY THE 14 ANALYSES OF VARIANCE 
Sources of Variation 
Depender es . — — — — J 
Skill levels | Modes as ob roles 5 
(SKI) | m) | SEDM ; gr) | SKLXIR | MXJR SLX 
— — $ 2 he | XJR 
Time to so! | 006 x — 
Behavioral res | 
Sending | <.0005 
Other ser — = | 
Receivin 0005 | 
Other rec = ye | 
All sendi 04S | et 
All receis | 
Searchin 06 002 | 
Making 1 >.975 ‘001 
Waiting j 
Verbal mea 
Number 035 
Number 
Message h 0005 
Commu n rate <.0005 >.99 
| 
Note. A da es that the term is not a source of variation in the analysis of variance for iat dependent N 7 
the tape b persons working independently: the Time to Solution 
Secretary v typed the original transcript and one 8 g n 
“of the exp -nenters, Subjects in the typewriting One important dependent measure in this 
mode gener: <d their own hardcopy record of their Study was the total time taken by a team to 
{a Several verbal measures (described solve a problem. Because solution time is a 
fe mputed from these transcripts and measure of team performance and not indi- 
vidual performance, the analysis of variance 
ESIS AND DISCUSSION of the time data (Form A, Table 2) contains 
no terms for job roles or any interactions in- 
_ All statistically significant effects obtained volving job roles. 
in this study are shown in Table 3. By far the The height of each bar in Figure 1 repre- 
Most conspicuous finding in the table is the sents the average time to solve problems for 
strong and consistent effect that modes ex- that skill level-mode combination. The most 
erted on almost every dependent variable, Our striking feature of that figure is the large dif- 


ference between the two communication 
modes. On the average, our teams of subjects 
solved problems more than three and one-half 
times faster when conversing face-to-face than 
when communicating through the slaved type- 
writers. Incidentally, the difference in solution 
time between modes appears to be inflated 
by the extremely short times required to solve 
the object identification problem in the com- 
munication-rich mode (M = 3.8 minutes). If 
only the other three problems are considered, 
there is a 2.8:1 ratio between modes, a figure 
that is much closer to the 2.3:1 ratio 
Chapanis et al. (1972) found between their 


communication-rich and typewriting modes. 
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80 = — 
1i ting 
ther ] 
0 I g Notes 
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Making Notes hing | 
60 ly 
Waiting 
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S22 Searching 
2 Only 
a ng 
Z 5 J 
40 
v 
E 
a Receiving | | 
5 Only 
o 30 i. — 
= 
20 
0 ' 8 9 
Making Notes | ; 
571 
aiting Only 
10 
Searching 
Only 2A 
© n 
i i lied 
Skilled Unskilled Skilled Unskil 
Typists Typists Typists Typists 
Communication Rich Typewriting 
Mode Mode 


i Y ope f with 
Ficure 1. Mean time to solve problems for each skill level in the two communication modes 
behavioral category time distributions. 


Neither skill levels nor the interaction of 
Skill Level x Mode was significant. To be 
Sure, the skilled typists solved problems about 
nine minutes faster than did the unskilled 
typists in the typewriting mode. However, the 
skilled typists were very nearly that much 
faster (about seven minutes) than their un- 


per se gave the skilled typists any differ 
advantage. 


18110. 
skilled counterparts in the con 
rich mode as well. Although the skilled typ 

appear to have been able to solve these 


prob- 


; idence 
lems somewhat faster, there is no b 
whatsoever in these data that typing 


entia 
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Searching 
Only 


Searching 
Only 


Waiting 


Receiving 
Only 


Receiving 
Only 


Sending 
Only 


100 
‘arts Handlin 
Making Notes 
8 80 
— | Searching Searching 
F | | Only Only 
0 | 
5 | 
H 
60 
o | 255 
Waiting 
© | 
5 
E | Other Other 
5 | | Receiving Receiving 
Ə 40l- | 
O o pame 
g- | Receiving Receiving 
5 Only Only 
1 
| Other Other 
20 Sending Sending 
| Bede! 
Sending Sending 
Only Only 
L i 
Skilled Unskilled 
Typists Typists 


Communication Rich 
Mode 


Ficure 2. Mean percentage of total time spent in various behavioral 
des, (The data of Figure 1 are replotted here as percentages.) 


in the two communication mo 


Behavioral Measures 


a activity analyses enabled us to see how 
ae times in Figure 1 were distributed 
der the nine behavioral categories. The 
A of each bar in Figure 1 show, in 
a order, the average times subjects spent 
ne . activities. Since the amount of 
ae 0 7 to an activity was free to vaty 
pn 1 team member, each mean is based 
ie ae of 16, Some categories occupied 
they T of the average subject’s time that 
the to re represented as unlabeled spaces at 
p of each bar. 


Skilled Unskilled 
Typists Typists 
Typewriting 


Mode 
categories for each skill level 


Because of the large differences between 
the two modes, the data were transformed to 


perc 
within and between behaviors. To do this, 


the time devoted by each subject to a par- 3 


ticular behavior was expressed as a percentage 


of the total time spent on a problem. Analyses 


of variance (Form B, Table 2) were per- 
formed on those percentage 
the nine individual behavioral 


the two composite behavioral categories. Form 


C of the analysis of variance was used for 
other sending and other receiving because 


entages to facilitate comparisons both 


data for five of 
categories and 


nr 


Vr rae” 


e two behaviors did not, by definition, 
ccur in the typewriting mode. Parts handling 
d other were not analyzed since they each 
jed too little of the average subject’s 
time (less than 3%) to be of practical im- 
_ portance. 
_ The mean percentages of time spent in each 
behavior, calculated in the same way as the 
mean times for each behavior, are shown in 
Figure 2. The categories in Figure 2 have 
been rearranged from their order in Figure 1 
to highlight certain comparisons that we find 
interesting. The figure shows clearly that sub- 
jects spend their time quite differently in the 
two modes of communication. Equally impres- 
ive, however, is the consistency with which 
_ the two kinds of subjects apportion their time 
within a mode. In the communication-rich 
mode, the skilled and unskilled typists never 
differ by more than approximately 3% in the 
percentage of time devoted to any activity. 
is one additional piece of evidence that 
successfully matched the two groups of 


The more important set of findings is shown 
in the two bars on the right in Figure 2. Even 
when typing, there is not much to distinguish 
between the two skill levels. To be sure, the 
skilled typists spent a smaller percentage of 
‘their time sending only than did the unskilled 
_ typists, but the difference, 3.4%, is much 
smaller than one might have expected in view 
of the large difference in typing skill between 
our two groups of subjects. It is also ironical 
to note that rather than exploit their modest 
advantage in communication efficiency by 


TABLE 4 


S OF THE VERBAL MEASURES FOR THE Four 
LEvEL-CoMMUNICATION MODE COMBINATIONS 


Communication- 


Typewriting 
rich mode mode 


Skilled | Unskilled | Skilled | Unskilled 
typists | typists typists | typists 
mber 978 messages 
per subject 74 
‘umber of words = N 
per subject 596 571 507 
re) 8 
ommunication rate n 2 
(words 17 5 minute 
of sending time) 122 27 18 
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turning to other pr > activities, the 
skilled typists offset t » by spending a 
greater percentage time walting 
(9.0% versus 4.5% ) d the unskilled 
typists. The sum of se: ly plus waiting, 
in the typewriting mod 8% for skilled 
and 40.6% for uns typists. These 
sums replicate almost | those obtained 
by Chapanis et al. for their “ex 
perienced” (39.4% nexperienced” 
(40.2%) typists. 

Verbal Measures 

Two measures of ling itput, number 
of messages and numbe: is exchanged, 
were counted directly fr: protocols for 
each subject. Dividing thi y the former 
gave us a mean messagi r each sub- 
ject. Finally, the total n words com- 
municated by a subject ided by the 
amount of time she spen sending to 
give us a second derived » communi- 
cation rate. 

The data for each of thes verbal mea- 
sures were entered into a B type of 
analysis of variance (Table e difference 
between modes of commun! was a sig- 
nificant source of variatio! number of 
messages, message length, an munication 
rate. No other significant efs: were found 
in the four analyses. The mes the ver 
measures for each skill leve!-rsode combina- 


tion are given in Table 4. 

Number of messages. A message began 
when a subject began to talk or type an 
ended when she had finished or was inter 
rupted by her partner. Table 4 shows that 
subjects sent about 2.8 times as many mes- 
sages when speaking as when typing. +% 
ratio between modes becomes more dramatic 
when one recalls that the time devoted t° 
message transmission in the communication- 
rich mode was only one-fifth that in the typ® 
writing mode (Figure 1). — 

Number of words. Earlier studies in 15 
research program had found that subjects 
the communication-rich mode, or in any CON” 
munication mode using a voice channel, have 
transmitted three to five times as many WF 3 
as subjects in the typewriting mode 6 
panis, 1973; Chapanis & Overbey, 1974). 
word counts in this experiment were in 


but the difference between the 
smaller and not statistically 


th. Since there is a large and 
ical! ificant difference between the 
„ages communicated in the two 
a small difference in the num- 
ised, it follows that message 
ffer. The data in Table 4 con- 
ects “packaged” their informa- 
y in the two modes: Spoken 
only about half as long as 
essages. 
tion rates. On the average, sub- 
icated six times as many words 
hen they spoke as they did when 
nis difference between modes is 
ally significant and large enough 
ctical importance, but not par- 
prising. The interesting feature 
n Table 4 is the relatively smell 
communication rates between 
levels in the typewriting mode. 
‘est of typing skill, the skilled 
far superior in performance to 
| typists (43 versus 21 words per 
nen typing to communicate in- 
| the solution of problems, how- 
ed typists typed only nine more 
minute than did the unskilled 


Effects 
$ presents the mean values of several 
dependent measures for the four days of test. 
Considering the ubiquity of learning effects 
in human experimentation, we expected that 
our teams would show some improvement in 
their ability to solve problems over four suc- 
cessive days of testing. Learning effects can- 
hot, however, be assessed directly given the 
experimental design of this study. While prob- 
ms and days are each balanced across the 
two communication modes, the unique prob- 
lem-day combinations are not so balanced. 
As a result, the effect of days is confounded 
With a possible Mode * Problem interaction. 
Because of that confounding, we believe there 
no defensible method of evaluating the 
Statistical significance of the differences among 
‘the means shown in Table 5. Nonetheless, 
Systematic changes in performance on any 


=. —~ 
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TABLE 5 


Means or Vanocs eee er Maasonte 
ox me Foca Days or Teer 


Á See 


| Day of test 


lila) sie 


Dependent variable 


‘Time to solution 


(minutes) si] a7 l s 
Number of messages | i } | 

per subject | 7 2 sj 
Number of words | 

per subject | sz | gov | m 
Message length | 

(words per message) | 14 nä n| N 
Communication rate i | ! 

(words per minute | 

sending time) 7 | u | 77 78 


— — 
Note. Bach entry te the unweighted mran for 16 honta 


dependent measure are notably absent. 
is some cross-validation for this observation in 
the study by Chapanis and Overbey (1974) 
With an experimental design that did permit 
a direct examination of the main effect 
days, they found days to be significant 
only one out of six dependent measu 
message length. 
Typing Skill 

The apparently counterintuitive statement 
that typing skill does not significantly aid in- 
teractive communication via typewriters be- 
comes less puzzling in light of two conclusions 
that may be drawn from the data, The be- 
havioral data show that about one-third of the 
total problem-solving time is devoted to send- 
ing, that is, typing. Thus, any potential ad- 
vantage in typing skill would be operating 
only one-third of the time. In addition, there 


speed for the 
= 43 words 


(M 
ing change 
less, that is, . 
The sharp decline in typing speed for the 
skilled typists may be attributed to the na- 
ture of the task, or, more specifically, the na- 
ture of the material typed. Muhich (cited in 
West, 19 60a) found that second-semester high — 
school typists with an average speed of 41.7 
words per minute on straight copy work typed P 
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at less than half that speed when they typed 
material such as letters, tables, and rough 
drafts. West (1969a), in discussing Muhich’s 
(cited in West, 1969a) data, attributed the 
decline in speed to a different set for produc- 
tion-task stroking behavior and to the in- 
creased role of planning and decision-making 
skills in working with material that must be 
organized as well as typed. Our pretest of 
typing skill, intentionally like those custom- 
arily used to select typists, was a measure of 
speed and accuracy in copying prepared 
text. By contrast, our experimental task, 
while not like the production tasks used 
by Muhich (cited in West, 1969a), required a 
great deal of planning and decision making 
as our subjects composed their messages at 
the typewriter. 


CONCLUSION 


In summary, it appears that the efficiency 
of typewriter communication for interactive 
problem solving is determined not so much 
: by how fast one can type as by how quickly 
one can formulate what to type. One cannot, 
of course, prove the null hypothesis. However, 
in the light of our results and those of studies 
cited earlier, we conclude that a typing skill 
prerequisite for using typewriter terminals 
k would produce, at best, only modest gains in 
systems efficiency. 
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stant forced rate used in the present study 
Was different for each subject (set to match 
his particular skill level on the basis of pre- 
ceding self-paced trials). Two separate experi- 
ments were conducted, one with seven-digit 
numbers and another with five-digit numbers. 
| Seven-digit numbers were used because of the 
large amount of self-paced data previously 
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ECTS OF GROUPING OF PRINTED DIGITS ON ’ 
RCED-PACED MANUAL ENTRY PERFORMANCE * ö 
E. T. KLEMMER ND L. P. STOCKER * 
Bell Laboratories, Holmdel, New Jersey 
evious studies have shown that optimum grouping of printed digits leads 
increased speed in keyboard entry, but differences in error rate due to 
rouping have generally failed to reach statistical significance. The present 
tudy presented numbers to be keyed at a forced-paced rate, chosen separately 
r each subject from his self-paced rate so as to provide moderate speed stress. 
nder this condition (n = 18), the error rate was doubled by using a 1-5-1 
rouping of seven-digit numbers as opposed to a 3-4 grouping. For five-digit 
aumbers, which cannot be grouped by 3s and 4s, no significant difference in 
performance was found (n= 24) between 3-2 grouping and ungrouped pre- 
sentation. 

Sever orevious studies have shown that gathered with sequences of this length. Five- 
the high speed in keying visually presented digit numbers are of interest because they, 
numbers curs when the digits are grouped cannot be grouped in 3s and 4s and are in 
by 3s an 4s (Klemmer, 1969). Differences common use, notably as zip codes. è 
in error e due to temporal grouping of 
auditory its can be very large, as shown METHOD 
by Ryan 609), who employed spoken recall, 4 pparatus 
but stud’ which have used visual stimuli in 3 f the Ul 

Jf- ksh h little differenc Numbers were presented one at a time on e 
pe pace “asks have shown A rence screen of a Model TIB Picturephone® set. This was 
in error as a function of grouping. Since gone by having the camera of a sending Picturephone 
some ke) ard devices (e.g., mail-sorting ma- set focused through a 132-millimeter lens on the i 
chines) oy »rate at fixed speeds, it is important printed output of a Model KSR35 teletypewriter 

Eok what h ti a func- which was in turn controlled by a Tempo computer, K 
; m appr e z The subject responded by pressing keys on a Touch- 
tion of grouping when a fixed presentation Tone® keyboard which was also connected to the 
tate is emp`oyed with visual stimuli. computer for automatic timing and scoring of 105 

The present experiments examined the curacy. Figure 1 shows the subject's position with 

ee 5 i i h t and Touch-Tone keyboard. 
effect. of uping upon error rate under display Picturephone set an k 
forced pacing in which rate of presentation is Stimuli 
0 f i i - y 8 
onstant for all trials and groupings. Traon Seven-digit numbers were presented in Experiment 


1 with one of three groupings chosen on the basis 
of past studies to represent the best (3-4), worst 
(1-5-1), and intermediate (no grouping) for seven- 
digit numbers in terms of speed of entry. Typical 
numbers appeared as follows: 5 : 

736 2311 1 


3 88912 4 
6723985 


Five-digit numbers were presented in Experiment 


2 either ungrouped or grouped 3-2, as was done bys 
Brown and Cornog (1970). Typical numbers ap- 
peared as shown: i 
43520 2 
149 22 re 


digits were randomly chosen without 
size of the number on the screen was 
ranged from 6.) centi- 


The presented 
restriction. The 
such that the horizontal extent 
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meters for the 1-5-1 grouping to 3.6 centimeters for 
the five-digit ungrouped numbers. Viewing distance 
was about 50 centimeters, No control of the position 
of the subject’s head was exercised. 


Subjects 


Eighteen employees of Bell Laboratories served as 
subjects in Experiment 1 (with seven-digit numbers): 
Six were randomly selected engineers, 6 did clerical 
work, and 6 were shop workers. The clerical subjects 
were selected by supervisors from clerical and typing 
pools, and the shop subjects were selected by super- 
visors from machine and wiring shops. Twenty-four 
subjects served in Experiment 2 (with five-digit 
numbers), 18 of whom were employees with the 


same job classifications as those in Experiment 1, plus 
6 housewives. 


Timing and Scoring 


During the self-paced runs, a new number would 
appear .2 seconds after the final digit was keyed 
without regard to accuracy. For the forced-paced 
condition, the numbers changed at a constant time 
interval set by the experimenter at the start of the 
tun. The computer compared the keyed digits with 
those presented and printed out all errors at the 

end of the run for both the self-paced and forced- 
paced conditions. 

The scoring interval in the forced-paced condition 
Was set to end 100 milliseconds after the number 
~ disappeared. In addition, a short tone (150 milli- 
l seconds) was sounded just before the disappearance 
of each number to warn the subject of the end of 

the scoring interval. The results showed this to be a 


satisfactory procedure to avoid out-of-phase 
sponses. 


. 
es 


re- 


The computer calculated a mean and variance for 


Ficure 1. View of display and keyboard from subject’s position 
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tered numbers 
used to com- 


the keying time for all corre 
in a self-paced run. These tim 
pute the timing for the forced-pa 


Procedure 

All subjects were first verbali; the self-paced 
instructions, which were as follow 
ppear on this 
the number as 
ag this Touch- 
the last digit 
a new number 
ough you may 
g at the key- 
r immediately. 


In all trials, a number wili 
Picturephone screen. You are t 
quickly and accurately as pe 
Tone telephone. As soon as y 
of the number, it will disapp 
will immediately flip into vi 
not see it change if you are look 
board. Begin keying the new nt n 
Do not pause or attempt to corr previous 8 
If you realize you made an error, there is 1 1 
to correct it, so just press the number sign H; 
[demonstration] to make the next number app = 
Press the number sign key if you finish 57155 
and the number does not change. Each of 1 
blocks of trials will contain 50 numbers 11 5 
Experiment 2]. Begin keying as soon as the 
number appears. 


i -paced 
In Experiment 1 subjects were given 10 self-pa 


warm-up trials, grouped 2-5, followed by ne oa 
rate runs of 50 trials each, with one run a of 5 
grouping, In Experiment 2 warm-up consiste runs 
self-paced trials, ungrouped, followed by DA a 
of 40 trials each, two of which were groupa 95755 
two ungrouped, The order was counterbalance! 
subjects. 8 
N the instructions for the forced-paced run 
were given to the subject. 
r er 
4The number sign key is the 12th key on the a 
right-hand corner of new Touch-Tone telephones: 


ee 


UPING OF PRINTED Dicits AND MANUAL ENTRY PERFORMANCE 


On the í your self-paced keying speed, a 
keying ti been calculated. This will be used 
as the re which the numbers on the screen 
change r words, they will be regularly 
changing ice which is independent of when 
you nov the number. Just before the num- 
ber will there will be a short tone sounded 
from th Stop keying that number even if 
you hav finished because otherwise they will 
be count ard the next number. 

Each sut zen had forced-paced runs, one at 
tach grcuy the reverse order of the self-paced 
runs. 

For eac iect all forced-paced runs were con- 
ducted at gle speed or timing determined as 
follows: ‘I ean time between numbers plus 1.28 
standard ons was computed for each of the 
seli-paced The minimum of these values was 
then used he constant time between stimuli in 
all forced l runs. If we assume a normal distri- 
bution of es within each self-paced run, this 
formula £ time which is exceeded by 10% of 
the respor mes for that run. If the subject main- 
tained thi umed normal distribution of response 
times in tl ced-paced run, then a 10% error rate 
would be cted based on slow responses alone 
for the be ouping and even more errors for the 
other gro zs. Actually, of course, the subject is 
capable lucing the number of slow responses 
during the er forced-paced condition, and so in- 
creases in yr rate by 10% are not reasonable 
Predictions 

At the c= of each subject’s session, which lasted 
an average 40 minutes, he was asked which group- 
ing he prefered 

RESULTS 

Figure 2 shows the mean time and error 

tate for each grouping under the self-paced 


condition and the mean error rate for each 
grouping under the forced-paced condition of 
Experiment 1. The differences in speed with 
grouping agree with the previous data in 
showing 3-4 best, ungrouped intermediate, 
and 1-5-1 poorest with .75 second, or 13%, 
Increase in entry time compared to 3-4. All 
differences in speed between groupings were 
_ Statistically significant. 

The 1-5-1 condition also showed the 
highest error rate of the three groupings under 
the self-paced condition. This difference be- 
tween 1-5-1 and 3-4 was statistically sig- 
Nificant; the differences between each of those 
5 and the ungrouped condition were 
e error rate differences between group- 
ail were larger under the forced-paced con- 
a 15 The 1-5-1 grouping resulted in twice 

any errors, on the average, as the 3-4 
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Ficure 2. Keying time and accuracy as a function 
of grouping. (Each point represents the average data 
from 18 subjects, each of whom keyed 50 seven-digit 
numbers. The s or ns between two points indicates 
whether or not the difference between those points 
is significant at the .05 level. Time and errors are 
calculated per seven-digit number.) 


grouping. This result was not due to a few 
subjects, since 16 of the 18 subjects made 
more errors with 1-5-1 grouping than with 
3-4 under forced pacing. The ungrouped con- 
dition again fell in between 3—4 and 1-5-1, 
All differences in error rates under forced 
pacing were statistically significant. 

All 18 subjects said they preferred the 3-4 
grouping to the 1-5-1, and all but one pre- 
ferred 3-4 to the ungrouped. The subjects 
divided evenly in preference between the 1- 
5-1 and ungrouped conditions. 

Comparing the self-paced and forced-paced 
conditions for the 3—4 grouping (the forced- 
paced speed was set on the basis of the 3-4 
self-paced speed in each case), we see that 
there was an increase in error rate from 5.6% 
for self-paced to 6.6% for forced-paced con- 
ditions over all subjects. This is much less 
than the 10% increase which would be pre- 
dicted by assuming that the distribution of 
keying times was the same in both tests. In- 
decd, only 8 of the 18 subjects showed any 
increase in errors between 3-4 self-paced and 
3—4 forced-paced conditions, so the overall 
difference was not statistically meaningful for 
the total population of subjects. These results 
suggest that people can eliminate some of 
their slow responses when forced, although 
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TABLE 1 


Tur ANp Errors N Keyinc Frve-Dicir NUMBERS AS 
A FUNCTION OF GROUPING UNDER SELF-PACED AND 
FORCED-PACED CONDITIONS 


Self- | Forced- | 


paced | paced ae pered 
Condition time | time pacs pa 
(in sec- | (in sec- ae 7 
onds) | onds) ün%) | (in 9% 
Grouped 3-2 3.76 4.3 4.1 
4.32 
Ungrouped 3.81 4.7 4.7 


Note. Table entries are means of 24 subjects each of whom 
keyed 80 numbers under each condition. Time and errors are 
calculated per five-digit number. 


confounding with practice makes this con- 
clusion tentative. 
Table 1 shows the mean times and error 
rates for the five-digit numbers of Experiment 
2. The overall results for the self-paced trials 
showed small and statistically insignificant 
differences between the grouped and un- 
grouped conditions. Each subject had the two 
conditions in counterbalanced order, so it is 
meaningful to consider differences between 
groupings for each subject. Of the 24 subjects, 
15 keyed faster with the 3-2 grouping, and 9 
faster with the ungrouped numbers (a sta- 
tistically insignificant difference). Error com- 
parisons were made for both the self-paced 
and forced-paced test, 11 subjects made less 
errors with the 3-2 grouping (9 made more 
and 4, an equal number). Again these differ- 
ences were statistically insignificant. Thus, 
the results generally favorable to the 3-2 
grouping are all small and might well be due 
to chance. 
These results are different from those of 
Brown and Cornog (1970), who found fewer 
overall errors for the ungrouped numbers, 
The difference may be due to the fact that 
they used short tachistoscopic exposures, 

whereas in the present experiment the stimu- 
lus was available for several seconds. 

When asked which grouping they preferred, 
20 of the 24 subjects stated a preference for 

the 3-2 grouping. This clearly significant re- 
Sult is surprising in view of the small perform- 
ance differences between groupings. 


Discussion 


is 


3 It is clear that under conditions of forced 
pacing, the effect of grouping upon errors be- 
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comes larger and more nt than under 
the self-paced conditii e any printed 
digits are likely to be under forced 
speed by some people s the time, the 
clear suggestion is that m (3s and 4s) 
grouping should be used printed digits, 
even if time saving und paced condi- 
tions is not an importan for particular 


epplications. A casual sur he way digits 
are grouped in docume ibers, credit 
cards, account numbers, umbers, etc, 
shows that this simple often over- 
looked. For five-digit nu hich cannot 
be grouped by 3s and ms to make 


little difference in perforn 
are ungrouped or grouped 
jects prefer the 3-2 grou 


hether they 
though sub- 
small per- 


formance differences sho: ve taken to 
mean that long sequen< rits can be 
grouped by 5s, however s work has 
shown that it is not d. (Klemmer, 


1969). 


Forced pacing in the p study was 


done by adjusting the pacin: } according 
to each subject. In many 9 ing systems 
such individual adjustment possible, 
but rather all stations must as the same 
speed. A reasonable predict: be made 


single-fixed- 
ercentage of 
isfactorily if 


from the present data for 
speed case is that a greate: 
people would fail to operate s 
less than optimum groupings were used. That 
is, for any criterion level oi errors, more 
people meet the criterion with optimally 
grouped stimuli. : 

Subjects preferred the 3-4 grouping as 
well as performed better with it, but prefer- 
ences did not distinguish between 1-5-1 a 
seven digits ungrouped. Subjects preferre 
the 3-2 grouping over five digits ungrouped, 
but performance differences were small. 
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tandard to difficult material. 


The re 
text mat 
tetically 
cal inter 
stand the 
in readir 
from effo 
tiveness c 
Most 

Klare, 10 


mship between the readability of 
| and reading rate is both theo- 
practically important. Theoreti- 
stems from attempts to under- 
oding process used by individuals 
while practical concern derives 
) measure and enhance the effec- 
itten communications. 
vious experimental studies (see 
have operated under the tacit 
assumption hat, outside of situational factors 
such as me vation, reading (inspection) rate 
at any give: time is primarily determined by 
the difficu y of the text elements which cur- 
tently confront the reader. The present ex- 
periment explores the possibility that effects 
of the readability of a given text element on 
inspection rate 3 persist over time. Specifically, 
the experimental question was whether the 
teading ease (Flesch, 1949) of recently in- 
Spected text influences the rate at which a 
subsequently presented text element is read. 
This question is of some practical concern 
ecause any substantial context effects pro- 
duced by readability would focus attention on 


* Requests for reprints should be sent to Ernst Z. 
othkopf, Learning and Instruction Research De- 
Partment, Bell Laboratories, 600 Mountain Avenue, 
urray Hill, New Jersey 07974. 

* Now at Ohio State University. 

2 term inspection rate is used in preference to 
ae rate throughout this article because of the 
ee Buous character of the term reading. Inspection 
a more neutral with respect to demands made 

Subjects by the experimental situations. 


ECTS OF READABILITY OF CONTEXT PASSAGES 
ON SUBSEQUENT INSPECTION RATES 


ERNST Z. ROTHKOPF * axb RICHARD P. COATNEY? 


Bell Laboratories, Murray Hill, New Jersey 


llege students (N =92) studied a 1,200-word (19-paragraph) passage pre- 
red in a standard (Flesch Reading Ease, RE = 51) and a difficult (RE = 20) 
sion, Half of the subjects switched after the first 10 paragraphs from the 
ndard to the difficult version or vice versa. The remainder saw all 19 para- 
ıphs in the same (standard or difficult) form. Inspection rates were found 
» depend not only on the readability of currently inspected text but also the 
adability of recently inspected text segments. The last 9 paragraphs in stan- 
ird form were inspected more slowly if the first 10 paragraphs were difficult 
han if the passage had been in the standard version throughout. Smaller con- 
ext effects on inspection rate were observed when subjects were switched from 


the variability of reading ease in text ma- 
terials. At present, the average readability 
index of a particular text material is the only 
characteristic typically considered in measur- 
ing the readability, However, Coke and Roth- 
kopf (1970) have observed a wide range of 
readability values in 200 samples drawn from 
a single chapter of a science text. 

The experimental exploration of “setlike” 
effects produced by text style is of general 
interest. For example, such studies may help 
to shed some light on discrepant results from 
studies of the relationship between readability 
and reading rate. Many of the early experi- 
ments in this area, which reported decreases 
in reading rate with lowered readability, must 
be disregarded because they used “words per 
minute” as a measure of rate. This measure 
is not suitable because average word length 
(in syllables or letters) is greater in the more 
difficult passages. Experiments that employed 
tate measures which were not affected by the 
dependency between readability and word 
length (e.g., syllables or letter spaces per unit 
time) have not produced consistent results, 
Three studies have found that inspection rate 
was relatively constant over a broad range of 
readability levels (Coke, 1974; Miller & Cole- 
man, 1971; Sticht, 1971), and one (Carver, 
1971-72) has reported mixed results. The 
exception was the result reported by Kershner 
(1964), who found that greater readability 
resulted in faster inspection speeds as mea- 
sured in the number of type spaces per unit 


of time. 
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nene 


(1974) used 32-90 passages per subject, 
‘Miller and Coleman (1971) used 36, Sticht 
1971) used 11, and Carver (1971-72) 8 
es of widely different readability levels 
subject. By contrast, Kershner's (1964) 
bjects read only 2 passages in quick suc- 
cession before being interrupted by a com- 
prehension test, and the total number of pas- 
‘sages per subject was limited to 4. 
The failure to observe influences of read- 
bility on inspection rate in experiments in 
ch the subject was asked to read a large 
er of passages could have been due to 
tent effects of readability of prior parts 
of the material read. If many short passages 
of widely different readability levels for each 
ject were used, any persistence in the 
fects of readability on inspection rate may 
h masked inspection effects produced by 
the individual passages; that is, the average 
dability of all recently read text elements 
may have contributed to current inspection 


The purpose of the present experiment was 
ð test the conjecture that inspection rate has 
extual determinants. The general method 
Was to measure whether the inspection rate of 
a given block of text was influenced by the 
reading ease of preceding text materials. 


METHOD 


experimental material was derived from a 
-word passage on the geography and history of 
nd (U.S. Department of Defense, 1965). The 
te consisted of 19 paragraphs. Two versions of 
passage, standard and difficult, were used. The 
dard passage was substantially identical to that 
previous experiment (Rothkopf & Coke, 
. Flesch Reading Ease, as estimated by com- 
rized os (see Coke & Rothkopf, 1970), 
on average, 51.03 per paragraph (SD = 
7). The difficult version was prepared by re- 

g the standard passages so as to use more 

cated sentence constructions and rarer word 


The x ent was run in two parts. In Experi- 
T e the subjects (n = 23) saw all 19 


* 
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paragraphs In the sta: m 0 
dard treatment). The subjects (e 25} 
saw the first 10 parse < difieet wis 
and the final 9 in the 'ndition (diio 
standard treatment), 1 nt 
subjects (m= 25) saw al aphs la the di 
cult version (difficult-dil e remainiag 
jects (n= 23) saw the T 
standard condition and 
version (standard-difficu 

After the entire text | pected, a cite 
pletion test (Cloze) was d consisting of 
the 19 paragraphs, each jon seen deing 
the experiment, with ninc 
nouns, adjectives, and s deleted tm 
each paragraph. 
Apparatus and Procedur 


A negative 35-millimet 
pared from each paragrap 
jector was used to project 
rear onto a 41.9 X 30.5 ce: 
screen (Polacoat LS7SPL 
approximately 60 centimeter 
subject. 

The onset and offset of ea 
by each subject with a hanc 
nected to the Carousel pr: 
mechanical shutter system. & 
electronic counter complete w 
recorder automatically record: 
was inspected. 

The subjects were directed | 
as rapidly as they could and 
elements of the content. The ; 
pletion test was administered in 
the room in which the text slides v 


Subjects 
Paid volunteer students (N= 92) from Fairlie 

Dickinson University in Madison, New Jersey were 

assigned in equal number to the four conditions 

in Experiments 1 and 2. 


RESULTS 
Experiment 1 


The results from Experiments 1 and 2 were 
analyzed separately because they were no! 
performed at the same time and because 7 
involved somewhat different populations 0 
subjects. > 

Inspection speed in syllables per minute 
was calculated separately for each para 
In Figure 1 the results of Experiment 1 are 
plotted for the standard-standard and difi- 
cult-standard treatments. etl 

The greater reading ease of the ma 3 
seen in the standard-standard treatment 


j 


i 
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Mean inspection rate in syMables per 
jects exposed to standard and to diffi. 
tent, 


dard and the dotted-line data to the 
lard treatment in the text.) 


f of the text resulted in inspection 
were approximately 75 syllables 
creater than those of the dificult- 
idition for paragraphs 1-10, 
rvations that were most critical 
text hypothesis were the rates at 
‘etaphs 11-19 were inspected. 
r.zraphs were the same for both 
s and at the standard readability 
sults were consistent with the 
pothesis in that the differences be- 
ents in inspection rates in para- 
ap © tended to persist on paragraphs 
“19. Aver axe inspection rate for paragraphs 
#7 was 2°2.4 syllables per minute for the 
Neult-standard and 358.9 syllables per min- 
for the standard-standard treatment. The 
ce between the two treatments tended 
intain itself throughout the second half 
he text. 
above conclusions were supported by 
of an analysis of variance of average 
on speeds for each subject on para- 
pas 11-13 and 17-19. The paragraph 
Sat the beginning and the end of the 
Section of the passage were used in 
alysis in order to test whether reliable 
effects persisted until the end of the 
‘treatment effects were significant (F = 
= 1/44, p < .05), but the effects due 


r a 


, oF alu, axo Susegqeeny last ben Rates 


at 
to the set (Le, paragraphs Il- ] 
versas 17-10) asd the — 1 k 


It should abo be noted that the 
inspection rate for paragraphs 11-19 ia 
Handard-standard condition was about 
syllables per minute greater than for 
graphs 1-10 even though the two blocks 
paragraphs were at the same readability 
for this treatment. This probably reflects 
slow acceleration in inspection time 
prolonged reading which has bren observed id 
previous experiments (eg, Rothkopf & Bis 
8 

slower inspection speeds on paragraphs 
11-19 in the difficult-standard treatments 
were accompanied by higher performance on 
those paragraphs on the completion test, 
Mean number of correct responses per pars- 
graph for paragraphs 11-19 on the comple. 
tion test were 4.08 for the difficult-standard 
treatment and 3.54 for the standard-standand 
condition. Mean correct responses for pars- i 
graphs 1-10 was 2.43 for the difficult-atan- 
dard and 2.93 for the standard-standard — 
condition. f 

A similar analysis of variance was carried 
out on the completion test data as on inspec- j 
tion speed, with the correct responses for 
paragraph sets 11-13 and 17-19 tallied sepa- 
rately. The analyses support the conclusion 
that subjects with the difficult-standard treat- 
ment learned more than the standard-stan- 
dard subjects. Treatment effects were dgnifi- 
cant (F = 6.74, dj = 1/44, p < 05). Neither 
the effect of paragraph sets nor the Paragraph 
Set X Treatment interaction was statistically 
reliable. 

Experiment 2 

The subjects who inspected the more read- 
able material in the first half of the text 
(standard-difficult), inspected paragraphs 11 
19 (difficult reading level) somewhat more 
rapidly (X = 291.5 syllables per minute) 
than those in the difficult-difficult condition 
(X = 269.9 syllables per minute). However, 
the difference between treatments, although 
consistent with the context hypothesis, was 
not statistically reliable (treatment F = 2.97, 
df = 1/44, .10 > p> .05). Average inspec- 
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tion rate for paragraphs 1-10 was 305.5 syl- 
lables per minute for the standard—difficult 
and 248.3 syllables per minute for the diffi- 
cult-difficult treatment. 

The data from the completion test indi- 
cated that the number of correct responses of 
paragraphs 11-19 were almost identical for 
the standard—difficult (X = 2.13) and the dif- 
ficult-difficult treatments (X = 2.17). Mean 
correct responses for paragraphs 1-10 was 
2.93 for the standard-difficult and 2.28 for 
the difficult-difficult treatments. 


DISCUSSION 


The results from Experiment 1 provide sub- 
stantial indications that the rate at which sub- 
jects inspect text depends not only on the 
reading ease of the materials with which they 
are currently confronted but also the reading 
ease of recently inspected text segments. The 
results from Experiment 2 suggest that the 
context effect is probably stronger when sub- 
jects follow difficult reading with easier ma- 
terials than when the switch is from medium 
to difficult readability. 

The slower inspection rates induced by 
previous exposure to difficult materials were 
accompanied by relatively small but signifi- 
cant increases in learning. However, this effect 
cannot be unequivocally attributed to modifi- 
cation of inspection activity by context, since 
the possibility cannot be ruled out that the 
difficult-standard treatment subjects’ per- 
formance on the test was influenced by learn- 
ing level attained on the first 10 paragraphs 
when readability was different for the com- 
pared treatments. 

The present experiment indicates fairly 
clearly that investigations of the relationship 
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between readability a 
be interpreted with c: 
employ short passa: 
readability levels. 


ection speed must 
the experiments 
widely different 
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some quit: 
answered. 
Whether lo 
abstrusenes: 
Case any 
formatting 


ındamental questions remain un- 
ne very basic question asks 
-al-tree superiority is due to the 
of the prose employed (in which 
mber of alternate methods for 
he information would do as well). 

The present experiment was designed to 
shed some light on such questions by compar- 
ing immediate and retention performance re- 
sulting from the use of logical trees to that re- 
Sulting from using a version of a prose format, 
lists of short sentences. In a third condition, 
ân effort was made to improve upon previous 
designs of logical trees by supplying short 
names (“codes”) for the task rules, which 
| Subjects associated with the rules. 

After all the data were collected, another 
experiment (Wright & Reid, 1973) was dis- 
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coding resulted in significantly ( < 01) more accuracy in one case, ‘ 


5 FOR PRESENTING PROCEDURAL INSTRUCTIONS 


covered which essentially duplicated the short- 
sentences and logical-tree conditions and, to 


a lesser extent, the retention conditions of the 
present experiment. Thus, the following ex- 
periment offers a check on the generality of 
findings for certain aspects of this previous 
investigation and also adds new factors in 
coding and variation in retention-related con- 


METHOD 


owns! of Ap? ol 
1974, Vel. 35. N 635680 
FOR) 
ARTHUR S. BLAIWES * 
Naval Training Equipment Center, Orlando, Florida 
y male college students performed a mock communication controller's 
under three different instructional formats (short sentences, logical tree, 
coding). Accuracy did not differ as a function of instructional format for 
“easier” types of problems. However, for two “more difficult” problem 
le coding and short sentences showed significantly (p< 01 and p< 05, 
pectively) greater accuracy in another. It was concluded that (a) format 
iations mainly influence difficult tasks; (b) logical-tree superiority appears 
be related to the number of conditions at a choice point and/or retention 
ditions; and (c) coding can be a potent technique. 
The pre experiment was concerned with 
improving quality of printed instruction 
by influe global characteristics of the 
presentati ormat. For example, instruc- 
tions can presented as continuous prose, 
an outline ‘ow diagrams, tables, etc. Rele- 
vant rese: has been restricted to compari- 
sons mai between only two presentation 
ſormats ‘cal trees (sometimes called algo- ditions. 
rithms, fic liagrams, or decision trees) and 
prose (De` , 1970; Jones, 1968). Although 
logical tre. ypically prove superior to prose, Syb jects 


Thirty male undergraduate students from Florida 
Technological University were randomly assigned, 10 
to a group, to the three experimental conditions 
(short sentences, logical trees, and coding). 


Materials 


The apparatus consisted of a synthetic communica- 
tions control console utilized by the subjects, and 
another control console with which the experimenter 
manipulated problems appearing on the subject’s 
console. For each problem, the subject’s task was to 
connect two hypothetical subscribers for communi- 
cation purposes. This was accomplished by: (a) 
determining which of five modes of operation to use 
and (b) performing a routine (consisting of setting a 
series of buttons and dials) on the console appro- 
priate to both the mode selected and particulars of 
the problem. Selection of modes was based on the 
experimentally manipulated instructions, on informa- 
tion contained in the problems, and on a wall chart. 
(See Bernstein and Gonzalez, 1971, for a detailed 
description of the experimental equipment and the 
subject’s and experimenter’s tasks.) 

Tapes were played for each subject describing the 
characteristics of his console and its various modes 
of operation. The tapes differed among experimental 
groups in their reiteration of sets of printed instruc- 
tions provided to the subjects. Printed instructions 
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to the coding group are exemplified in the 


YES ———> S-S 


RECEIVER'S 
FREQUENCY? 
Is there an open 
_ | channel in a re- 
| celver’s frequency 
or fi ? 


YES————> S-T-S 


(ete.) 


short-sentences group received identical instruc- 
tions in the following format; “Use S-S if there is: 
1. A common language, 2. A commen frequency, and 
3. An open channel in a common frequency. Use 
_ S-T-S if there is: 1. No common language, and/or 
{etc.).” Logical-tree subjects received the same 
printed instructions given to coding subjects with the 
omission of the one- or two-word code preceding 
_ each question. 

Two sets of thirty problems each were used equally 
for first and second test sessions. The order of sets 
of problems between test sessions and problems 
K within sets was systematically varied among subjects. 
Among the six groups of five problems, problems 
were equated for difficulty and type both within each 
set and also between sets. 


Procedure 


_ For motivation purposes, the subjects were in- 
formed by the experimenter that they could earn 
_ from $2 to $4 per hour, depending on how well they 
performed, and were not informed of their perform- 
ance level until their role in the experiment was com- 
pleted. Subjects were treated individually in three 
_ distinct sections of the experiment. 
During Section 1, each subject worked with only 
one of the presentation formats. Taped instructions 
Were presented, and five minutes were allowed for 
_ Study on the printed instructions. Then each subject 
Vas tested for five minutes at recalling his printed 
instructions and was corrected by the experimenter 
hen in error. Next, following directions from the 
k Pe 85 ae solutions on his console 
0 sample of each of the five problem $ 
In Section 2, all subjects were EA one 
t of 30 problems, Subjects were told they would 


be evaluated in terms 


eed sad axurxy 
and that they could refe 


rinted instrections 


after each problem was j but not after they 
started implementing t} n on the cosi 
The experimenter inic bjects when they 
were correct and told the he nature of thsi 
errors when incorrect. 

In Section 3, all subje i one week after 
the initial session for ano of 30 pre 


These problems were pr 
as those of the week befor 


he same miss 


RESULTS AN SION 


No overall effects d tructional for 
mat were found in se! ilyses of vari- 
ance on latency (time ate solutions 
to problems), accuracy tage of errors 
made in performing so! \roblems), of 
referrals (percentage ns on w 
subjects referred to nstructions). 
However, a significant n ($< 0$) 
was obtained with all ms on accu. 
racy between instructior at and prob- 
lem type (see Table 1). zal Newman- 
Keuls tests showed that t ng group was 
significantly more accur: < 01) thas 
both logical-tree and shor nces groups 
on Deny problems. Furth: iracy scores 
of coding and short-senten: ups were sig- 
nificantly greater (p < .0} $ < 05, re 
spectively) than those oi logical-tree 
group on S-S interrupt problei Accuracy on 
the other three problem ty; id not differ 
significantly as a function <f instruc 
format. These findings are important because 


Deny and S-S interrupt problems were more 
difficult to perform than the other three prob- 
lem types (see Table 1). : 

Lack of accuracy differences on re prob- 
lems may have been due to a ceiling “i 
(ie., the scores were incapable of measuring 
higher levels of ability). Alternately, maxi 
accuracy ability on the simple problems a 
not have required the more efficient formal 
Absence of overall significance among accu 
racy scores (in view of the large accu 
differences across all problems) ran 
high variability within format groups px 5 
dard deviations on accuracy were 13%, 1 1 
and 21% for coding, short sentences, an 
cal trees, respectively). 

Analyses of error types indicated that a 
curacy advantages among formats W 


. 
Fonuas Fox PRESENTING 


Gon of differences in skill at 
correct procedures once the 
as determined, Presumably, the 
ormats served to unburden the 
n pressures of mode selection, 
time and thought for other 
problem. A major error source 
o the greater difficulty of Deny 
a tendency to mistakenly use 
terrupt” modes instead of Deny. 


mainly 3 
be 
proper n 
more effi 
perform: 
allowing 
aipects 
contrib: 
problen 


one of U 


Prese ta support the Wright and Reid 
finding performance effects associated 
with pr: tion formats were manifested to 
the gre extent when the problems were 
dificult vever, particulars from the two 
experim differ under similar conditions; 
with di problems and written materials 
availab! subjects, Wright’s logical-tree 
group ` more accurate than his short-sen- 
tences ; ip, This discrepancy might be ex- 
plained de likelihood that present subjects 
Operate m memory to a greater extent 
(see lo erral rates in Table 1) than sub- 
jects in ion I of the Wright study (where 
the logi ee superiority was found), Where 
Wright :bjects were required to perform 
without written instructions (Sections II 
and II] de superior logical- tree accuracy 
disappe Thus, when memory is required, 
it appen nat logical trees are no better than 
Short se: onces, and in fact, short sentences 
may be -iter than logical trees. For logical 
trees to superior to short sentences under 
memory- nanding conditions, coding may be 
Tequired. 


Another plausible explanation for the noted 
discrepancy in accuracy effects between the 
two experiments is that benefits typically de- 
tived from logical trees were lost in the 
Present case due to confusion from the com- 
bining of conditions into compound sentences 
at single-decision points. 

More rigorous training for retention in the 
Present study may account for the finding 
that although accuracy and latency improved 
zor some formats and deteriorated for others 
in the Wright study, these scores (along with 
referrals) consistently improved to the same 
extent with all formats in the present experi- 
ment, Also, if memory did fade, it could be 
F eshed under present conditions by referral 

the instructions. 


< V a 


Ineteverions 


TABLE t 


Mean SU, roa La! 
e 


Fozwar ax» Damme Tyre 
— —ͤ— = — 
nnr 
. hee 


Latency (in seconds) 


Short sentences | 70> Jọ | w| w 
Logical tree w | T | 2 
Coding 1 % 

Reterrah * 7 
Logical tree AS l o |23 |18 
Coding “|o |» |2 

Accuracy (percentage of errors) 

Short sentences a | > ip | 1> 
Logical tree * A 
Coding w| „ ie) oo 


Superior accuracy in the coding conditia 
may have been due to several factors: (a) 
The codes may have served as aids to retrieval 
and/or reconstruction of a more coberent 
version of the rules. (b) The codes may have 
acquired the meaning of the rules resulting in 
faster and more accurate processing of rules. 
(c) Shorter times required to operate with 
the codes than with the rules, per se, may 
have permitted more practice in Section 1. 

It is not clear from the present dati 
whether this particular logical tree is a nece: 
sary condition to coding for the beneficial 
effects obtained from coding. That such short- 
hand languages probably do exert some in- 
fluence on performance in a variety of con- 
texts, however, is suggested by the fact that 
they develop naturally, over time, even where 
no systematic effort is made to do so. Furthe 
coding is related to a notion concerning 
general nature of retention. Theory sug 2 
that adult human memory may be derived 


ntage to systema assuring 
coding associated with a task is suitable and 
it enters appropriately (in training and 
haps in a logical-tree format) into per- 
formance. Before such a recommendation can 
be made with confidence, however, further 
research is needed to unscramble possible in- 
ons among coding, retention require- 
s, and other characteristics of the pre- 
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iree years have produced a major 
erest in the psychological litera- 
understanding and solving the 
the hard to employ (e.g, Feld- 
1973b; Friedlander & Greenberg, 
kin & Sherwood, 1974; Goodman, 
„ Salipante, 1973; Rosen & 
1; Triandis & Malpass, 1971). 
olf this research effort appears to 
ning for the hard to employ is in- 
Doeringer, 1969; Johnson, 1969; 
lonsky, & Rosen, 1970). Absen- 
ness, and turnover rates are un- 
ı among the hard to employ (Fer- 
Purcell & Cavanaugh, 1972; Ross 
& Whee). 1971; Ruthledge & Gass, 1967) 
and do n appear to improve with training. 
In the resent article we argue that one of 
the major {actors that is responsible for the 
lack of success of training programs is the 
neglect of cultural variables. Training which 
lakes the cultural variables into account can 
be provided via the culture assimilator (Fied- 
ler, Mitchell, & Triandis, 1971). A theoretical 
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'ESIGNING PREEMPLOYMENT TRAINING FOR 
THE HARD TO EMPLOY: 
A CROSS-CULTURAL PSYCHOLOGICAL APPROACH * 


\RRY C. TRIANDIS, JACK M. FELDMAN,* axo DAVID E. WELDON * 
University of Ilinois, Urbana-Champaign 
WILLIAM M. HARVEY 
St, Louis Narcotics Service Council 
Most reports concerning the effects of training of the bard to employ suggest 
iat the training is ineffective. One major factor responsible for the lack of 
iccess may be the neglect of cultural variables. A theoretical framework is 


resented which helps in the design of training programs that take cultural 
ariables into account, Studies employing this framework are reviewed. 
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framework for the design of such training i 
presented. 
PRELIMINARY CONSIDERATIONS 


The behavior of “leaving the field” which 
is analogous to turnover, can be analyzed 
a response that reduces noxious stimuli, An 
yses of “culture shock” (Gullahorn & : 
horn, 1963) suggest that a person who finds 
himself in a different culture sometimes ex- 
periences a severe disorientation and depres 
sion. His habitual responses do not result 
reinforcements, but instead appear to be 
related to outcomes. Without some p 
tion the experience can be so punishing that 
escape from the field is the only rewarding — 


A significant factor in reducing such dis- 
orientation is the development of institutions 
and personnel in the host environment who 
can recognize the visitor’s needs and can help 
him adjust (Fromkin & Sherwood, 1974; 
Triandis, 1974b). Triandis and Malpass 
(1971) reviewed several factors which suggest 
that it is more important to train the super- 
visor of the hard to employ than it is to train 


visors and workers. 

The goal of intercultural training is 
make the behavior of each participant in an 
interpersonal relationship more satisfying io 
the other member. Thus if the foreman and 
the worker belong to different cultures, good 
training should make the worker’s behavior 
more reinforcing to the foreman and the fi a 


. 
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man's behavior more reinforcing to the 
worker. If this is accomplished the worker will 
be less likely to leave the field. 


ANALYSIS oF CULTURAL VARIABLES 


The classic anthropological approach is 
participant observation, One lives in the com- 


" munity he is to study and interviews various 


“informants” about the way things happen. 
An ethnography is a report of the way people 
use language, their customs, privileges and 
duties in relation to various types of people in 


their community, forms of social and political 


organization, economic activity, belief sys- 
tems, religious rituals, values, and so on. In 
the case of the black ghetto good material re- 
flecting such topics can be found in Clark 
(1965), Gladwin (1967), Hannerz (1969), 
Keil (1966), Liebow (1967) and Suttles 
(1968). The most perceptive presentation and 


summary is probably Valentine's (1968). 


The black ghetto is different from other 


black and white environments and results in 


R 


a 
ar - 
* 
N 


ne 


perceptions of the social environment that 
differ substantially from those typically found 
among the middle class. As an example con- 
sider some differences in language: 


Black Ghetto 
English* Standard English 

ace = good friend 

bad stylish, very good, 
or boss 

down agreeable, best ever 

F. B. I. fat, black, and 
ignorant 

fuzz police 

gig job 


In addition, there is evidence of cultural 
differences in attitudes toward nonstandard 
family patterns (Rodman, Nichols, & Voyda- 
noff, 1969), greater latitude for what is con- 
sidered acceptable behavior (Rodman, 1963) 
and weaker perceived connections between 
behavior and outcomes (Triandis, Feldman, 


Te Harvey, 1970, 1971a, 1971b, 1971c; 


i ae 
Personal communication from Dr. Daniel Landis 
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Triandis, Weldon, & 
ghetto than in the 
white. The ghetto is 

lar perceptions of soci 
tem distrust (Triandi 
Harvey, in press). E 
label that subsumes su 
interpersonal trust, reje 
one’s environment, 
figures, and the perce 
connection among event 
vironment. 

It seems obvious that i 
not taken into account 
ing programs, it is very ! 
ing will prove unsuccess! 
personal relations betwe 
in job settings, we need a whites (a) 
to recognize the substan -ences which 
exist among different k lacks (eg., 
ghetto versus middle cla ) to under- 
stand the black point of Similarly, we 
must train the black ghett: ers to under- 
stand the middle-class p< view, black 
or white. 

Differences between blac! 
the poor and the rich, have 
a long time; the literatu 
Race differences have b 


, 1972) in the 
class, black or 
zed by particu- 
yns and ecosys- 
an, Weldon, & 
1 distrust is a 
»mena as lower 
institutions in 
of authority 
little or no 
ring in the en- 


lifferences are 
esigning train- 
hat the train- 
improve inter- 

; and whites 


i whites, and 
liscussed for 
voluminous. 
discussed in 


Baughman (1971), Ehrlich (1973), Fanon 
(1967), Frazier (1932, 1 , Glazer and 
Moynihan (1963), Gol id (1970), 


ester (1968), 
yiyrdal (1944), 
and Wilcox 
cial class dif- 


Hauser (1971), Jones (1972 
Miller and Dreger (1973), 
Pettigrew (1964), Rose (19 
(1971), among many others 
ferences have been discussed extensively bY 
Lewis (1959, 1960, 1961, 1966) and Miller 
(1958). Difficulties associated with the em- 
ployment of blacks have been discussed by 
Hearns (1968), Kirkpatrick (1973), Pinte 
and Buchmeier (1973), and Purcell an! 
Cavanaugh (1972), among others. Obvious 
there are practical difficulties in imparting 
this much information during training. 5 
need a framework which selects those aspects 
of the information that are most impor 
for the improvement of interpersonal rela ia 
between a black ghetto worker and his sa 
visors, black or white. This framework is P" 

sented below. 
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ROSS-CULTURAL APPROACH TO TRAINING THE Harp To Eurtor 


THEORET FRAMEWORK FOR THE DESIGN 
OF 3S-CULTURAL TRAINING 

Trian )74a) has analyzed the prob- 
Jem of i rsonal behavior across cultures 
as one ch people make inappropriate 
attributi Kelley, 1967). A relatively well 
establis} enomenon discussed by attribu- 
tion the: (Jones & Nisbett, 1971) is that 
the act: his own behavior as controlled 
mostly tuational factors, while the ob- 
server s the actor’s behavior controlled 
mostly versonality traits or internal pre- 
dispositi Kelley puts it slightly differently, 
pointing that the observer does not give 
sufficien eight to situational variables. 
Triandi; 4a) argues that one of the ways 
to impro interpersonal relations is to make 
the obse er more similar to the actor in his 
attribut about the causes of the actor’s 
behavio> un short, to increase isomorphic 
attribut The actor is usually a more ac- 
curate a vst of his own behavior (except in 
the cas- abnormal behavior). Thus, by 
using th riterion of isomorphic attributions, 
we can rove the effectiveness of the ob- 
servers alysis of the actor’s behavior. 
Further e, when the observer makes iso- 
morphic «‘iributions, he increases his general 
Similarity to the actor, which should eventu- 
ally lead o greater interpersonal attraction 
(Byrne, 1°59). 

For example, if a supervisor and a sub- 


ordinate scree that a job is to be finished by 
a certain date, but the supervisor perceives 
the antecedents of this event as “employee 
Participation,” while the subordinate per- 
ceives it as “the boss’s orders, when the 
employee later discovers that he is unable to 
complete the job on time, the supervisor may 
Conclude that the employee is “unable to set 
realistic time standards” while the employee 
May conclude that “the boss gave the wrong 
Orders,” leading to a cycle of misunderstand- 
ings. Isomorphic attributions would involve 
either both agreeing that it is an order or both 
Understanding the principles of employee par- 
ticipation. 

Triandis’s (1974a) analysis of interpersonal 
Conflicts across cultures suggests that noniso- 
Morphic attributions are a major, though a 
Subtle, problem. Tt is particularly serious in 
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cross-cultural encounters because people have 
different “subjective cultures.” 

Culture is here defined as the man-made 
part of the human environment (Herskovits, 
1955). Subjective culture is the subjective 
reaction of members of a cultural group to 
the man-made part of the environment, A 
cultural group is a set of individuals who 
speak a mutually understandable dialect” 
(Naroll, 1968). Subjective culture can be 
studied through the development of culture- 
specific (emic) measures which have culture- 
general (etic) properties, as described in 
Triandis, Vassiliou, Vassiliou, Tanaka, and 
Shanmugam, (1972). Such an approach per- 
mits each cultural group to express itself in 
a manner that is most appropriate for it, but 
also permits comparisons of the responses of 
different cultural groups. 

Social scientists have used about 80 con- 
cepts that can be defined as intervening vari- 
ables, mediating between consistencies in 
social situations on the one hand and con- 
sistencies in responses on the other (Camp- 
bell, 1963). Among the most frequently used 
of these concepts, listed alphabetically, are 
attitudes, beliefs, concepts, norms, roles, and 
values. Triandis, Vassiliou, Vassiliou, Tanaka, 
and Shanmugam, (1972) argue that some of 
these concepts are required for the prediction 
of behavior, but a much smaller set of con- 
cepts than the set currently in use is essential. 
They present a theoretical framework which 
uses (a) behavioral intentions, measured with 
the behavioral differential (Triandis, 1964); 
(b) norms; (c) roles; (d) general behavioral 
intentions, measured with norm and role dif- 
ferentials (Triandis, Vassiliou, & Nassiakou, 
1968) and generalized behavioral differen- 
tials; (e) the affect toward the behavior, 
measured by semantic differentials (Osgood, 
Suci, & Tannenbaum, 1957) in which the be- 
havior is used as a concept and it is rated on 
several scales of the evaluative factor of the 
semantic differential; (f) the perceived con- 
sequences of the behavior, measured on prob- 
ability scales with a semantic differential 
format; and (g) the value of the conse- 
quences, measured by semantic differentials 
in which the consequence is the concept rated 


on evaluative scales. 
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Specifically, but in summary, Triandis ar- 
“gues that the probability of a particular be- 
havior (P,) is a function of previously estab- 


lished habits (H), the intention to behave 
(I, and facilitating conditions (F). 


$ P, = @H + BDF. 


Habits reflect the subject’s previous rein- 
_ forcement history; facilitating conditions in- 
clude motivation, ability, and the difficulty of 
carrying out the act. Intentions are a function 
of social factors (S), the affect toward the 
behavior (A), and the cognitions (C) asso- 
_ ciated with the behavior. 


I= yS+6A+C, 


The social factors reflect existing norms 
(N), roles (R), the self-concept (I am the 
_ kind of person who behaves this way), and 
interpersonal agreements or contracts; the 
affect toward the behavior reflects classical 
conditioning involving the thought of the be- 
havior, resulting from pleasant or unpleasant 


experiences associated with the particular be- 
havior; the cognition is a function of the per- 
_ ceived consequences and the value of these 


consequences. 


C= LPV. 


c=1 


The coefficients a, £, y, 8, and e reflect 
interactions among characteristics of the per- 
Son, situation, and behavior. For example, 
_ high F-scale subjects, in cultures that clearly 
Specify that a person must pray in a particu- 
ir way at a particular point in time each day, 
typically do so. This indicates a very high y 


PA 


coefficient for praying when the cues suggest 
itis the proper time. High coefficients for one 


ariable imply low coefficients for the others, 
ecause 


* ＋ g 1 andy & e =I. 


When a person from one culture observes 
the behavior of a person from another culture, 
he is likely to make wrong attributions be- 

use of differences in (a) the weights of a 
Particular determinant of behavior and (b) 

the values of H, I, F, S, A, or C. 

The analysis of subjective culture (Tri- 
dis, Vassiliou, Vassiliou, Tanaka, & Shan- 
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mugam, 1972) requ 
relevant to each oí 
example, one studies t 
behavior in role sity 
1968). Triandis et al 
1971c) covered mo 
ghetto blacks and 
Feldman (1972) cros 
with another sample 
From such data one 
attributions made by 
are likely to be most 
perceive the behavior 
culture. 

One kind of trainin 
toward changing these 


quisition of data 
ints. Thus, for 
ptions of proper 
Triandis et al, 
1971a, 1971b, 
se points for 
Weldon, and 

d the findings 
ud to employ, 
ver where the 
of one culture 
te, when they 
ers of another 


i be directed 
ions. Specifi- 


cally, let us assume tha Sof Culture 
A see the behavior of a of Culture B 
and attribute it to his tion,” while 
members of Culture B at the very 
same behavior attribute iis perceived 
role. It seems plausible th bers of Cul- 
ture B, who after all kn e about the 


bers of their 
rate in their 
yerson’s be- 
from Culture 
role percep- 
bility to work 
is the basic 
recent «culture 


causes of the behavior oi 
own culture, would be mo: 
analysis of the causes of 
havior. Thus, if we train pc 
A to see the behavior as d 
tions, we might improve the 
with the particular person. 
principle behind the most 
assimilators.”” 

Earlier assimilators seemed ‘o have been 
somewhat effective (Fiedler et al., 1971; 
Mitchell, Dossett, Fiedler, & Triandis, 1972) 
but they were not based on a theoreti 
analysis. The new assimilators should prove 
more effective. 

The culture assimilator is a programmed 
learning sequence in which incidents of inter 
cultural conflict are described and the trainee 
is asked to judge the relevance of various” 
factors as causes of the incident. Thus, 4 0 
or two paragraph story is presented, 1 
followed by a multiple-choice format witht 
four answers. When training members of Cul- 
ture A, one of the answers is an attribution 
that members of Culture B make in ana 
ing the behavior of members of Culture 2: 
The other three are attributions that 1 
typically made by members of Culture A K : 
explain the behavior in the particular inch 
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dent. A c trainee selects one of the four 
answer: given feedback. If he selects 
the rig! wer he is told why it is the right 
answer y by reference to norms, values, 
etc. of her group. If he gives the wrong 
answer told to read the incident again 
and sel. other answer. 

In t g white or black middle-class 
manage might use an incident reflecting 
ecosyst strust. In the hypothetical inci- 
dent a ian tells his black ghetto worker 
that if worker behaved in a particular 
way he l receive a 10-cent-an-hour raise. 
Some passes during which the worker 
appear esponsive. 

Exp ions of the unresponsiveness of the 
worker ht include: “He is lazy,” “He is 
dumb, e is not interested in more money,” 
and “F does not trust his foreman.” Given 
the inf. ation that the worker is from the 
black ; o and what we have found about 
ecosys! distrust, the last attribution is 
more | ible than the others. Training the 
supery o make such attributions is likely 
to suge to him other ways of handling the 
situatii For example, he might make a 
“promi that requires the worker to do 
somethi: that is very easy for him to do and 
prompt!, fulfill his promise. Repeated ex- 

_ Posure ci -he worker to the connection “super- 
visor is ustworthy” may make other prom- 
ises, wh: are less easy to fulfill, more be- 
lievable 

The re‘siionship between the black worker 


and his job environment may involve literally 
thousands of “interactions.” Each interaction 
can be misunderstood; distrust can give it 
a different meaning. The negative meaning of 
Such interactions produces a total impact of 
5 “very negative environment,” hence the 
leaving the field? behavior. The hypothesis 
Suggested by this analysis is that training the 
Supervisor (and the fellow workers) may re- 
Sult in enough changes in the valence of these 
Mteractions to reduce the noxiousness of the 
environment for the black ghetto worker. 


Discussion 


The approach advocated here should not be 
considered a panacea. In fact, it is too early 
to know how well it is going to work. So far 
We have established only that trained indi- 


Viduals change the kinds of attributions they 
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make about the behavior of others from other 
cultures, and this allows them to be more 
“tolerant” in judging such behavior (Weldon, 
Carlston, Rissman, Slobodin, & Tri d 
1974). But we are still at the level of aller- 
tive responses, not at the level of changes in 
behavior. One could argue that changing the 
way one feels about a culture is a sine qua 
non, if other desirable changes are to occur, 
But this remains to be established. 

It is unlikely that the assimilator will work 
in situations in which the person is not rein- 
forced by his membership groups for . 
propriate” behavior toward members of other 
cultures, Thus, if the supervisor of a foreman 
that is trained with the assimilator rewards 
the foreman who retains more hard to em- 
ploy workers, and punishes, or at least shows 
his displeasure toward, the foreman who has 
poor retention rates, this “environment” is 
likely to me ke the assimilator useful. An en- 
vironment in which management is not pro- 
viding differential reinforcements will not 
allow the assimilator to do his part in cross- 
cultural training. 

When comparing the assimilator with other 
methods of training it should be remembered 
that it is a highly concentrated form of “cog- — 
nitive training.” Basically, all it does is im- 
part information. One could accomplish the 
same aim with a book. However, many — 
trainees find programmed learning materials 
of a “problem-solving” kind, such as the 
assimilator, more enjoyable than a book. a 

It is difficult to compare the assimilator 
with T groups and encounter groups because 
there are so many different kinds of such 
groups (Back, 1974). However the one point ; 
that is clear is that in such groups a person 
might make cross-cultural errors, in judgment 
or behavior, which may prove embarrassing. 
The assimilator is “safe” from embarrassment. — 
Tt seems reasonable to assume that assimilator 
training should come before other kinds of 
training to protect the trainee from embar- 
rassment. Comparisons are also difficult be- 
cause assimilator training takes only 610 
hours, while most T groups take much more 


time. 

Role playing the other group’s roles may 
be helpful, but role playing assumes the 
trainee knows how to take the role of the 
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other. If he has no information, his role taking 
may be silly. Again, the culture assimilator 

i may be best before such playing. 

In addition to cognitive training there are 
forms of training which might be labeled 
“affective” (e.g., having pleasant experiences 

im the presence of members of the other cul- 

ture) and “behavioral” (e.g., learning to emit 
behaviors which members of the other culture 
find rewarding). There is no contradiction 
among these types of training, and some com- 

bination of them should prove superior to a 

single method. It is likely that a behavior 

modification approach added to assimilator 
training will prove particularly effective. Here 
the psychologist identifies behaviors that each 
cultural group finds rewarding and arranges 
= “contracts” for each to reward the other and 
for the mutual reinforcement of a supervisor— 

-= subordinate pair. 

The cultural information obtained to de- 
velop the assimilators can be used also in the 
development of such a behavior modification 
program. Thus, this information is funda- 
mental to any attempts at integration of the 
anthropological and psychological approaches 
to training. 


REFERENCES 


Back, K. W. Intervention techniques: Small groups. 
È Annual Review of Psychology, 1974, 25, 367-387. 

Baughman, E. E. Black Americans: A psychological 
analysis. New York: Academic Press, 1971. 

Byrne, D. Attitudes and attraction. In L. Berkowitz 
(Ed.), Advances in experimental psychology. Vol. 
4. New York: Academic Press, 1969. 

Campbell, D. T. Social attitudes and other acquired 
behavioral dispositions. In S. Koch (Ed.), Psy- 
chology: A study of a science. New York: Mc- 
Graw-Hill, 1963. 

Clark, K. B. Dark ghetto: Dilemmas of social power. 
New York: Harper, 1965. 

Doeringer, P. B. Programs to employ the disad- 
ees Englewood Cliffs, N.J.: Prentice-Hall, 
Ehrlich, H. J. The social judi 
New York: Wiley, on a ee 
Se 10 skin, white masks. New York: Grove 
nn J. M. Race, employment, and the evalua- 

58 a of Applied Psychology, 1973, 

aa J. M. Race, economic class, and perceived 
A ae of work and unemployment, Journal of 

. bblied Psychology, 1973, 58, 16-22. (b) 
Ferman, L. A. The Negro and equal empl 
opportunities. New York: Praeger, 1968 Peet 


5 
à 
f 
l 
3 
5 
3 


TRIANDIS, FELDMAN, WELDON, AND Harvey 


Fiedler, F. E., Mitchell 
culture assimilator: A 
training. Journal of A 
95-102. 

Frazier, E. F. The Negre 
University of Chi 

Frazier, E. F. The Negro j he United States 
(Rev. and abridged ex o: University of 
Chicago Press, 1966. 

Friedlander, F., & Green 
tudes, training, and or: climate on per- 
formance of the hard-c« loyed. Journal of 
Applied Psychology, 1971 295. 

Fromkin, H. L., & She Integrating the 
organization. Glencoe, Ill ss, 1974. 

Gladwin, T. Poverty, U.S.A 1: Little, Brown, 


riandis, H. C. The 
h to cross-cultural 
chology, 1971, 55, 


Chicago, Chicago: 


go F 


ect of job atti- 


1967. 

Glazer, N., & Moynihan, D ond the melting 
pot: The Negroes, Puer! Jews, Italians, 
and Irish of New York abridge; MIT. 


Press and Harvard Unive , 1963. 

Goldschmid, M. L. Blac ins and white 
racism: Theory and rese v York: Holt, 
Rinehart & Winston, 1970. 

Goodman, P. S., Paransky, H 
training, and retaining the 
A selected review. Journal 
1973, 58, 23-33. 

Gullahorn, J. T., & Gullahorn 
the U-curve hypothesis. Jo: 
1963, 19, 3, 33-47. 

Hannerz, U. Soulside: Inquiri 
and community. New York 
Press, 1969. 

Hauser; S. T. Black and whi 
New York: Wiley, 1971. = 

Hearns, J. P. New approaches eet post-hiring 
difficulties of disadvantaged workers. In G. C 

Summers (Ed.), Industrial Re 5 Research A5- 
sociation: Proceedings of the 25st annual winter 
meetings, 1968, 207-216. 

Herskovits, M. J. Cultural anthropology. New York: 
Knopf, 1955. 

Johnson, L. A. Employing the he 
AMA research study 98. Nev 
Management Association, 1969 

Jones, E. E., & Nisbett, R. E. The actor and the 
observer: Divergent perceptions of the causes O 
behavior. New Vork: General Learning Press, 197% 

Jones, R. L. Black psychology. New York: Harper 
Row, 1972. 8 

Keil, C. Urban blues. Chicago: University of Chicago 
Press, 1966. 

Kelley, H. H. Attribution theory in social psychology 
In D. Levine (Ed.), Nebraska Symposium 15 
Motivation: 1967. Lincoln: University of Nebras 
Press, 1967. > 

Kirkpatrick, J. J. Occupational aspirations, oppor 
tunities and barriers, In K. S. Miller &. R H 
Dreger (Eds.), Comparative studies of blacks a 
whites in the United States. New York: Semin 
Press, 1973. 1 25 

Lester, J. Look out, whitey! Black power’s gon 8 
your mama! New York: Grove Press, 1968. 


ante, P. Hiring, 
e unemployed: 
ed Psychology, 


An extension of 
Social Issues, 


ghetto culture 
bia University 


sity formation. 


core unemployed. 
Vork: American 


Lewis, O. Fit ies: Mexican case Studies in the 
culture of } New York: Basic Books, 1959. 

Lewis, O. Tep Village in Mexico. New York: 
Holt, 1960 


is, O. The en of Sanchez. New York: Ran- 


dom House 


is, O. Th ire of poverty. Scientific Ameri- 
can, 1966, 2 19-25. 

icbow, E. 7 orner. A study of Negro street- 
corner men n: Little, Brown, 1967. 
filler, K. S czer, R. M. Comparative studies 
of blacks hites in the United States. New 
York: Sen ress, 1973. 

Miller, W. B ver class culture as a generating 
milieu of delinquency. Journal of Social 
Issues, 195 5-19. 

Mitchell, T., tt, D. L., Fiedler, F. E., & Triandis, 
H. C. Cult aining: Validation evidence for the 
culture as or. International Journal of Psy- 
chology, 1 97-104. 
organ, B slonsky, M. R., & Rosen, H. Em- 
ployee atti toward a hardcore hiring program. 
Journal o died Psychology, 1970, 54, 473-478. 

Myrdal, G American dilemma, New York: 
Harper, 1 
Naroll, R. < thoughts on comparative method 
in anthror In H. M. Blalock & A. B. Blalock 
(Eds.), M ology in social research. New Vork: 
McGraw-! 968. 

Osgood, C. F ci, G., & Tannenbaum, P. The mea- 

- surement weaning. Urbana, III.: University of 
Illinois Pre- 1957. 

Pettigrew, T A profile of the Negro American. 
Princeton: . Nostrand, 1964. 

Pinto, P. R., „ Suchmeier, J. O. Problems and issues 
in the empio nent of minority disadvantaged and 
female grouß An annotated bibliography. (Bulle- 
tin 59). Minneapolis: Industrial Relations Center, 


University of Minnesota, 1973. 
Purcell, T. V, „ Cavanaugh, G. F. Blacks in the 
industrial world: Issues for the manager. New 
York: Free Fress, 1972. 
j Rodman, H. The lower class value stretch. Social 
Forces, 1963, 42, 205-215. 
odman, H., Nichols, F. R., & Voydanoff, P. Lower- 
class attitudes toward “deviant” family patterns: 
A cross-cultural study. Journal oj Marriage and 
the Family, 1969, 31, 315-321. 
Rose, A. The Negro in America. Boston: Beacon 
Press, 1948. 
Rosen, H., & Turner, J. Effectiveness of two orienta- 
tion approaches in hard-core unemployed turnover 
and absenteeism. Journal of Applied Psychology, 
1971, 55, 296-301, 
oss, J. C., & Wheeler, R. H. Black belonging. West- 
5 Conn.: Greenwood Press, 1971. 

ledge, A. L., & Gass, G. Z. Nineteen Negro men. 
aa Francisco: Jossey-Bass, 1967. ee 

1 7 G. D. The social order of the slum: Ethnicity 

territory in the inner city. Chicago: University 
Tria Chicago Press, 1968. 

ndis, H, C. Exploratory factor analyses of the 


Bowes 
thavioral component of social attitudes. Journal 


ss-CULTURAL APPROACH TO TRAINING THE HARD To EMPLOY 


693 


of Abnormal and Social Psychology, 1964, 68, 470- 
430. 

Triandis, H. C. Culture training, cognitive complexity, 
and interpersonal attitudes. In R. W. Brislin, S. 
Bochner, & W. J. Lonner (Eds.), Cross-cultural 
perspectives and learning. Vol. 1. Beverly Hills, 
Calif.: Sage, 1974. (a) 

Triandis, H. C. Person perception: A review of the 
literature and implications, In H. L, Fromkin and 
J. Sherwood (Eds.), Integrating the organization, 
Glencoe, III.: Free Press, 1974. (b) 

Triandis, H. C., Feldman, J., & Harvey, W. Person 
perception among black and white adolescents and 
the hardcore unemployed. (Tech. Report No. 5, 
SRS No. RD 2841-G) Champaign, III.: Department 
of Psychology, University of Ilinois, 1970. 

Triandis, H. C., Feldman, J., & Harvey, W. Job per- 
ceptions among black and white adolescents and 
the hardcore unemployed. (Tech. Rep. No. 7, SRS 
No. RD 2841-G) Champaign, III.: Department of 
Psychology, University of Illinois, 1971. (a) 

Triandis, H. C., Feldman, J, & Harvey, W. The 
perceptions of implicative relationships among 
black and white adolescents and the hardcore un- 
employed. (Tech. Rep. No. 8, SRS No. RS 2841-G) 
Champaign, Ill.: Department of Psychology, Uni- 
versity of Illinois, 1971. (b) 

Triandis, H. C., Feldman, J., & Harvey, W. Role per- 
ceptions among black and white adolescents and 
the hardcore unemployed. (Tech. Rep. No. 6, 
SRS No. RD 2841-G) Champaign, III.: Depart- 
ment of Psychology, University of Illinois, 1971, 
(c) 

Triandis, H. C., Feldman, J., Weldon, D., & Harvey, 
W. Eco-system distrust and the hard to employ. 
Journal of Applied Psychology, in press. 

Triandis, H. C., & Malpass, R. S. Studies of black 
and white interaction in job settings. Journal of 
Applied Social Psychology, 1971, 1, 101-117. 

Triandis, H. C., Vassiliou, V., & Nassiakou, M. Three 
cross-cultural studies of subjective culture. Journal 
of Personality and Social Psychology, 1968, 8 (4, 
Pt. 2), 1-42. 

Triandis, H. C., Vassiliou, V., Vassiliou, G., Tanaka, 
V., & Shanmugam, A. The analysis of subjective 
culture. New York: Wiley, 1972. 

Triandis, H. C., Weldon, D. E., & Feldman, J. Black 
and white hardcore and middle class subjective 
cultures: A cross validation. (Tech. Rep. No. 14, 
SRS 15-P-35175/5) Champaign, III.: Department 
of Psychology, University of Illinois, 1972. 

Valentine, C. A. Culture and poverty: Critique and 
counter-proposals. Chicago: University of Chicago 
Press, 1968. 

Weldon, D., Carlston, D., Rissman, A,, Slobodin, L., 
& Triandis, H. C. A laboratory test of the effects 
of culture assimilator training. (Tech. Rep. No. 
20, SRS 15 P-55175/5) Champaign, III.: Depart- 
ment of Psychology, University of Illinois, 1974. 

Wilcox, R. C. The psychological consequences of 
being a black American. New York: Wiley, 1971. 


(Received December 26, 1973) 


ED PEDALINO* 


Pedalino and Associates, Inc., 
Ann Arbor, Mickigan 


centive system, decreased absenteeism. 


There is a growing interest in the applica- 
m of behavior modification in industrial 
organizations. One area of potential applica- 
tion has been the absenteeism of the marginal 
worker (Porter, 1973). Interest in the use of 
vior modification is on two levels—theo- 
cal and applied. On a theoretical level, two 
landmark articles have been written, Porter 
1973, pp. 113-133) and Nord (1972). Also 
mboa (1974) addressed the issue of com- 
ibility between behavior modification and 
ent organizational theories of motivation, 
ind came to the conclusion that the two can 
be synthesized along content and process 
dimensions. Jablonsky and DeVries (1972) 
critically evaluated Nord’s article and devel- 
d an “interactive” model of behavior which 
nts for multiple reinforcement on em- 
ee behavior. 
On the applied level, the laboratory experi- 
ments of Yukl, Wexley, and Seymore (1972), 
ington, Reitz, and Scott (1971), and 
dam (1972) attempted to measure the effi- 
y of behavior modification in addressing 
anizational issues (i.e., pay incentives, 
action- productivity quagmire, increasing 
ormance quality and quantity). Popular 
rature citing successful application of be- 
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BEHAVIOR MODIFICATION AND ABs!’ 
INTERVENTION IN ONE INDUSTRIAL st! 


Behavior modification was used in an attempt to decrease ab 
sample of 215 hourly employees at a manufacturing/distributior m 
ployees in four adjoining plants served as comparison groups. A - 
line, intervention, return to baseline) intervention using a | 
system constituted the experimental design. Absenteeism decreas: 
following the experimental group intervention but did not decre 
the four comparison groups. Further, stretching the schedule oí 
did not increase the rate of absentecism. Findings are discusses 
Lawler and Hackman’s 1969 study which indicated participation, 
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havior modification i 
revolved around the 
perience (“New Tool, 
ner’s Theories Work, 
described a case study 
pany which used a lotte 
absenteeism and tardines 
in a field study, focused 
back as a behavioral tec 
in two industrial settings. 
of Brethower, there is a de 
ments evaluating the app!“ 


modification in industrial or. «nizations. 
The article written by F r (1973) pre 

vided the theoretical and applied frame 

on which to base this study. Porters fi 


is on the marginal worker, deñned as an 
vidual who has failed to demonstrate 
sistent work attendance and/or has failed 
meet organizationally defined standards 
adequate performance (Porter, 1973).A 
ing to Porter, acquisition of behavior via 
ing, and maintenance of acquired behavior 
schedules of reinforcement are the key tools: 
for improving attendance and performance oa 
the part of the marginal worker. A 

The basic hypothesis of this study 15 E 
behavior modification can be utilized in ® 
industrial organization to reduce absenteeism. 
Gamboa (1974) reviewed the literature “i 
absenteeism and concluded that the bulk A 
studies reported were either surveys or 1 
dotal reports of what companies did to de 
crease absenteeism. Three studies, Mann au 
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i, Mann and Sperling 
wier and Hackman (1969) 
All were interventions in in- 
and attempted to evaluate the 
intervention on absenteeism. 
y Lawler and Hackman (1969) 
ve present study in two impor- 
4) the use of a pay incentive 
e absenteeism and (6) the use 
riment to evaluate the effect of 
ntervention. Lawler and Hack- 
however, focused on participa- 
yees as the explanatory variable 
reased absentecism, The present 
on the use of behavior modifi- 
tempts to test its efficacy in 
enteeism. 
nner, in an interview, has only 
y stated his ideas on how behavior 
tion can be applied in industrial orga- 
s (see Conversation with Skinner, 
cxample, Skinner suggests that a 
ich as a door prize) may well solve 
of absenteeism. Formally stated, 
, the basic hypothesis of the study 
lern ent reinforcement using financial 
s will lead to a significant decrease 


ibsentecism. 


METHOD 

Strategy 

pothesis of the study was tested in a 
it. The experimental approach chosen 
pronged design; (a) the ABA design 
place (B), and the intervention strategy 
d (A) with measures systematically moni- 
in all three phases, and (ö) the “controlled” 


feature uses comparison groups 
ct of trend as an alternative hypothesis for 
ze in absenteeism rate. 


Site and Subjects 


t research was conducted at a manufacturing“ 

on center that handles numerous products, 
product housed in a separate plant. Each 
ation was, for all practical purposes, inde- 
of the others. Experimental intervention 
in one plant with the other plants serving 

rison groups. 

plants were made up of a four-level reporting 
with a plant manager at the top, then 
ndents, followed by foremen, with unionized 
s at the lowest level. Each foreman was 
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management and their reaction was positive. Thus, 
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a poker game incentive plan was chosen as the 
intervention strategy. The plan was to work as 
follows: 


Each day an employee comes to work and is on 
time, he is allowed to choose a card from a deck 
of playing cards. At the end of the five-day week, 
he will have five cards or a normal poker hand. 
The highest hand wins $20. There will be eight 
winners, one for approximately each department. 


The implementation of such a strategy constituted 
Phases 1 and 2 of the program. Basically the pro- 
gram provided an added incentive for entering the 
work environment each day. 

Since the number of men in each department 
ranged from 14 to 26, the chances of winning were 
greater in certain departments than in others. How- 
ever, since each department was a naturally existing 
functional unit, it was decided to preserve the 
current divisions for the program. 

Each time an employee came to work on time, 
he received a card from a poker deck from his fore- 
man. The foreman recorded the card chosen by the 
employee on a score sheet. The foreman also re- 
corded the card on a poker game chart. The chart 
was a larger version of the score sheet and was 
posted where men of the same department could see 
how everyone was doing. 

It is one thing to introduce a program to reduce 
absenteeism, and it is another thing to maintain 
such reduced absenteeism over a period of time. A 
fading procedure, therefore, was designed to stretch 
the interval of reinforcement. In Phase 1, the first 
6 weeks, the program was run each week (fixed 
interval—1 week). In Phase 2, the last 10 weeks, 
the program was run every other week—fixed 
interval—2 weeks). 

Baseline 2. After 16 weeks the program was dis- 
continued. Absenteeism measures over 22 weeks con- 
tinued to be monitored for the experimental group 
to determine long-run effectiveness of the program. 
The procedure employed provides two ways of 
evaluating the effectiveness of the experimental inter- 
vention: (a) the ABA design uses the experimental 
group as its own control in determining the differen- 
tial impact of introducing and removing the inter- 
vention, and (b) the comparison groups enable us 
to “control” for trend as an alternative hypothesis 
to explain the effect of experimental intervention. 


RESULTS 


Results of the experiment are shown in 
Figure 1, The baseline measure of 32 weeks 
showed the overall weekly absenteeism rate 
of the experimental group to be 3.01%. The 
goal aimed for was an absenteeism rate of 
2.31%. This was determined by computing 
what the absenteeism rate would be if all 
employees took just their six days sick leave 

each year. It should be noted that absentee- 


Ep PEDALINO AND Victor U. GAMBOA 


ism rates for somi 
if not lower, than 
effect, with existi: 
and structure, the 
been achieved and y realistic. 

Figure 1 shows t! overall absentee- 
ism rate actually aci s a result of the 
intervention was 2 »mpared to the 
baseline of 3.01% thi nted an 18.27% 
decrease in absenteei is baseline-inter- 
vention decrease in ism was tested 
for statistical signifi a one-tailed { 
test (7 = 1.98, p< «| one-tailed test 
was chosen since a nal decrease in 
absenteeism from ba to intervention 
period was hypothesize he experimental 
group. A 22-week foll fter the incen- 
tive system interven phased out 
showed the absenteeisi have climbed 
to 3.02%. This repres in increase of 
30.08% over the inte n rate. This 
intervention-follow-up i: was tested for 
statistical significance | -tailed t test 


had been as low 


mnel, technology, 
31% had already 


(t = 3.34, p<.01). / siled test was 
chosen since there v a priori basis 
for hypothesizing a che absenteeism 


in a specific direction for intervention- 
follow-up period. 

An unplanned manipulati 
eighth week of the exp 
Phase 1, the incentive progra: 
to reward good attendanc: 
present on time) on a 
Phase 2, the incentive pr 
operational on a biweekly b 
the bonus for the poker game 
on the week of September 24, no poker game 
was to take place on the week following. 
The next poker game was to take place on 
October 15, and so on. . 

Unanticipated circumstances resulted in 1 
incentive program being put into operatio 


ccurred in the 
tal period. In 
was structured 
ive work days 


31% goal line, Ip : 


on the weeks of October 1, 8, and 15, three 


hown 


weeks in a row. Absenteeism, as can be 5 30 
37 


in Figure 1, immediately increased from 
to 3.9%. The interval for reinforcement a 
been stretched beyond the point which cou 3 
maintain good attendance. When the W 
tive was again reinstituted, absenteeism 15 
mediately dropped to 2.4% and stayed 10 
about that level for the next six weeks. 
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Week Ending Dates for Experimental Period 


Ficure 1. Experimental group attendance program results. 


idental manipulation within the 
rovided further evidence for the 
ve intervention. 
he effectiveness of “stretching the 
schedule of reinforcement,” the average ab- 
Senteeism rate of the employees in the experi- 
mental group during Phase 1 (weekly schedule 
Of reinforcement) and Phase 2 (biweekly 
pee of reinforcement) were compared. 
pene Phase 1, the average employee in the 
1 group was absent 2.38% of total 
el work hours. During Phase 2 the 
te was absent 2.51% of total scheduled 
un ours. A two-tailed f test showed no sig- 
cant difference between absenteeism for 
hase 1 and Phase 2. 
baseline- intervention data of the ex- 
Roi ental group vis-a-vis the comparison 
PS were then examined. Over the same 
‘Week intervention period, matching of 


baseline and intervention measures showed an 
increase in absenteeism in four of the com- 
parison groups. Such an intervention-baseline 
increase was tested for statistical significance 
by a one-tailed ¢ test, and only one of the 
comparison groups (Group 1) yielded a ¢ 
value (7 = 1.71) significant at the .05 level, 
in the direction of increased absenteeism. In 
general, the data gathered from the compari- 
son groups showed no significant changes 
during matched baseline and intervention 
periods. In the one group where a significant 
change was shown, the change was actually 
in the direction of increased absenteeism. 


Discussion 


The results of this study support the hy- 
pothesis that behavior modification can be 
utilized in an industrial setting to reduce ab- 
senteeism to a statistically significant degree. 
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Over a four-month period an 18.27% reduc- 
tion in the absenteeism rate of 215 unionized 
employees was achieved. Over the same four- 
month period, a 13.79% average increase took 
place in the absenteeism rate of comparison 
groups. Note that the baseline rate of the 
experimental group was significantly lower 
than any of the comparison groups to start 
with. Further, the results showed that stretch- 
ing the schedule of reinforcement from weekly 
to biweekly could be achieved without bring- 
ing about a significant increase in the rate of 
absenteeism. Also, since the program was 
designed so that an employee had to be on 
time before an incentive was given, a decrease 
in tardiness was also observed. 
The incentive system was terminated after 
16 weeks. The reason for doing so was that a 
bargaining date for a new union contract was 
approaching and the company did not want to 
find itself negotiating this incentive system 
into the contract. Thus, while the results of 
this study are encouraging, further work is 
necessary relative to the long-range mainte- 
nance of such behavior. 
It would be interesting to compare the pres- 
ent study with a previous study by Lawler 
and Hackman (1969). They also decreased 
absenteeism by using an incentive system. 
However, Lawler and Hackman (1969) em- 
phasized participation as the explanatory 
variable. In this study, emphasis was placed 
on behavioral principles as bringing about the 
decrease in absenteeism. Does this suggest two 
ways of decreasing absenteeism in industrial 
organizations—participation and behavior 
modification? 
Gamboa (1974) suggested that processes 
given the label “participation” can simply be 
much better understood and operationalized 
using behavior modification principles. Thus, 
participation and behavior modification are 

not two ways of decreasing absenteeism (or 
improving any other behavior). Rather, be- 
havior modification simply helps to make the 
concept of participation operative and effec- 
tive. It is the consideration of these issues, 
both theoretically and empirically, which 
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In the 


given very little job information, while the 
other group received a rather full description 
of the job. The major finding was that inter- 
viewer agreement was far superior for the 
group having more complete job information. 

The present article reports two experiments 
investigating the extent to which interviewers 
tend to use available job information as a 
basis for employment decisions and the pos- 
sible conditions that may enhance or decrease 
such use. When an interviewer possesses job 


o> 9-104 
OF JOB INFORMATION AS A CRITERION IN 
{PLOYMENT DECISIONS OF INTERVIEWERS 
JASH WIENER? 
0 ern Reserve University 
factorially designed experiments investigated the extent to which job in- 
tion was used by interviewers as a criterion for their decisions. 
experiment subjects were experienced (n = 48) and inexperienced (n = 48) 
viewers. All subjects received identical job information. Applicant informa- 
was defined as either relevant or irrelevant to the job information. Sub- 
ts’ judgments were based on composites of relevant and irrelevant informa- 
1 segments, Each of the segments was cither favorable or unfavorable. The 
nd experiment replicated the first one, except that subjects (N = 48) did 
t receive job information. The main findings indicated extensive use of job 
owrmation. Availability of such information reduced the effect of irrelevant 
ributes on decisions but did not eliminate it. Use of job information was not 
hanced by experience in interviewing. 
A factor ich may contribute to biases in 
ing dec s by interviewers is a reliance 
person niches, theories, and stereotypes 
a crite for weighing and evaluating 
plicant mation, The McGill University 
dies (W er, 1964) strongly suggest that 
terviewe iake hiring decisions by match- 
and caring job applicants against a 
freotype an ideal applicant. These find- 
mgs indicai that if the match is close the 
pplicant is more likely to be hired; other- 
ise, he is ore likely to be rejected. Conse- 


quently, th- degree of the validity of the 
leci function of the fit of the stereo- 
val job requirements. 

reduce the potential bias in- 
personal stereotypes is to make 
interviewers a set of objective 
b requirements to be employed as criteria 
br evaluation of applicant information. The 
More interviewers use such job information, 
the less they are likely to rely on personal 
Stereotypes. 

} Only one study to date (Langdale & Weitz, 
-1973) has investigated the availability of job 
Information as a factor in employment de- 
“sions, In that study, two groups of personnel 
Metviewers were given identical applicant 
Mormation for evaluation, One group was 


Requests for reprints should be sent to Yoash 
Rt Department of Psychology, Case Western 
“Ive University, Cleveland, Ohio 44106. 


information, any applicant information he 
collects can be classified as either relevant or 
irrelevant to the available job information. 
Job information is highly used when the in- 
terviewer relies mostly on relevant applicant 
information and neglects the irrelevant infor- 
mation in his decision. Experiment 1 ascer- 
tains the relative contribution of relevant and 
irrelevant applicant information to the hiring 
decision when job information is available to 
interviewers. Experiment 2 ascertains the rela- 
tive contribution of these information seg- 
ments when job information is not available. 


EXPERIMENT 1 
Method 


Subjects 


Ninety-six subjects of whom half were experienced 
interviewers and the other half were inexperienced 
interviewers participated in the experiment. The ex- 
perienced interviewers were drawn from local in- 
dustrial companies and consulting firms. Each of 
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them had been interviewing job applicants on a 
regular basis for at least six months and had been 
responsible for making either employment decisions 
or recommendations, The inexperienced interviewers 
were drawn from low- and middle-management per- 
sonnel in local companies where they had not had 
any responsibility in making employment decisions 
or interviewing. 


Materials 


Job information. An identical set of data about 
the job of a staff engineer was given to all subjects. 
This particular job was selected because the subjects, 
both experienced and inexperienced interviewers, were 
assumed to have the same degree of general famil- 
iarity with it. The job was specific enough to ensure 
that even the experienced subjects had not actually 
interviewed applicants for that position. Thus, it was 
possible to avoid the problem found by Webster 
(1964) wherein experienced interviewers may be 
locked into their own company’s values, regardless 
of the given job information. 

The job information presented to the subjects was 
adopted from actual job analysis data of the posi- 
tion staff engineer in a large local manufacturing 
company. It consisted of a list of specifications of 
the personal requirements of a staff engineer job for 
that company. An example of one item from this 
list of job requirements is: 

Supervision and work with others: The applicant 

must be able to work independently and with 

minimal supervision, Normally, he would be given 

a general assignment and the responsibility to fol- 

low it through to conclusion. Primarily, he would 

work alone. He does not supervise anybody and 
his relations with others are limited to exchanging 
information. 
In order to approximate a realistic format of a com- 
plete set of job information data, subjects were also 
presented with a description of job duties and re- 
sponsibilities. 

Applicant information. This information was pre- 
sented in a written format of 20 descriptive state- 
ments characterizing each applicant. Similar pro- 
cedures have been commonly used in interview re- 
search (e.g, Carlson, 1967; Hollman, 1972), The 
descriptive statements used were selected from a pool 
of 730 items (Hakel & Dunnette, 1970) that had 
previously been judged on the basis of favorability, 
frequency of occurrence and importance to specific 
jobs (including the job, engineer), The process of 
item selection was conducted in four steps: (a) 
All items in the category engineer were evaluated by 
two judges as either relevant or irrelevant to the 
job specifications. The relevancy judgment was a 
straightforward procedure, For example, the job 
specifications excluded the need for effective inter- 
personal relations. Thus, any statement relating to 
interpersonal skills was irrelevant, (b) The relevant 
and irrelevant groups of items were further divided 
into favorable and unfavorable categories according 
to the ratings of Hakel and Dunnette (1970). State- 
ments with a mean rating of 5.00 and above were 
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labeled favorable and those \ vean rating of 
3.00 or below were labeled ur le. The rest of 7 
the items were discarded. (c maining item 
were arranged into opposing the basis of 
their favorability. In each of t nt and irrele. 
vant groups of items, favor ements were 
matched with their unfavora! terparts, For 
example, the favorable relevar “is a college 
graduate” was paired with its rable relevant 
counterpart “has had two ye college.” (d 
Twenty pairs were selected the pool of 
matched items, 10 relevant and levant pairs. 
Each pair contained one favora and one un- 
favorable item. 

The items were arranged to ur applicant 
information sets. Each set consi ) statements, 
half of which were relevant and which were 
irrelevant. The relevant and irre! ments were 
either favorable or unfavorable p the task 
as realistic as possible the inforn did not con- 
sist of purely favorable or unfa statements. 
That is, the favorable segment c< l of 7 favor- 
able and 3 unfavorable stateme: ice versa for 


the unfavorable segment of the in tion, A pilot 
study showed that the four appli ormation sets 
were perceived by subjects as rea and credible, 
The 20 statements in each of the cant informa- 
tion conditions were arranged in lifferent ran- 
dom orders, Each subject, howe s presented 


with only one set of information, 


Design and Procedure 


ns were ex- 
relevant- 


The four applicant information < 
amined in a factorial design: (a) favo: 


favorable irrelevant; (b) favorable —unfavor- 
able irrelevant; (c) unfavorable ce ut-favorable 
irrelevant; and (d) unfavorable re unfavorable 
irrelevant. Subjects were assigned randomly to these 


conditions, each subject belonging + y one con- 
dition. This design cannot pro about the 
main effect of relevant versus i information 
per se, since each subject received b relevant and 
irrelevant information and his employment decision 
was based on the composite. H the design 
can provide data about the main effects of two M- 
dependent variables: favorability ve unfavorabil- 


ity of relevant information and favorability versus 
unfavorability of irrelevant information. These mam 
effects indicate the impact of each of these information 
types on the decision. The more interviewers rely ona 
given information segment—either relevant Or jrrele- 
vant—in their decisions, the larger the difference n 
tween the favorable and unfavorable condito 4 
this information segment. That is, the larger š 
main effect. Omega-square values obtained from thesi 
main effects enable a separate assessment and com- 
parison of the percentage of decision variance E 
counted for by the relevant and irrelevant applicani 
information segments or the relative contribution È 
each type of information to the hiring decision. Con 


0 
sequently, an inference about the oe È J e, 
i i i i smen 
information use is possible. An asses: ariables 


interaction effect of the two favorability V 
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TABLE 1 


*LOYMENT DECISIONS AS A FUNCTION Or FAvoRADILITY OF RELEVANT AND Inex LEVANT 


INFORMATION FOR EXPERIENCED AND INEXPERIENCED INTERVIEWERS 


Experiment 1 Experiment 2 
Var Experienced Inexperienced Inexperienced 
— . 4 
Favorable ir- Unfavorable Favorable ir- | Unfavorable | Favorable ir- | Unfavorable 
relevant irrelevant relevant irrelevant relevant irrelevant 
Favorat s | 
Tele 
M 7.58 6.25 9.33 8.25 7.83 4.92 
SD 2.81 2.01 2.10 2.09 2.61 | 2,53 
Unfavo | 
rel 
M 3.50 1.75 6.08 2.25 5.33 2.25 
SD 1.80 1.36 3.17 1.36 1.75 2.00 
Note. 12 for each group. 
38 ue on the degree of rejection- acceptance scale. The scale ranged from very strongly reject (1) to very strongly accept 
may asc in whether the extent to which irrelevant Results and Discussion 
inform: affects the decision depends on the 
favoral level of the relevant information. The Decision Findings 
design allows the determination of the degree S 
to whic ¿xperience in interviewing, a third inde- Table 1 (Experiment 1) shows mean scores 
pendent ~«riable, affects reliance on relevant and of the strength of accept-reject decision of 
imeleva pplicant information, and consequently the eight treatment groups (n = 12 for each 
an infer e about the effect of such experience on 
use of jo information. group). : : 
Fach ect was presented with a booklet that Table 3 2 (Experiment 1) Summarizes the 
included structions, job information, applicant in- analysis of variance of the decision data. The 
formation, and the rating scales, in that order. Sub- main effects of favorability of relevant infor- 
7 ins 4 i . 12 1 7 
Jects we: instructed “to assume that they are in a mation and favorability of irrelevant informa- 
capacity tc make an employment decision for a 


company“ and to make the decision on 
the information supplied. No restrictions 


were made on how to evaluate the information. No 
time lim imposed. 

Depen measures. The interview decision was 
Measured in two steps. The first was a dichotomous 
decision of either accept or reject. This approximates 


the type decision generally made in a real-life 
employment interview. The second step was to re- 
quest subjects to respond to a 6-point scale indicat- 
Ing the degree of strength of the acceptance or re- 
, Jection decision. A subject who first made the de- 
Cision of accept responded to a degree of acceptance 
Scale, whereas a subject who initially made a rejec- 
tion decision responded to a degree of rejection scale. 
For the purpose of analysis, these two scales were 
Combined, so each subject received a decision score 
in the range of 1-12, indicating a decision continuum 
Tom very strongly reject to very strongly accept. 
his scaling procedure did not allow subjects to re- 
Main neutral or not make a decision. The other de- 
Pendent variable—the degree of confidence subjects 
placed in their decision—was measured by a 7-point 
polar scale. 


tion were significant. They accounted for de- 
cision variance proportions of 42% and 8%, 
respectively, as measured by omega-square 
values, These values indicate a considerable 
reliance on relevant applicant information 
when job information was available to inter- 
viewers. However, availability of such infor- 
mation did not completely eliminate the effect 
of irrelevant applicant information, It was 
much smaller than the effect of the relevant 
information, but still contributed significantly 
to the decision. This pattern of effects strongly 
indicates that the job information data were 
used by the interviewers as a criterion for 
their decision. 

There was a tendency of a Favorability of 
Relevant Information X Favorability of Ir- 
relevant Information interaction (p < .10). 
The pattern of means in Table 1 (Experiment 
1) indicates that the favorability of irrelevant 


— a 
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Information had more effect on subjects’ de- 
cisions when the relevant Information was un- 
favorable than when the relevant information 
was favorable, Previous research found that 
“Interviewers are more influenced by unfavor- 
able than by favorable Information (Bolster 
k Springbett, 1961). Despite this predisposi- 
n, the present findings suggest that inter- 
viewers tend to pay more attention and give 

nore weight to irrelevant information when 


than when faced with favorable relevant in- 
“formation. That is, when interviewers possess 
unfavorable relevant information they seem 
to seck additional available information for 
making a decision, When they possess favor- 
ble relevant information they tend less to 
seek additional information for their decisions, 

The main effect of interviewers’ experience 
as significant. The pattern of the means 
(Table 1, Experiment 1) shows that ex- 
perienced interviewers were more rejecting 

n the inexperienced ones, Similar results 
j obtained in previous studies (Springbett, 
1954; Webster, 1964). Possibly, because of 
organizational reward systems, experienced 
interviewers are more cautious in making their 
decisions. Interviewers’ experience did not in- 
with any of the applicant information 
varial This shows that the experienced 
interviewers did not differ from the inexperi- 
enced ones in the relative amount of relevant 
ind irrelevant information they used in mak- 
ng their decisions. Experience does not seem 
affect use of job information. Langdale 


Confidence Findings 

A 2 X 2 * 2 analysis of variance on scores 
of degree of confidence in the hiring decision 
showed a significant main effect only for 
favorability of relevant information (F = 
5.03, df = 1/88, p < .05). As indicated in 
Table 3 (Experiment 1), the more unfavor- 
ble the relevant applicant information, the 
ger the confidence interviewers have in 
their decisions. The main effect of favorability 
of irrelevant information was not significant, 
but the same pattern of means as for the rele- 
t information was found, Interviewers 
em to have slightly greater confidence in 
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TABLE 2 


AN eis OF Vax: 
Decisiox Sc 


Source 


Experiment 1 
Favorability of rele- 
vant information 
(A) 
Favorability of irrel- 
evant information 


> 
x 
A k 


Experiment 2 
Favorability of rele- 
vant information 
(A) 
Favorability of irrel- 
evant information 


gi 


— 


on un- 
pport for 
relation 
i the ac- 
nore con- 
reject an 


their decisions when they 
favorable information, Addit 
this pattern comes from the 
between the confidence ratin 
cept-reject scores. Interviewer 
fident in their decision when 
applicant. 

The main effect of experien 
nificant. Also, experience did 
any of the information variables so as to 
moderate their effect on decision confidence. 
The higher degree of confidence when deci- 
sion and information are negative may give 
additional support to the contention W 


terviewers attach more weight to unlavora’ 


was not sig- 
yt interact with 


information. Decisions based on 1 
that is highly weighed produce a higher de: 


gree of confidence. Pas Í 
There was a significant Favorability 0 
Relevant Information X Favorability of It 
relevant Information interaction (F = 16: 
df=1/88, p< 01). The pattern 
means in Table 3 (Experiment 1) in adi- 
that decision confidence was affected, in 40°” 
tion to the favorability of applicant informs. 
tion, also by the internal consistency of 5 
information. Subjects had most confiden 


n of the 


| 
d 


dicates 


o o a 


ation was negative and con- 
able relevant-unfavorable ir- 
on). The second highest de- 
conñdence was exhibited when 
n was positive and consistent. 
mation was inconsistent, that 
i| segment was favorable and the 
zment was unfavorable, or vice 
ence was the lowest. 


EXPERIMENT 2 
Method 


ts were inexperienced interviewers. There 
ibjects, drawn from the same population 
rienced subjects in Experiment 1. 


Since the variable of experience was omitted in 
his experiment, its design was a 2 <2 factorial. The 
ve contribution of the relevant and irrelevant 
nt information segments to ae ded- 


lent to which the effects of the relevant and irrele- 
int information in Experiment 1 differed from the 


The 2 x 2 analysis of variance of the de- 
sion data is presented in Table 2 (Experi- 
ent 2). The main effects of favorability of 
devant and irrelevant information were sig- 
cant, They accounted for decision variance 
portions of 17% and 24%, respectively. 
comparison with the corresponding propor- 
s in Experiment 1 (42% and 8%) indi- 
es that job information was used to a con- 
‘able extent by interviewers as a criterion 
hiring decisions. Its availability consider- 
reduced the effect of irrelevant informa- 
and enhanced the effect of relevant in- 
mation on hiring decisions. 
Availability of job information affected also 
he harshness of the hiring decisions. The 
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-of job information decreases such an appre- 


ad C 


Id WU Decrees 0 
TABLE J 


Coxnosscs ix Dacssox as a Peers os 
Favomasusty oF Rasvan Axe 


2). Interviewers, then, seem to make 
negative decisions when job information is not 
available for them. Experiment 1 showed that 
interviewers felt more comfortable when they 
made negative decisions, possibly because 
they perceived that it reduced the risk of 
being blamed for a bad decision. It is quite 
possible that making decisions on the basis 


hension and facilitates positive decisions. 


Confidence Findings F 
Table 3 (Experiment 2) presents the means 
of decision confidence scores for all informa- 
tion conditions. Analysis of variance did not 7 
reveal any significant effect, but the main 
effect of favorability of irrelevant information 
approached significance (F = 3.72, dj = 1/0 
44, p<.10). As in Experiment 1, inter- 
viewers tended to have more confidence in 
their decisions when they were based on un- 
favorable information, but in Experiment 1 
this effect was more pronounced for the rele- 
vant information, The reason for this differ- f 
ence possibly lies in the different degree of 
interviewer reliance on relevant and irrelevant P 
information in the two experiments. In Ex- 
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periment 1 they relied mostly on relevant in- 
formation and in Experiment 2 they relied 
more on irrelevant information. 

There was no significant interaction of 
Favorability of Relevant Information X Fa- 
vorability of Irrelevant Information, as was 
found in Experiment 1. In Experiment 2, then, 
consistency of applicant information did not 
emerge as a factor affecting decision confi- 
dence. It is possible that the availability of 
job information as a criterion for a hiring 
decision makes the consistency of applicant 
information more salient for interviewers. 
Thus, the same sets of applicant information 
may have been perceived differently in the 
two experiments: as more inconsistent in Ex- 
periment 1 and less inconsistent in Experi- 
ment 2. 


GENERAL CONCLUSIONS 


The study shows that job information was 
indeed extensively used when available to in- 
terviewers. The contribution of relevant ap- 
plicant information to the decision variance 
increased from 17% to 42%, whereas the 
contribution of irrelevant information de- 
creased from 24% to 8%. The decrease in the 
variance proportion of the irrelevant informa- 
tion indicates a decline in the effect of per- 
sonal subjective stereotypes on the hiring de- 
cision. The fact that the effect of irrelevant 
information had not been completely elimi- 
nated when job information was available, 
points at the strong predisposition of inter- 


viewers to hold personal stereotypes of an- 


ideal applicant. 
The effect of job information in the less 
structured and controlled situations of real- 
life decision making may be less dramatic 
than the effect observed in the present ex- 
periments. However, the conclusion remains 
that job information should be made readily 
available to interviewers. Since interviewers 
seem strongly inclined to use job information, 
care should be taken that this information is 
accurate. Otherwise, the availability of such 
information would hurt the quality of em- 
ployment decisions. Rigorous procedures of 
job analysis can ensure sufficient accuracy of 
job information. 

When job information was available to in- 
terviewers they tended to make more positive 
decisions. Further research is needed to find 
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out whether availabi! 
restores more balance 
interviewers give to fav 
information, and wheth 
more valid decisions. 
Experience in intervi« 


job information 

relative weight 
and unfavorable 
eed, it produces 


lid not seem to 


enhance a more extens! if job informa- 
tion. This is not very st since experi- 
enced interviewers prob 1 to be locked 


stems, or into 
nore than in- 

ster, 1964). 

in the use of 

experienced 

n general, ex- 

make more 


into their companies’ 
their own personal ste 
experienced interviewers 
It is possible that some 

job information will ben 
and inexperienced interv 
perienced interviewers 


negative decisions than nced ones. 
Confidence in the emp! Jecision was 
not found to be related bility of job 
information. It was foun affected by 
the favorability of applic: rmation and 
the internal consistency i information. 
The more unfavorable the a ant informa- 
tion and the more consistent s, the higher 
is the confidence in the deci The predis- 
position of interviewers to more com- 
fortable with unfavorable app" ant informa- 
tion and with negative decisions is probably 4 
result of the imbalance in sts given by 
interviewers to favorable and favorable ap- 


plicant information. 
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yyment 
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selection and promotion decisions. 


A p blem of current concern to the per- 
sonne? .sychologist is race and sex group dis- 
crimir ion in employment hiring. Numerous 
artic: have appeared discussing methodo- 
logice considerations relevant to discrimina- 
tion using employment tests as a method 
of er ployee selection (e.g., see Einhorn & 


Bass, 1971), Invariably, definitions and dis- 
cussi ns of employment test bias or unfair 
disc: mination refer to the important relation- 
ship between tests and job performance, The 
assumption in most of these reports is that 
the criterion of job performance is acceptable 
and accurate (Gordon, 1970). 

Boehm (1972) and Einhorn and Bass 
(1971) have pointed out that even if tests 
are used appropriately, discrimination in se- 
lection can still occur if the criterion measures 
themselves are “biased” or “unfair” with 
respect to different subgroups. Generally the 
measure of job performance used is a subjec- 
tive performance appraisal score, and there- 
fore a major question still remains to be 
answered by industrial psychologists: Are 
worker assessments free from rater bias? 


1 This research was supported by a grant to the 
first author from the Graduate School of Business 
Administration, Michigan State University. 

2 Requests for reprints should be sent to W. Clay 
Hamner, Room 239 Eppley Center, Michigan State 
University, East Lansing, Michigan 48824. 

3 Now at the Ohio State University. 

Now at Boston University. 

5 Now at the University of Maine. 


E AND SEX AS DETERMINANTS OF RATINGS 
Y POTENTIAL EMPLOYERS IN A SIMULATED 
WORK-SAMPLING TASK * 


W. CLAY HAMNER.? JAY S. KIM,’ LLOYD BAIRD * 
anp WILLIAM J. BIGONESS * 


Graduate School of Business, Michigan State University 


This investigation examined the way the sex and race of the rater and the sex 
und race of the ratee influence assessments of ratee performance on a simulated 
work-sampling task. Undergraduate students (n = 36) were asked to assume 
the role of a manager and rate all eight combinations of male-female and 
black-white performers. Results indicated that sex-race stereotypes do influence 
assessments of behavior on a work-sampling task even when objective measures 
are defined, Findings are discussed in terms of potential methods for over- 
coming these biases in order to use more effectively the work samples for 


Kavanagh (1971) recommended that a 
rating of a specific stimulus is reliable to the 
extent that independent raters agree, or al- 
ternatively, that consistency over time can be 
demonstrated. While the argument for per- 
formance rating consistency is necessary for 
establishing valid performance criteria, the 
presence of high interrater reliability unfor- 
tunately does not preclude the presence of 
bias in the performance appraisal. First, 
interrater reliability scores do not reflect 
whether or not the ratings are correlated with 
objective measures of job performance. Sec- 
ond, and more important, since much of the 
work on interrater reliability has been done 
using homogeneous (i.e, white male) groups, 
the presence or absence of rater bias is not 
accurately measured. Along these same lines, 
if multiple raters (e.g., a supervisor and his 
boss) both share the same bias, then a high 
interrater reliability score could be possible, 
even when bias is present. Flaugher, Camp- 
bell, and Pike (1969), drawing on research 
conducted by the Educational Testing Service, 
concluded that if the chosen criterion itself 
is infused with the type of bias or unfairness 
that is being investigated in the predictor 
tests, the most elaborate of statistical manipu- 
lations is unlikely to detect its presence in the 


predictor. 

Gael and Grant (1972) found that the 
quality of the criterion often determined the 
results of studies testing for differential va- 
lidity. As a general rule, most studies showing 
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lack of differential validity have used better 
average objective criterion measures, 
most of the studies supporting differ- 
' ential validity rely on subjective, poorly de- 
termined criteria. Based on these findings, it 
_ seems reasonable to suspect that the problem 
is not just with the selection tests but also 
with the fact that the majority criterion per- 
~ formance is higher than the minority group 
performance (see Schmidt & Hunter, 1974) 
_ when subjective rating scales are used. There- 
_ fore, it seems imperative that industrial psy- 
chologists examine such performance ratings 
for bias as well as consistency. 
_ While the courts have not yet examined in 
detail the criterion problem as it relates to 
_ sélection bias, several recent statements by 
the courts indicate that they are concerned 
that validation of selection tests alone is not 
_ Sufficient for the elimination of hiring and 
promotion biases. For example, Judge Brown, 
Speaking for a unanimous court in Rowe ver- 
sus General Motors Corporation,® stated that 
personnel decisions which depend almost en- 
_ tirely on the subjective evaluation of the im- 
“Mediate supervisor are a ready method of 
discrimination against blacks because bias can 
be covertly concealed with little chance of 
detection. 
Io overcome the subjective nature of the 
Criterion ratings, many knowledgeable indus- 
trial psychologists have suggested that a job 
Simulation test (i.e, work sampling) should 
d made a part of the selection procedure. 
is method of selection, according to Werni- 
mont and Campbell (1968) and O’Leary 
(1973) should reduce discrimination in the 
ection process by making the selection 
Criteria relevant and by removing unfairness 
of content found in many selection tests. 
_ The purpose of this paper is to examine 
_ two questions which have been raised in the 
ve discussion. First, are performance rat- 
gs free of rater bias when objective mea- 
res have been defined for the raters? This 
question has been raised by the findings of 
Boehm (1972), Bass and Turner (1973), 


oa et al. (1969), and Gael and Grant 


_® Rowe v. General Motors Corporati 
eres rporation, 457 F. 2d 
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Second, are the asse 
pling unbiased, as hy 
mont and Campbell 
(1973)? While some w 
“prescored” (e.g., Th 
and therefore should be 
of work-sampling perf 
sampling tests (e.g., sec on, 
quire evaluations by “ji 3” i 
ing process. It is possible 
are open to bias, just as 
rating techniques. This 
the assessment center sı 
Telephone and Telegra; es 
Campbell, 1968) and 
(Lopez, 1965) are exa of 
work-sampling exercise 
of behavior as measures 
level achieved. 

The present study em 
laboratory setting to exa 
sex and race of the rater a! 
of the job applicants had 
simulated work-sampling tas 


ipling 
Typing 
urate 


METHOD 
Subjects and Design 


Thirty-six undergraduate busin: 
teered to serve as subjects in this 
ment. A 2X2X2X2X2 factoria! 
to test the effects of rater’s race (blac 
rater’s sex (male versus female), 
versus white), ratee’s sex (male v< 
ratee’s level of performance (higi 
assessments of the ratee’s overall 
task. Rater’s race and sex were 
factors, while ratee’s race, sex, and level ot 
ance were repeated factors. The subjects inch 
black males, 9 black females, 9 white males, i 
white females. Each subject rated the ratees, W 
high performers and four low performers. The 
race-sex combinations, that is, black male, 
female, white male, and white female were 
sented in both of the two performance categon 

The subjects were assigned the role of a 
store manager and potential employer of the 
ratees, all of whom had supposedly applied for 
as a stock worker in that grocery store. As Pa 
the screening process, the ratees were asked to 
plete a three-minute work-sampling task. The 
involved removing large Number 10 cans fron 
opened case and placing them on the grocery shi 
provided. Based on the personal experience © 
researchers and on a time study, it was determ 
that the average worker could stock 36 cans 
three-minute period. The high-performance rates 


stocked 48 cans during the throe- 
while the low-performance ratecs 
during this same period. The ratoes 
the task so that certain dimensions, 
their hands, etc.), facial expressions, 
and quality of work (labels faced 
were essentially identical, For example, 
th male and female) wore dark slacks, 
and a dark grocer’s apron. Also, each 
th hands while stacking the large cans. 
iduate students chosen to act as judges 
the ratees did in fact appear essentially 


agreed 


identic these dimensions, Therefore, the only 
variar significance among the ratces would be 
along ice, Sex, or level of performance (quantity 
enly) nsions, 

Th ran State University Instructional Media 
Cente sned a “grocery store” set and filmed the 
perfor ce of each of the eight ratees. The set was 
realis nd resembled the stock section of a large 
groce hain, The film was then available to be 
show: n closed-circuit television to the 36 subjects. 
The r of presentation of the ratees on the film 
was ollows: (a) low-performing black male, (b) 
high- forming white female, (c) low-performing 
black male, (d) high-performing white male, (e) 
low-; forming white male, (f) high-performing 
blac! male, (g) high-performing black male, and 
(k) w-performing white female. Filming of the 
task -rformance was the media chosen so that con- 


siste: y in performance could be maintained, Lewin, 

Dubo, and Akula (1971) found that face-to-face 
inte ction is apparently not crucial in ratings, Rat- 
ine made after watching ratees on videotape were 
alm st identical to those made after fairly extensive 
fac. to-face interaction. 


Procedure 


Data were collected from subjects who participated 
in approximately four equal size groups, although 
the subjects’ ratings were done independently of each 
oiher. There were approximately an equal number of 
sex -race rater combinations at each showing. 

Lach subject was given an instruction sheet de- 
scrioing the purpose of the film. Also on the instruc- 
tion sheet, the subjects were given eight large spaces 
in which they were asked to make comments about 
each pplicant's performance to aid them in filling 
cut the recommendation forms. The film was stopped 
at the cnd of each three-minute segment to allow the 
Subjects to complete their comments. The specific 
instructions were as follows: 


As the manager of the XYZ grocery store, you 
anticipate several vacancies for stock personnel in 
the near future. You are considering 8 applicants 


7 Postexperimental discussions with pilot studies’ 
Subjects and debriefing sessions with the subjects in 
7 study revealed that the subjects were not aware 
Of the purpose of this study (i.e. race and sex differ- 
ence in performance ratings). 


ai * “ 5 . 
eum r Sex awn Race 


for these ponitions. Nene of the apglicante 
any prior work experiemoe, sad all have a 


period of time. Based on time-and- 
you have determined that an 
stocking large #10 cans, will be 
cases (6 cans per case for a total of 36 cand 
three minute period. Since you are very 
have asked the assistant manager to film the 

sample” results. You are now reviewing the 
in order to make certain recommendations to 
assistant manager concerning potential future 
ployment of these workers. In order to ald you 
filling out the recommendation forms which will 
be given to ycu after viewing the film, pleas 
relevant notes about each applicant in the 
provided below as you are viewing the film. 


After the subjects had read the instructions, the In- 
vestigator oriented the subjects by explaining t 
the number of openings was unclear at this time 
that they should look for cucs which would 
them to distinguish among the applicants if 
three or four positions came open. 

After viewing the film, each subject was given 
packet containing eight ratee sheets, numbered cor 
secutively from Worker 1 to Worker 3. Each worker 
was rated on his or her “overall task performance.” * 


Rating Scale 


Ratings of the ratee’s overall task performance ob- 
tained from the subjects at the end of the 24-minute 
film were the major dependent measure used. Rating 
Scale Content III format (see Parker, Taylor, Bar- 
rett, & Martens, 1959) was chosen, based on the 
that Taylor and his associates have shown that it 
is the best of four formats in terms of the problems 
of halo, leniency, reliability, and variability which 
generally surround the use of graphic rating scales. 
Each subject was asked to check the appropriate. 
block on the scale that best described the applicant 
in question. The scale points ranged from 1 to 15, 
with 1 being the lowest possible rating on the scale. 
The rating categories were as follows: (a) weak in 
overall performance, failed to produce acceptable 
amount of prcduction; (b) performance periodically 
fell below that which I would expect from similar 
individuals in this task; (c) did a satisfactory job, 
was an average worker in his performance; (d) was 


8 In addition, the subjects rated the workers’ 
motivation and ability to perform the task, but the 
results of a multivariate analysis of variance used to 
analyze the data revealed that the findings for 
three ratings were sufficiently similar that only tl 
results for the overall performance ratings need be 
presented. 
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TABLE 1 
CELL MEANS AND STANDARD DEVIATIONS OF THE RATINGS ON OVERALL T RMANCE 
Male rater Fen 
Black rater White rater Black rater White rater 
Performance i SS 
Male Female Male Female Male Female Female 
applicant | applicant | applicant | applicant | applicant | applicar nt | applicant 
Black applicant | 
Low | 
M 9.00 10.22 6.77 6.11 6.00 6.66 3 7.44 
SD 4.26 2.16 2.94 2.47 2,23 3.55 ) | 2.87 
High | 
M 6.22 9.66 8.11 9.66 9.33 10.88 11.22 
SD 2.99 4.12 1.83 3.00 2.50 2.73 4.03 
White applicant | 
Low | 1 
M 4.88 6.44 5.44 SA. 5.00 5.22 | 5.77 
SD 2.47 2.65 1,23 2.63 2.39 2.22 | 2.29 
High | 
M 9.55 9.77 10.11 10.88 9.22 11.66 11.66 
SD 2.12 2.04 2.36 2.20 1.98 2.34 3.20 
a good worker, his performance was better than While low-performing me kers were 
wil be e 15 5 5 oa 1585 VVV only slightly lower tha v-performing 
; q M = female workers (6.25 versus ), high-per- 
RESULTS forming male workers were j a 
ste igh- rming le workers 
Examination of the results of Table 1 indi- 8580 dren 16 . 1/32, P< 
Cates that the ratees’ level of performance on 0 10 1 90 St 5 4135 found that black 
the task corresponded to the raters’ evaluas >> % 04). It was also loung 
K raters tended to rate black gher than 


tion of this performance. High-performance 
ratees were rated as having significantly 
higher levels of overall performance than 
_low-performance ratees (9.67 versus 6.48; 
F = 70.45, df= 1/32, p< 001). Omega- 
square values were computed to estimate the 
proportion of variance accounted for by level 
of performance of the ratees (Winer, 1971, 
p. 428). On this significant difference (p < 
-001), 30% of the variance was due to the 
main effect of level of performance, While this 
finding alone would indicate that evaluations 
of performance of the workers were free from 
kater bias, other findings render this conclu- 
sion invalid. 7 
It was found that female ratees were rated 
higher than male ratees on their overall task 
_ performance (8.69 versus 7.46; F = 16.79, 


C df= 1/32, b 001; «= .08). This was 


especially true for high-performing ratees, 


white ratees (8.50 versus 7.72), while white 
raters tended to rate black r lower than 
white ratees (7.80 versus 8.27) on overall 
task performance (F = 4.71, dj = 1/32, P< 
05; on 02). d 
Another finding that is shown in the ratings 
on the overall task performance scale is that 
while high-performing blacks were rated only 
slightly higher than low-performing blacks, 
high-performing whites were rated signifi- 
cantly higher than low-performing whites. On 
this overall task performance scale, low Ps 
forming black workers and high-performing 
black workers were both rated as average 
workers (7.31 versus 8.98), while low-pe™ 
forming white workers were rated below aver- 
age (5.63) and high-performing white workers 
were rated better than average (10.36). Th 
significant interaction effect (F = 19-56, df = 


i RMINANTS OF RATINGS BY POTENTIAL EMPLOYERS: SEX AND RACE 


1/32, 9 „%, = 09) indicates a ten- 
dency to a high-performing white appli- 
cant ove gh-performing black applicant 
and, at t e time, to favor a low-perform- 
ing bla plicant over a low-performing 
white a int. This finding thus suggests 
that lov forming blacks are at an advan- 
tage an h- performing blacks at a disad- 
vantage ssessments are used in the selec- 
tion pri 

Whil examination of the omega-square 
values s that the performance differences 
of the es accounted for 30% of the vari- 
ance ii : subjects’ responses, an additional 
23% o` ne variance in the subjects’ responses 
was ac unted for by various sex—race differ- 
ence c \binations. This finding suggests that 
even t ugh performance differences were the 
major ‘cterminant of the subjects’ ratings, 
tace-s differences also significantly affected 
the su ects’ evaluations of the ratees. 

Discussion 
To hat extent are the assessments of work 


samp! ags biased against black and/or fe- 
male pplicants for a specific entry level job? 
Had we only considered the typical white 
male population of raters and ratees, we 
would have found that raters of high- versus 
low-nerformance workers were highly ac- 
curate in their assessment of overall task per- 
formance. High-performance applicants were 
rated significantly higher on this scale than 
were low-performance applicants. This finding 
would seem to support the contention made 
by Scott and Hamner (1973) that subjective 
ratings can be substituted for actual produc- 
tion records. Unfortunately, this may only be 
true for homogeneous worker populations. 
When the race-sex combinations of the 
raters and ratees were considered, our data 
indicated that direct, systematic biases were 
Present, First, in what is predominantly a 
male-dominated job, female applicants were 
seen as having a higher level of overall task 
Performance than were male applicants for 
the same job. This may be due to the fact 
that this is a low-level, unskilled task which 
a male would be expected to hold, and when 
the female performs equally well on the task, 
She is seen as being a better performer than 
the male, This tendency to inflate rather than 
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devalue the performance of women is contrary 
to previous findings (e. g., see Rosen & Jerdee, 
1973). The main difference between the pre- 
vious studies and the present one is that the 
present study used an unskilled task role, 
whereas the other studies used a managerial 
or professional task role, Therefore, it could 
be argued that females have a better chance 
of being hired for unskilled jobs, whereas 
males have a better chance of being hired for 
professional or managerial jobs. 

A second systematic bias can be seen in the 
higher ratings that subjects gave to applicants 
of the same race, This finding supports the 
findings of Cox and Krumboltz (1958) and 
DeJung and Kaplan (1962). It also partially 
supports the findings by Flaugher et al. 
(1969) that blacks received distinctively 
higher ratings from supervisors of their own 
ethnic group than from white supervisors, and 
the findings by Farr, O'Leary, and Bartlett 
(1971) and Greenhaus and Gavin (1972) that 
white employees obtained higher supervisory 
ratings on overall job performance than did 
black employees. 

As noted by Bass and Turner (1973), this 
finding alone would not necessarily prove that 
the criterion measure is unfairly discrimina- 
tory if there were actual differences in objec- 
tive measures of job performance. Unfortu- 
nately, such is not the case. In the present 
study objective criteria were established, and 
the fact that blacks received significantly 
lower ratings than whites from white raters 
when performance levels were identical indi- 
cates a potentially serious problem of racial 
bias. In most industrial settings, the super- 
visors are predominantly white, and therefore, 
if the results of this laboratory study were 
supported by future investigations, it would 
mean that the use of performance evaluations 
as the criterion in job selection works to dis- 
criminate unfairly against black applicants. 

The data indicate that the differential cri- 
terion bias is especially prevalent for the high- 
performing black applicant. We found that 
high-performing black ratees were rated only 
slightly higher than low-performing black 
ratees, while high-performing white ratees 
were rated significantly higher than low-per- 
forming white ratees. This finding tends to 


E the findings of Rotter and Rotter 
(1966) that raters generally gave higher rat- 
ings to black than to white ratees when per- 
formance was poor. Contrary to the conclusion 
drawn by Bass and Turner (1973), this find- 
ing seems to indicate that the subjects tended 
to rate the performance of white ratees more 
objectively than black ratees. Based on the 
last finding, it seems reasonable to conclude 
that some differential bias may be carried 

over to the job itself. For example, if salary 
increases and promotions are based on super- 
visory evaluations of employees, high-per- 
forming black employees would advance less 
rapidly than high-performing white em- 


= 


yees. 
Irhese findings also indicate that the super- 
visory ratings as obtained would be inappro- 
priate as criterion measures against which to 
_ validate tests, since females were rated differ- 
~ ently than males and blacks were rated differ- 
ently than whites, even though the per- 
- formance of both groups was identical. To 
overcome this problem, training of the raters 
in the nature and use of rating scales to 
evaluate both periodic work performance, as 
= recommended by Brown (1968) and Bass and 
Turner (1973), and work-sampling perform- 
ance, as recommended by Campion (1972) 
and Bray and Campbell (1968), is necessary. 
Also, the raters could be exposed to training 
which emphasizes sex and race fairness to 
become aware of potential biases. Schmidt 
and Johnson (1973) have shown this to be an 
effective method of reducing racial bias among 
katers. 
This is probably the first study to test the 
effect of the race and sex of the rater and 
_ the ratee on ratings given when an objective 
measure of performance was known. Care 
Should be taken in attempting to generalize 
from this laboratory study with college stu- 
dents to organizational settings. Therefore, 
more research on the effect of race and sex 
bias on work-sampling assessments is indi- 
_ cated, A generalization from these results 
which does seem appropriate is that training 
in performance evaluation techniques as well 
4 objectively defined criteria are needed be- 
4 fore the employer will be able to hire the best 
qualified applicant for the job. 
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EFFECTS OF OPPORTUNITY TO COMMUNI! 


AND 


VISIBILITY OF INDIVIDUAL DECISION: 
BEHAVIOR IN THE COMMON INTERE: 


THOMAS H. JERDEE ann BENSON ROSEN 


Graduate School of Business Administration, 
University of North Carolina at Chapel Hill 


Two experiments examined the effects of opportunity to commu 
visibility of individual decisions on socially responsible behavior in gr 
tions involving the “commons” problem. Two hundred and eig! 
students, enrolled in a business administration course, were assigne< 
firm representatives and assembled in groups of 5 to bid on contre 
various conditions. Opportunity to communicate resulted in mot 
responsible bidding, but visibility of individual decisions had no € 
portunity to communicate also helped to sustain socially responsible 
the face of a persistently defecting group member. Results are di 
terms of implications for situations where socially responsible b 
difficult to obtain because of the incentive structure. 


Conflicts between the immediate interests 
of individuals or groups and the common in- 
terest of society have become increasingly 
prevalent in recent years. This problem has 
been called the “tragedy of the commons” by 
Hardin (1968) and Crowe (1969), who re- 
ferred to difficulties that arose in England in 
the 19th century with regard to common 
pasturage. Each herdsman perceived a posi- 
tive utility for increasing the size of his flock, 
but when all herdsmen acted on this percep- 
tion, the result was serious depletion of pas- 
turage. 

More recently, Platt (1973) has discussed 
this problem in terms of reinforcement con- 
tingencies and analyzed various kinds of 
“social traps” in which the short-run gain of 
one individual, organization, or nation may 
involve a small, almost imperceptible cost to 
the common interest. In these situations, 
people tend to ignore the long-run common 
interest and to act in their own short-run 
interest. But when many people follow this 
course, the results for the common interest 
can be disastrous. These problems arise with 
regard to energy consumption, population con- 


+The authors wish to thank Thomas Prestwich, 
Edward Bailey, Mary Mericle, and Sara Freedman 
for their assistance. 

Requests for reprints may be sent to either author, 
Graduate School of Business Administration, Uni- 
versity of North Carolina, Chapel Hill, North Caro- 
lina 27514. 


trol, pollution control, ex rates, tariffs, 


fishing rights, and a hos er ecological 
and economic phenomena. yecome more 
acute as resources become arce and as 
human interdependence ises. Conse- 
quently, there is an imports d to under- 
stand the conditions in w le are more 
terest. 


likely to behave in the comn 

The formal properties of 
commons have been analyz: 
Grzelak (1972). They desc: 
person interdependency relati 
the Prisoner’s Dilemma Ga: 
son case. They have dem 
mentally that the tendency to 
socially responsible manner in 
is influenced by the natu 
matrix. People are more lil 
the common interest when 
costs for doing so are smaller. 

In the Kelley and Grzelak (1972) study 
and in another study by Kahan (1973), low 
rates of cooperative behavior were observed, 
and there were indications of a low level of 
awareness regarding the essential nature 0 
the interdependency in the “commons” prob- 
lem, This rather dismal state of affairs su 
gests that attention should be given to the 
means by which social awareness and com- 
mitment to behave in a socially responsible 
manner can be heightened. The problem 5 
particularly challenging in situations where 


blem of the 
Kelley and 
it as an n- 
hip, of which 
je two-per- 
ed experi- 
behave in 4 


of the payoff 
y to behave in 
their short-run 
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oup situations 


BEHAVIOR IN THE COMMON INTEREST 


the long- \mmon interest is very impor- 
tant but re short-run payoffs cannot 
readily | ered in favor of the common 
good. 

An at ve possibility in these situations 
is the social sanctions. If individual 
member be convinced that social accept- 
ance an sroval are dependent on behavior 
in the non interest, perhaps these con- 
tingenc an outweigh other more tangible 
short-r ncentives for self-seeking behavior. 
Thus i y be useful to determine the con- 
dition ler which appropriate social sanc- 
tions zenerated within a group. It seems 
likely t the visibility of individual mem- 
bers’ | .avior and the opportunity for group 
memb to communicate freely with one 
anoth: would both be important factors in- 
fluenc x socially responsible behavior. 

Sor support for the visibility factor is 
provi. d by findings of Schachter and Hall 
(195. and Bennett (1955) that subjects 
were nore willing to serve in experiments 


wher hey perceived a greater degree of com- 

mitn: nt among their peers. Communication 

usus ‘y promotes cooperation in dyadic Pris- 
onen Dilemma Games but was found not to 
be ( ective in producing cooperative behavior 
in dyads in a “trucking game” situation 
(Deutsch & Krauss, 1962) and in triads in 
a coalition formation game (Vinacke, Mogy, 
Powers, Langan, & Beck, 1974). However, 
none of these studies involved a small-group 
commons situation, and the effects of visibility 
avd communication on socially responsible 
behavior in small group settings involving the 
problem of the commons have not been es- 
tablished experimentally. 
_ The present experiments were designed to 

investigate these effects in the context of a 
simulation of firms bidding on manufacturing 
contracts. Bids could be either “complete 
cost,” involving a small short-run sacrifice for 
the long-run benefit of the society and the 
industry as a whole, or “cut cost,” involving 
a short-run gain at the expense of society and 
the industry, 

The first experiment examined the effects of 
free communication among group members 
and complete visibility of each member's de- 
cisions on the establishment of socially re- 
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sponsible complete-cost bidding behavior 
within the groups. The second experiment ex- 
amined the power of free communication to 
sustain complete-cost bidding in the face of 
persistent defection on the part of one group 
member who was an accomplice of the ex- 
perimenters. 


EXPERIMENT 1 
Method 
Subjects and Procedure 


Subjects were 180 undergraduate students enrolled 
in a business administration course at the University 
of North Carolina. Subjects were assembled in groups 
of 5 and seated around a table. The following written 
instructions were distributed and read aloud by the 
experimenter, 


Each of you in this room will be a representative 
of a different manufacturing firm. Your job will 
be to bid on contracts to manufacture a varicty 
of goods. In each round of bidding, each firm will 
be awarded a contract, but depending upon how 
you bid and how other firms bid, your firm will 
make different amounts of profit, Your objective 
is to maximize the profits of your firm. 

On each round of bidding, your firm has a choice 
of submitting two different types of bids. The 
nature of these bids and their expected profits are 
described below. 

1. Complete Cost Bid. This bid is relatively high 
because it takes into account the actual cost of 
production, using the finest materials available. In 
addition, this type of bid includes cost of indirect 
long-term expenses for anti-pollution measures, 
restoration of natural resources, and quality con- 
trol devices. Since this bid is high, it might result 
in less profit, particularly if the other firms are 
not submitting complete cost bids. On the other 
hand, if you and most or all of the other firms 
submit complete cost bids, the cost of production 
will stabilize, the reputation of the industry will 
be enhanced, and every firm should experience 
higher profits. 

2. Cut Cost Bid. This bid is relatively low since 
it covers only the expenses of the manufacturing 
process itself and ignores the additional expenses 
for pollution control, restoration of natural re- 
sources, and quality control, Because a cut cost 
bid is low it will probably lead to larger contracts 
and more profits. However, if you and most or 
all of the firms submit cut cost bids the reputation 
of the industry will decline. Accordingly, profits 
will be lower for you and for all of the other firms 
as well. 

You will be asked to bid on each of a series of 
contracts, Your choice involves making either a 
“complete cost” or a “cut cost” bid for each con- 
tract. The amount of profit that you make for 
your firm depends partially upon how you bid and 


* TABLE 1 
Frortt un ror Five-Frex IxpustRY 


partially on how other firms bid. In any one 
bidding session, a “cut cost” bid will always be 
more profitable than a “complete cost“ bid; how- 
ever, the more firms bidding cut cost, the lower 
the general level of profitability in the industry. 
Therefore, on any one trial, your firm has a choice 
whether to bid “cut cost“ so as to maximize 
profits, or to bid “complete cost” so as to raise 
the general level of profitability in the industry. 
You have been provided with a Profit Schedule 
showing the relationship between type of bid and 
amount of profit for a five-firm industry. Please 
examine this schedule carefully. 
Each firm will submit a bid for each contract 
by circling either “complete cost” or “cut cost” 
on the bidding form. The experimenter will collect 
the bids and calculate the profits for each firm on 
that particular contract. 


_ The experiment consisted of a 2 X 2 factorial de- 

sign, with nine groups of five subjects in each of the 
four cells. Manipulation of the independent variables 
(visibility of bids and opportunity to communicate) 
Was accomplished through the following instructions. 


[Low visibility] The experimenter will then re- 
turn the bidding form with your profit recorded 
next to your bid. Representatives will not be given 
information regarding the profits of other firms. 
[High visibility] The experimenter will then re- 
cord on the blackboard each firm’s profits. 
4 (Low communication] Please do not talk to any 
= of the other participants during the remainder of 
the experiment. If you do, the control that is 
necessary for good research will be lost, and the 
experiment will be invalid. 

{High communication] Please do not talk to any 
of the other participants while bids are being made. 
After every round of bidding, there will be a one- 
minute period for open discussion among bidders? 


In order to emphasize the point that the objective 
_ Was to maximize profits rather than to beat the other 


21n 6 of the 18 8 in the high- communication 
dition, commu tion was permitted on only 
third trial. This restriction had a negligible 
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firms, a chart with variou levels was postad 
on the wall, and the expe z called attention 
to the goal of getting the hi ‘ible total ren 

The profit schedule wh given to each 
subject is shown in Tab! schedule war 
thoroughly explained and i by the experi 
menter. Subjects were not 1 regarding the 
number of bidding trials, er to minimise 
terminal effects the biddin ere numbered 
for 20 trials, and the experi: in for only 10 


trials. The dependent varia he number of 
complete-cost bids on Trial 


Results and Di 


Results of the first ex are shown 
in Table 2. Communicati pronounced 
effect. The mean num ymplete-cost 
bids per trial on Tria! is approxi- 
mately twice as high i »mmunica+ 
tion condition as in thi nunication 


condition (F = 39.5, p Subjects in 


the high-communication used the 
communication periods t 1e need for 
socially responsible biddin: some in- 
stances they obtained verbel uitments to 
bid complete cost. In the lox munication 
condition this opportunity f asion and 
public commitment was no lable, and 


subjects tended to bid cut co 


Contrary to our expectatio ibility had 
no effect on bidding. This may xplained as 
follows: In the high-visibility 1> commun 
cation condition a cut-cost der, even 


though his action was visible. í i 
his behavior with complete impunity, since 
the other bidders had no way to persuade him 
regarding the error of his ways. In the low 


TABLE 2 


Mean Nummer or Cooperative Bins (Complete 
Cost) PER TRIAL, EXPERIMENT 1 : 


Low communication® | High communication 


Trials aT 
Low visi- | High visi- Low visi- 
bility bility bility 

1-5 

x 1.9 2.6 3.0 

SD 8 1.1 6 
6-10 

x 1.6 19 3.7 

SD 5 0 1.1 


Note. All group ns = 5. pilit- 
Number of groups = 9 for low visibility and high vava 


BEHAVIOR IN 
mmunication condition, It 
'nough for people to be con- 
mportance of complete-cost 

many Instances, to have an- 
intent to bid complete cost; 
bility was not necessary to sus 
responsible bidding when people 
that the group as a whole was 
lete cost. This suggests that if 
cial responsibility is established, 
individual behavior may not be 
inversely, when a social responsi- 
does not exist, visibility of be- 
f y not have much effect unless it is 
OmMpanied by some kind of legal sanction. 
Th summary, results of the first experiment 
cate the importance of communication as 
Means of establishing and maintaining 
responsible behavior in a commons 
uation. As long as subjects could discuss 
problem with each other and determine 

t the group as a whole was committed to 
E responsibly, they were able to “keep 

th.” 


tely to be a few defectors who, despite their 
fomises to behave in the common interest, 
tempt to take advantage of others. Thus 
ere is a question whether communication 
m sustain a norm of social responsibility in 
e face of persistent minority defection 
dward self-seeking behavior. This question 
s explored in the second experiment. 


EXPERIMENT 2 


In the second experiment we hypothesized 
at in the face of defection, socially respon- 
ible behavior could be maintained as long as 
here was continued opportunity to communi- 
ite but that social responsibility would de- 
orate if no communication was possible. 
This experiment employed a 2 X 2 design 
i which the independent variables were de- 
on or cooperation of an accomplice and 
portunity to communicate. 


Method 
ts and Procedures 


jects were 100 undergraduate students enrolled 
usiness course at the University of North Caro- 
Procedures were similar to those employed in 
experiment, 


Rr 
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perimenters bid complete cost om every trial 
retults a few days after 


; 
| 
si 


Resulis and Discussion 


As a check on the induction under fall 
communication for the first five trials, the 
number of subjects in each cell bidding com- 
plete cost on the fifth trial was tabulated. The 
mean number of complete-cost bidders per 
group was 3.8 out of a possible 4, which indi- 
cated that communication had built up a 
norm of highly cooperative behavior by the 
fifth trial, 


trial (communication F = 13.2, $ < Ôl; de- 
fector F = 4.6, p < 05). However, major in- 
terest is in the comparison between the two 
cells in which the accomplice defected on 
Trials 6-10. Here, communication had the 
predicted effect of sustaining cooperative bid- 
ding, while cooperation deteriorated in the 
low-communication cell (f = 1.82, p < .03)* 
In fact, on the 10th trial, after the defecting 
accomplice had bid cut cost four times in a 
row, the seven high-communication groups 
still produced an average of 3.1 cooperative 
bids versus only 1.3 for the six low-communi- 
cation groups (£ = 2.86, P < 01). 

The results of these two experiments pro- 
vide strong support for the importance of the 
opportunity to communicate in social situa- 
tions involving the problem of the commons. 
In both experiments, communication was not 
imposed or required; the opportunity was 
simply provided to be used in any manner the 
subjects chose. In Experiment 1, communica- 


3A one-tailed test was used because the direction 
of the effect was predicted on the basis of the strong 
effects of communication in Experiment 1. 
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TABLE 3 


) Man Nousee or COA Bios (COMPLETE 
Cost) PER TRIAL, EXPERIMENT 2 


Cooperat- 
accom- ing ac- | accom- ing ac- 

pice | complice plice | complice 
33 34 3.2 3.2 
3 4 6 4 
2.2 29 3.1 3.3 
7 1.1 1.0 9 
13 2.5 3.1 3.8 
10 14 1.2 4 


Note. All group as = 4. 
*Number of groups = 6 for defecting accomplice and co- 


accom, e 
* Number of groups = 7 for defecting accomplice and 6 for 
ng accomplice. 


tion was the critical factor in the development 
of socially responsible norms. In Experiment 
2, communication was essential to establish 
and maintain norms of social responsibility 
in the face of a defecting group member. 
In a recent study, Kelley and Stahelski 
(1970) discussed an “assimilation” effect in 
two-person, mixed-motive interactions. Sub- 
jects with cooperative goals gradually assimi- 
lated to the position of their competitive op- 
ponents. A similar assimilation phenomenon 
was observed in the defector—low-communica- 
tion condition in Experiment 2. Group mem- 
bers who initially bid in a cooperative manner 
A increasingly competitive when con- 
_ fronted with the competitive accomplice. On 
the other hand, in the defector-communica- 
tion condition, cooperation was sustained be- 
cause communication periods between trials 
= provided group members with an opportunity 
_ to establish cooperative coalitions among 
themselves. Their unanimous or nearly unani- 
_ mous pledges of cooperation signaled to each 
participant that if he would only make a 
small sacrifice on the next trial, he would 
most likely receive a higher payoff on suc- 
ceeding trials. Communication also allowed 
= Subjects to signal to each other the availa- 
bility of what Platt (1973) called “counter- 
keinforcers,“ in the form of social approval 
_ for sacrifice in the common interest. 
A practical implication of these findings is 
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to focus attention o 

taining expressions o 

a socially responsible 

producing behavior ir 

We would expect this 

effective in situations 

face groups, where the 
within the group cou 

for all to perceive, Ma 
group situations are oí 

egy would be more d 

larger groups, where 

ment could not so reac 
haps, in these situatii 

concept (Likert, 1967 

Commitment could be 

groups, and each gro could 

this commitment to level in 
organization, In additi igators 
examine the potentia! 
grams designed to insu 

ment to organizationa! e., 

fects,’ group incentive 

problem of the commons. 
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d Prya 
an t 


tion, anxiety-strem, and performance 


with personal worth, social affiliation, 
anxiety-stress; and performed better 


í che more recent attempts to improve 


the satisfaction and performance of em- 
plo is the 4-day, 40-hour (4-40) work- 
wee (see Glickman & Brown, 1973). Basi- 
ca! the shortened workweek involves more 
ho: of work per day for fewer days per 


It is estimated that between 60 and 70 

anies per month are converting to the 

- workweek (Wheeler, Gurman, & Tarno- 

i, 1972). The current estimate of the 

ber of firms using the shorter workweek 
es from 700 to 1,000 (Chard & Caruth, 
3). 
Che literature suggests that firms are 
pting the 4-40 workweek because of po- 
catia] improvements in production and mo- 
e, reduction in absenteeism rates (Steward 
& Larsen, 1972), and the convenience that it 
provides employees by providing an extra day 
off (Poor, 1970). There are, however, some 
inherent difficulties in the shortened work- 
week; for example, limitations on the hours 
f female labor, possible overburdening of 
supervisory personnel, overtime pay questions, 
aod scheduling problems. More research is 
needed on the effects of the 4-40 workweek 
and other workweek variations to clarify the 
possible benefits and costs of this change in 
working conditions. 

A recent longitudinal study by Nord and 
Costigan (1973) investigated the reactions of 
employees over time. There is also a need for 
researchers to conduct experiments to study 
the shortened workweek as an independent 
Variable that could influence such dependent 

Requests for reprints should be sent to John M. 
Ivancevich who is now at the Department of Or- 


3 Behavior and Management, University 
of Houston, Houston, Texas 77004. 
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CTS OF THE SHORTER WORKWEEK ON SELECTED 
SATISFACTION AND PERFORMANCE MEASURES 


JOHN M. IVANCEVICH* 
Depariment of Busines Admininiration, Univertity of Kentucky 


Effects of the 4-day, 40-hour workweek were examined by comparing erpai- 
mental (== 104) and control (s= 106) subjects on measures of job satisfar- 


indicated that the workers in the 4-day, 40-hour dividon were more sathdied 


control group (5-day, 40-hour) counterparts. 


over a U-month period The amabyule 


job security, and pay; experienced bow 
with regard to productivity than their 


variables as satisfaction, anxicty-stress, & 
senteeism, and performance. The present 
Cle describes such an experiment, 

The specific hypotheses tested were thi 
when 4-day, 40-hour (4-40) and 8-day, 


in (e) self-actualization, (5) . ( J 
personal worth, (d) social affiliation, (e) job 
security, and (f) pay satisfaction (Hyp 


anxiety-stress, and absenteeism with ove 
performance (Hypothesis 5). 
Merion 
Company Studied 
The company studied has a total of 1,140 m 
gerial and operating employees. These employees are 
located in manufacturing facilities in four geographi- 
cally dispersed operations in the Midwest. The four 
divisions were each engaged in work that could be 
classified, from a technological standpoint, as inters 
mediately complex (Woodward, 1965). The company 


designs, manufactures, and markets a packaging ma- 
chine used in the food-processing industry. i 


a 

Subjects f 
The company’s executive committee, which wanted 
to convert all four divisions to the 4-40 worki 


agreed to establish two divisions as experimental 
control ⁊ control? groups to be studied for 12 months, 


~ 2 The term control division is used only to indicate 
the division that was not converted to the 4-40 
workweek. The company rejected the request of the 
author to study the two divisions for at least two 
years so as to ascertain any “sleeper” effects in the 
4-40 division (Campbell & Stanley, 1963), 
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r division (u = 104) was converted to the ranging from .70 to 85 v sted from the . 
40 system, while the control group (m= 106) con- item job satisfaction mcas 
tued to junction on the 5-40 schedule. The anxiety—stress mea ed of nine quem 
“The experimental and control subjects were similar tions in which employee: how fi 
such characteristics as mean age, mean seniority, they “felt bothered” abou: r of job-related 
r present grade, unexcused absenteeism matters (see Kahn, Woli: Snoek, & Rons- 


level, and average annual wage. The thal, 1964). Examples of o c the followings 

and control divisions are both located (a) not knowing just wh ple you work 

large cities, but are 120 miles apart. with expect of you; and that you have 

The subjects participating in the study were all too heavy a workload, or. can’t possibly 

bers of the United Steelworkers of America finish during an ordinary Employees re- 

Neither division has bad a strike for the sponded to each item on a ile (never te 
nearly all the time). 

The unexcused absence ra ulated om the 
basis of hours per month r he job, where 
the reason was not known by visor, 

A questionnaire was administered during working Four performance facets were quality, 
to the total operating employee population in level, effectiveness as a team nd knowledge 
divisions, The anonymous questionnaire was of job. The performance inde: ultitrait seale, 
eted at three different time periods over a 13- developed by the company, completed by 

period. These three time periods are desig- the supervisor of each emp dex was the 
d (a) Time Before, one month before conversion average of 10 (6-point) rat ting the em- 
4-40 workweek; (b) Time After 1, three ployee’s performance on the relative do 
3 and (c) Time After 2, 12 “other employees who are n- same work 
satisfaction was defined as the extent to which (SPC2rman-Brown split-halt ; a 
b rewards meet or exceed the employees’ per- Change ¢ tests were used e net dil 
of an equitable level of rewards on six job ences between the changes in n scores for 
=self-actualization, autonomy, personal worth, both the experimental and trol groups 
affiliation, job security, and pay (Porter & (McNemar, 1962). Although cha Time After 1 
er, 1968). and Time Before, and Time After Time After 
e oe 5 faat 1 = 8 1 scores were analyzed, only th After 2 — 
questionnaire, previously pret i i per: ‘he most redii 
E Garity and reliability in a third division of the vit poose net, difference cha 
pany. Examples of items are the following: (a) A 3 had 
is the opportunity to develop friendships in 
i company.” (b) “The compensation program for RESULTS 
r in jobs similar is fair.” n S 
ea ae a fers u 2 bt Tale 1 presents data . bee 


nt Likert scale (strongly disagree to strongly changes in job satisfaction and a 
Acceptable coefficient alpha reliability levels There was a small, but signilic 


TABLE 1 


Mean Scores, NET CHANGES, AND l VALUES FOR SATISFACTION 
FACETS AND ANXIETY STRESS VARIABLES 


Experimental division (» = 104) Control division (n = 106) | 


Net dif- 
Malats ia t value 
TBs | TA | TA: TA TR TB TA: Taa | Tare | HET 2 
351 | 358 ¼ 3157 00 3.47 3.46 | 3.53 06 0 | 0 
447 | 430 4448 01 4.34 | 439 4.38 o4 03 195 d 
441 4. 5.48 1.07 442 | 461 | 462 20 7 
418 4.30 4.97 40 440 4.48 | 4:41 01 -18 | igge 
361 | 3.95 | 438 77 3.72 3.68 | 3.79 09 70 16385 
3.04 | 346 | 421 r17 3.01 | 3.09 | 310 09 108 10085 
261 408 | 475 92 3.76 3.79 3.81 05 % „% 
% 2.42 | 1186 —.75 2.30 2.24 2.25 =.05 Ho b 
Time N mean score for the first seven variables indicates a larger amount, TB = Time Before; TA = Time After li 7 
ean score of data collected one month before conversion ti 
re of data three OEMS Stee oe ato 4-day, 40-hour workweek, 


nern Woakwaxx Esrects on Savispacnion axo Prevossance 


TABLE 2 


Mazax Scomms, Nev Craxars, aw ale roe eee 
ae axo Praromwance F Fern 
eee — — ' Äͤ = a 
Espetimesial dividos (s = 104) Control Gretee (e = WO 
~ Variati — —— — — — ä — 
To re | tae Til re | ras | the ta- 


Absences wo |as |as | -58 | ee | Ma (3i | -i 


fection Aas at 101 BA an an ant n 
uon aa Aw an u im Ma iss n 

= as uram 
7 ae a a 1 1 
ine of job aM An” A» as ao i ~~ . 
mance * 4 00 * au J aa am n 


The hiser the meng for alt od the varia — — — Th = Time 
N Time After 1; TA, = Time 2. 

one month before nene to b-day, = ee 
(bree mouths ox oan 


í data collected 1 12 moat ee after conver 
eres ces in changes for TA: = Te patted bn the experteentel aint ouat òl Gtvineun 


veral] satisfaction in the experi- perceived anxiety-stress index in the 
vision which had converted to the mental division, 
k schedule. Four of the job satis- Table 2 presents data relevant to the 
facets (personal worth, social affilia- excused absences and performance facets 
security, and pay) appeared to potheses. With respect to absences, the 

this general increase in satisiac- showed no significant changes in either the 
on re was no discernible increase in the 4-40 or the 5-40 divisions. Thus, H 

act ation and autonomy facets. In 3 was not supported. The supervisory per- 
ite of s latter finding, Hypothesis 1 was formance ratings of the employees showed 
generally supported. significant improvement for the ; 
As predicted in Hypothesis 2, there wasa group in the rate of production, eff 
ignifican' (but small) improvement in the as a team member, and overall m 


$ : 
TABLE 3 4 


CHANGES IN CORRELATION COEFFICIENTS OF BEHAVIORAL AND ABSENCE VARIARLES wit 
PERFORMANCE BETWEEN Tre Arrer 2 An Tow ron Data 


Variables 


8 8 888 


| 
S 


Tests of significance were computed after f to z transformation of correlation coefficients. 
ean score of data collected one month before 555 to 4-day, 40-hour workweek. ` 
lean score of data collected 12 months after conversi 

3 in changes for TA: — TB period in the ‘experimental and control divisions. 

«01, 
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when compared with the control group. 
Quality and job knowledge ratings showed no 
differences. Consequently, Hypothesis 4 was 
only partially supported. 

To test Hypothesis 5, which predicted sig- 
nificant changes in the relationships between 
overall performance and other indices of satis- 
faction and anxiety, the net correlation 
changes between the Time After 2 and Time 
Before were examined for the experimental 
and control divisions (see Table 3), The re- 
sults for the experimental groups showed some 
significantly greater changes than those oc- 
curring in the control group. The results sug- 
gest that stronger positive relationships be- 
tween social affiliation, job security, and pay 
with overall performance occurred in the ex- 
perimental division after the conversion to the 
4-40 workweek. However, there was no sig- 
= nificant improvement indicated for the self- 
_ actualization, autonomy, personal worth, 
 anxiety—stress, and unexcused absenteeism re- 

lationships with overall performance over 
time. Thus, only partial support of Hypothesis 
5 is shown. 


; 
l 
ü 


Discussion 


Overall, the results are generally supportive 
with regard to positive effects of the 4-40 
_ workweek on a number of the indices investi- 
_ gated. There appeared to be no negative 
effects on these indices. However, not all in- 
_ dices showed significant effects. The results 
_ emphasize the need to evaluate possible effects 
on a variety of different criteria. Thus, the 
findings with certain performance facets for 
the 4-40 division seemed to support the con- 
tention of advocates of this schedule that the 
shorter workweek may result in improved per- 
formance (Hall & Nougaim, 1968; Poor, 
1970), although not all aspects of performance 
_ showed significant change. : 
Furthermore, the present study indicated 
that after conversion to the 4-40 workweek, 
there were positive effects on anxiety levels 
and on some facets of job satisfaction. How- 
ever, there was no discernible improvement in 
Satisfaction in autonomy and self-actualiza- 
tion facets. These latter facets are those which 
Alderfer (1969) labels as “growth” facets. 
his finding is not consistent with the claims 
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of 4-40 advocates, 
effects of these facets 
Tarnowieski, 1972). 
reduction in absenteeis 
does not substantiate 
& Adelman, 1971; 
1970). 

These results were o 
in a single company, an 
be made with cautio: 
limited to a 13-month 
sible that some effects 
minish over a longer tin 
(1973) results suggest 


assumed positive 
zeler, Gurman, & 
ly, the lack of a 
the 4-40 group 
is claims (Nanus 
1970; Wheeler, 


n two divisions 
alizations must 
the study was 
and it is pos- 

w up or di- 
& Costigan’s 

ssibility, and 


we cannot rule out “Hawthorne 
effects.” 
The more positive re’ ‘ips in the ex- 


erformance and 
id pay might 
n effect. The 


perimental division bei 
social affiliation, job sec 
be partially the result © 


employees may have . | themselves 
more secure because they only plant 
in the organization that < 1 to a rela- 
tively modern 4-40 sche, re are, of 
course, possible interven: such as 
physical fatigue, attitudes sure time, 
job pace, etc. which could mined as 
moderators of these efec e studies 
might well examine these issu 
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Task-specific self-esteem is likely to be a func- 
tion of past experience with the task or simi- 


lar tasks. Socially influenced self-esteem is a 


function of others’ expectations of one’s be- 
havior. When others expect a person to per- 
form well, and communicate this feeling 
through their words and behavior, it is ex- 
pected that the person’s own feelings of com- 


petence will increase. 


: The three sources of self-esteem, it is theo- 
rized, determine a person's level of “self- 
perceived competence and ability for the task 


1 Requests for reprints should be sent to Jeffrey H. 


_ Greenhaus, Department of Management Science, 
Stevens Institute of Technology, Hoboken, New 
85 Jersey 07030. 


SELF-ESTEEM, PERFORMANCE, AND SATI TION: 
SOME TESTS OF A THEORY 

JEFFREY H. GREENHAUS + IRWIN J 

Stevens Institute of Technology Montclair S 
Two studies were conducted to investigate (a) the relationship | 
three sources of self-esteem outlined in a 1970 study by Korma 
task specific, and socially influenced—and performance on an an 
(b) the moderating effects of task-specific self-esteem on the relat 
tween performance and satisfaction. College undergraduates participa 
study. It was found that task-specific self-esteem predicted performar 
total sample and socially influenced self-esteem predicted performance 
authoritarian subjects. It was also found, as predicted, that performar 
to predict satisfaction only for high-self-esteem subjects. 

A recent theory of organizational behavior at hand” which, in turi affects the 
(Korman, 1970) has emphasized the role of level of task performa igure 1 in 
self-esteem in work outcomes. Within a bal- Korman, 1970, p. 35). 
ance theoretic framework, the general thrust The conceptualization ee sources 
of the approach is that, all other things being of self-esteem raises sev: ns. First, 

equal, high-self-esteem persons are motivated the question of the relatis tance that 
to perform well on a task in order to maintain each source of self-esteem h er formance 
their self-image of competence. Low-self-es- has been neglected. Prior re (see Kor- 
teem persons, on the other hand, are not moti- man, 1970, for a review of 1 of the re- 
vated to perform well since poor performance search) has only focused on effects of 
for them is consistent with their self-image each source separately. Se: it would 
of relative incompetence. seem that the most potentia source 

A particularly unique aspect of Korman’s of self-esteem from an organi. point of 
approach is his postulation of three sources view is socially influenced + teem. It 
of self-esteem. Chronic self-esteem is seen as would, therefore, be particulary ne pful to 
“a relatively persistent personality trait that determine whether practices designed to in- 
occurs relatively consistently across various crease socially influenced self-esice =i are more 
situations [Korman, 1970, p. 32].” Task- successful for certain types of employees than 
specific self-esteem refers to an individual’s for others. It is possible, for example, that 
feelings of competence for a particular task. socially influenced self-esteem should have 


more of an effect on performance for high- 
authoritarian persons than for low-authotl: 
tarian persons. This prediction is based on 
two factors: (a) that a practice designed to 
increase self-esteem may be conceived of as 
a social influence attempt; and (2) that there 
is some evidence (Hovland & Janis, 1959) 
that high authoritarians are more susceptible 
to influence attempts than low authoritarian: 
In addition to the direct effects self-estee> i 
may have on performance, Korman (1970) 
has also viewed self-esteem as a moderatot of 
the relationship between performance an 
satisfaction. He has hypothesized that p% 
formance should predict satisfaction only for 
high-self-esteem persons, whereas satisfaction 
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should t performance only for low-self- 
esteem ms, 
The component of this moderator 


effect ormance leads to satisfaction only 
for hi; f-esteem persons) is based on the 
notion t for high-self-esteem persons, the 
better performance, the greater the bal- 
ance their self-concept of competence, 
and t ore the more satisfaction with the 
task use success and competence are not 
centr » the self-concept of low-self-esteem 
perso heir task performance should not 
have important impact on their satisfac- 
tion. ne low-self-esteem person may derive 
his s. -faction from factors other than effec- 
tive p : formance (Greenhaus, 1971). 


Th. second component of the moderator 


effec! (satisfaction predicts subsequent per- 
form: ce only for low-self-esteem persons) is 
base. on a presumed differential reaction of 
high and low-self-esteem persons to a state 
of d` satisfaction with a task. Since it is as- 
sum that individuals seek outcomes that 
are balance with their self-concept, high- 
self- teem persons will be motivated to 
ach’ ve a high level of performance on a sub- 
seq: ont task regardless of their satisfaction 


witi 
saic 
tior 


a prior task. Considering what has been 
about how performance affects satisfac- 
for high-self-esteem persons, we can ex- 
pec: the high-self-esteem person who is satis- 
ñed with a task (primarily resulting from 
good performance) to continue to perform 
well on a second task and the high-self-esteem 
person who is dissatisfied with a task (pri- 
mevily resulting from poor performance) to 
improve his performance on a second task 
to « Jevel about equal to that of the satisfied 
high self-esteem person. 

For low-self-esteem persons, however, dis- 
‘Satisfaction is presumed to result in the 
Sterectypically poor performance of the dis- 
gruntled employee (Korman, 1970). There- 
fore, we would expect a decrease in the 
Performance of dissatisfied low-self-esteem 
Persons and no change in the performance of 
Satisfied low-self-esteem persons, that is, a 
Positive correlation between satisfaction and 
Performance. 

Two studies were conducted to investigate 
the variety of predictions outlined above. The 
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first study was concerned with the main 
effects of self-esteem. It was predicted (Hy- 
pothesis 1) that there would be positive rela- 
tionships between each of the three sources of 
self-esteem and task performance, It was 
also predicted (Hypothesis 2) that the rela- 
tionship between socially influenced self-es- 
teem and task performance would be greater 
for high-authoritarian subjects than for low- 
authoritarian subjects. In the second study it 
was predicted (Hypothesis 3) that perform- 
ance would be positively related to satisfac- 
tion with the task only for high-self-esteem 
subjects, whereas (Hypothesis 4) task satis- 
faction would be positively related to per- 
formance on a second task only for low-self- 
esteem persons. 


EXPERIMENT 1 
Method 


Subjects 


The subjects in this study were 144 undergraduate 
students (75% male) enrolled in four sections of 
industrial and organizational psychology at an eastern 
state college. 


Research Instruments 


Chronic self-esteem was measured by the self- 
assurance scale of the Self Description Inventory 
(Ghiselli, 1971). Task-specific self-esteem was mea- 
sured by five items, all intended to tap the subject’s 
self-esteem for a particular task, solving anagrams. 
Specifically, the subject rated (a) his ability in solv- 
ing anagrams (compared to solving arithmetic and 
mechanical problems); (6) his aptitude for solving 
anagrams; (c) an estimate of what his performance 
might be on an anagram task; and (d) his capability 
in solving anagrams. The (alpha) reliability of the 
task-specific self-esteem scale was .72. 

Socially influenced self-esteem was manipulated 
through differential instructions given to describe the 
nature of the study. Specifically, the instructions 
given to high socially influenced self-esteem subjects 
were as follows: 


The task you are about to begin has been given 
to a nation-wide sample of people. It has been 
found that college students in New Jersey like 
yourself do very well on this task. In fact, they 
do well above the national average. I expect that 
you will also do rather well on this task. 


The instructions given to low socially influenced 
self-esteem subjects were as follows: 


The task you are about to begin has been given 
to a nation-wide sample of people. It has been 
found that college students in New Jersey like 


ihe n : < 
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ot do very = on this task. In The data provide r Hypothesis g 
well w national average. 3 aty k 8 
r te 1 * * influenced 
T am asking you to take it, however, so Sell-esteem and per or the ‘high 


were product-moment correlations with values 
one and two representing low and high self-esteem, 
respectively. 
Authoritarianism was measured by Form 40 of the 
F scale (Adorno, Frenkel-Brunswik, Levinson, & 
ord, 1950). In developing low- and high-authori- 
tarian subgroups, the sample was dichotomized at its 
‘median (112). The performance task used consisted 
of 20 S-letter anagrams taken from Mayzner and 
‘Tresselt’s (1958) list of difficult sequences. The cri- 
terion of task performance was the number of ana- 
_ grams successfully completed in a 15-minute time 


ned to the high socially influenced self-esteem 
tion and the two remaining classes (n= 77) 
assigned to the low socially influenced self- 
em condition. Following the differential instruc- 
ns (depending on the experimental condition), 
ich subject received an identical questionnaire book- 
t which contained in the following order: chronic 
Self-esteem scale, task-specific self-esteem scale, F 


scale, and anagram task. After completing the book- 
let, subjects were told the true nature of the study. 


x Results 


able 1 presents the intercorrelations 
among the major variables of the study. It 
should be noted, first, that the three sources 
self-esteem were virtually uncorrelated. 
o, regarding Hypothesis 1, only task-spe- 
c self-esteem was significantly (p < 01) 
lated to task performance. 


TABLE 1 


‘TIONS BETWEEN SOURCES OF SELF-ESTEEM 
AND Task Performance (N = 144) 


; Socially- 
Helen, | infuenced | peak Per 
self-esteem 


—.03 09: 3 
= —.10 —.16 
= .09 


authoritarians was 

for the low authoritari 
ns). The difference bi 
was signficant (z = 1.8 


, $ < 02) and 
~.02 (n= . 
ve correlations 
5, one-tailed), 


Discu 
Task-specific self-est: he only sig- 
nificant determinant « ance. How. 


ever, as predicted, the hip between 
socially influenced self d perform 
ance was significant {i ithoritariag 
subjects. This finding m: veral impli- 
cations. First, an organ ight decide 


ntervention 
employees 
success, 
jues might 
mployees. 
useful to 
eem as @ 
ttempt. In 
perform- 


to concentrate its sel 
techniques on high-au:! 
where there is a higher p: 

or, alternatively, differe: 

be considered for low-aut! 

In a theoretical vein, it : 
study socially influenced 
particular type of social influ 
this way, subsequent attitude 
ance) change could be inve 
of such variables as the char: 
communication and commun! 
sonality predisposition of the c: 
potential influence of group pres 


EXPERIMENT 2 
Method 


Subjects in the second study were 6! undergradu- 
ates in three sections of industrial and organizational 
psychology at an eastern state college. The second 
study concentrated on the moderator effects of task- 
specific self-esteem since it was the best predictor of 
performance in the first study. Task-specific self. 
esteem was measured by the same scale used in the 
first study. In order to test the impact of perform. 
ance on satisfaction and satisfaction on later per- 
formance, it was necessary to develop two similar 
tasks separated by a measure of task satisfaction. The 
original 20-item anagram task used in the first study 
was, therefore, separated into two 10-item tasks. 
10 odd-numbered anagrams constituted the first task 
and the 10 even-numbered anagrams were used i 
the second task. Task satisfaction was measured bY 
the subject’s response to the question, “How much 
did you like doing the anagram task?” Responses 
were made on a 5-point scale from “liked it very 
much” to “disliked it very much.” 5 ; 

Subjects received a booklet which contained in the 
following order: task-specific self-esteem scale, 
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N PERFORMANCE FROM Tass | ro Task 2 as a Foweriow oF eee eee wers Tasa I 
any Tase-Srecinec Sarr-esrian 


: satisfaction with Task 1, and Task . After 
completed the first task (but before they 
‘ir satisfaction), the experimenter read the 
inswer to each anagram and, in order to 
feedback on their relative success, told sub- 
it a score of 6 of more correct meant they 
ssed” the test and a score of $ or less meant 
y had “failed” the test. (These cutoffs were 
n score distributions on a pretest of the 
After hearing whether they passed or failed 

ks, subjects rated their satisfaction with the 

id went on to the second anagram task. 


Results 


order to test Hypothesis 3, subjects were 
dicotomized into high task-specific (u = 37) 
mi low task-specific (m= 24) self-esteem 
i (The dichotomy was approxi- 
nately a 60%-40% split because the distri- 
bution did not permit a split exactly at the 
ne lian, and a random allocation of subjects 
nez- the median into the high and low sub- 
gro os was rejected.) Then, within each 
f steem subgroup, correlations between 
performance (raw scores) on Task 1 and sat- 
action with Task 1 were computed. The 
Performance-satisfaction correlation for the 
J high task-specific subjects was .50 ( < .01) 
and for the low task-specific self-esteem sub- 
jects was .17 (ns). However, the difference 
between the two correlations (2 = 1.4) was 
nonsignificant. 
Hypothesis 4 predicted that satisfaction 
Should correlate with subsequent performance 
for low-self-esteem persons but not for high- 
Self-esteem persons. Within each level of task- 
Specific self-esteem, correlations between satis- 


OSS sO ee 
Task 1 Tak? 
=.» O ——— N 4 - ~ 
Satisfaction 
with Task 1 " Performance py hal ogg 1 ‘ 
£ | sv | £ | sp | 
= — — —Ab — — —— — 
Satisfaction 20 | 44s | 226 e | san | 299 | 290 
Dissatisfaction | 17 | 247 | 141 | Increase | zu | 1% | <r 
Satisfaction 11 4 | 199 | Reminame | 291 12 171 
| Dissatisfaction 13 $00 | 187 Decrease 1 | H 1 
| ól | 348 | 20 | 249 | 210 


faction with Task 1 and performance on Task 
2 were computed. For low task-specific self- 
esteem subjects, the satisfaction-performance 
correlation was .37 (p< 05, one-tailed), 
whereas for high task-specific self-esteem auh 
jects the correlation was .20 (ns). The differ- 
ence between the correlations was nonsignifi- À 
cant. ] 
Table 2 presents changes in performance 
from Task 1 to Task 2 for subjects sub- 
on task-specific self-esteem and satis- 
faction with Task 1. Perhaps the most obvious 
finding is that all of the subgroups decreased 
in performance from Task 1 to Task 2. An 
inspection of the performance changes of the 
four subgroups provides little support for 
Hypothesis 4. 
It might be noted parenthetically that al- 
though Experiment 2 concentrated on be 
moderator effects of task-specific self-esteem 
because it significantly predicted performance — 
in Experiment 1, the results for Experiment 2 
were not so strong. In fact, the correlation 
between task-specific self-esteem and perform- 
ance in the latter experiment was .14 (com- 
pared to an r of 31 in Experiment 1). It 
should also be noted that there was a restric- 
tion of range on task-specific self-esteem in 
Experiment 2 (SD = 2.42) compared to Ex- 
periment 1 (SD = 3.14). 


Discussion l 


The data in Experiment 2 lend tentative 
support to at least one of the moderating 
effects suggested by Korman (1970). Per- 
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formance did influence satisfaction only for 
high-self-esteem subjects. This result not only 
1 tends to confirm Korman's (1968) earlier 

finding but extends the generality of the hy- 
_ pothesis since in this study task-specific and 
not chronic self-esteem served as the moder- 
n ator, It should be remembered, however, that 
me difference between the performance-satis- 
faction correlations for high- and low-self- 
esteem subjects was nonsignificant. The other 
moderator effect, satisfaction predicts per- 
formance only for low-self-esteem subjects, 
received little support. 

It should be noted that the measure of 
satisfaction used in the present study was 
specific to the task. Perhaps future research 
in industrial settings would benefit by using 
either a more general measure (e.g., Bray- 
field—Rothe) or a multifactor measure (e.g., 
Job Descriptive Index) of satisfaction. 


OvERALL COMMENTS 


The results of the two studies suggest the 
important role self-esteem may play in shap- 
ing task performance and satisfaction. The 
main effect of task-specific self-esteem in Ex- 
periment 1 may have implications for organi- 
zational practice. Assuming that the posses- 
sion of relevant abilities is at least moderately 
related to task-specific self-esteem, effective 
selection, classification, and training systems 
can generate a work force with a relatively 
high degree of self-perceived competence. Per- 

haps any experience that provides for skill 
acquisition, mastery, and psychological growth 
= should contribute to employees’ level of task- 
specific self-esteem, 
Task-specific self-esteem seems related to 
other notions of expectancy (Porter & Lawler, 
1968; Vroom, 1964) since they all refer to 
perceived probabilities of effective perform- 
ance. Since expectancy theory proposes an 
interactive relationship between expectancy 
and valence, to the extent that task- specific 
self-esteem is raised, improvements in an or- 


JEFFREY H. GREENHAUS AND IRWIN J. Ba! 


ganization’s reward e (leading to a 
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ght be expected, 
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for example, that a se of high so- 
cially influenced self vould result in 
higher levels of emplo: ronic and task- 
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y could foster 
(I am compe 
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The usefulness of “certainty of respon: 
models was tested in the context of 
the act.” The direct evaluation of 


combinations of outcome evaluation, 


Ins: mentality-expectancy theory, as for- 
mulat by Vroom (1964), Porter and Lawler 
(1965 Graen (1969), Fishbein (1967), and 
other has as a basic tenet that behavior is 


licative function of the expected con- 
es of an action and the individual’s 
ion of those consequences. While de- 

the theories differ, an expectancy x 
term is common to all, whether it is 
“force? (Vroom, 1964), “attitude 
towa the act” (Fishbein, 1967), or “in- 
terns’ pressure” (Graen, 1969). 

Gen (1969) has, however, found that 
certain boundary conditions limit the gen- 
erality of expectancy theory. Specifically, he 
ound that behavior was predictable from ex- 
ctancy theory only in a “reciprocating cli- 
yate”—where performance is regularly evalu- 
atel, good performance rewarded, and the 
tele ant contingencies communicated to the 
Worker. Camman and Lawler (1973) reported 
data consistent with this finding, though the 
proposition was not directly tested. 
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OF CERTAINTY WEIGHTS 


IN EXPECTANCY THEORY * 


FELDMAN? 
ment, University of Florida 
sc"? as a moderator variable in expectancy 


Fishbein’s construct of “attitude toward 
five job-seeking behaviors by a racially 


and socioeconomically heterogeneous sample (W = 200) was predicted from five 


outcome expectancy, and response cer- 


tainty. Contrary to the hypothesis, the simple sum of the evaluations of the 
outcomes of each behavior best predicted the directly measured evaluation 
Results are discussed in terms of possible modifications of the expectancy model 
and the concept of uncertainty as reflected in subjective probability estimates. 


A reciprocating climate is taken to be a 
climate in which stable subjective probabili- 
ties (expectancies) can be formed. If rein- 
forcement contingencies continually change in 
a fashion perceived by the workers to be en- 
tirely random—for example, where a depart- 
ment is continually being reorganized—then 
stable estimates of behavior-reward proba- 
bilities will most likely not be formed, Accord- 
ing to Graen (G. Graen, personal communica- 
tion, 1971) expectancy models cannot predict 
attitudes or behavior if people cannot form 
stable expectancies, as in a nonreciprocating 
climate. This perceived instability ought to 
be reflected in the certainty statements given 
for the expectancy measures. Thus, Graen 
suggests the use of a “certainty of response” 
measure, used to weight the expectancy esti- 
mate, in the computation of internal pressure. 

The use of certainty weights ought to in- 
crease the validity of the model, since more 
certain responses would receive more weight 
in the equation. This suggestion is similar to 
Brigham’s (1971) proposal that the validity 
of stereotype measures in the prediction of 
interpersonal behavior could be improved by 
the use of a certainty weight. Some support 
is provided by Sample and Warland (1973), 
who found that student-election voting be- 
havior could be more accurately predicted 
from attitudinal measures for students who 
were highly certain of their responses. 

Fishbein’s (1967) construct of attitude 
toward the act can be used as the basis for a 
test of the certainty hypothesis. Fishbein 
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(1967) defines attitude toward the act as the 
evaluation of an act in a situation; theoreti- 
cally, it combines in a linear multiple regres- 
sion equation with a measure of normative 
beliefs about a behavior, to predict the in- 
tention to perform a particular act. These 
“behavioral intentions” mediate between the 
model and measures of overt behavior, (For 
further discussion and a review of the evi- 
dence pertaining to the model, see Ajzen and 
Fishbein, 1973.) 

Attitude toward the act is a multiplicative 
function of beliefs about the consequences of 
an act (B.) and the evaluation of those con- 
sequences (at). In the model, B; corresponds 
to an expectancy, or subjective probability, 
that a behavior will be followed by an out- 
come. As Feldman (1973) has pointed out, 
attitude toward the act and Graen’s (1969) 
concept of internal pressure seem to be vir- 
tually identical. 

Given the isomorphism of attitude toward 
the act and internal pressure, it is likely that 
anything affecting the validity of an expec- 
tancy estimate would also affect the relation- 
ship between the theoretical estimate of atti- 


tude toward the act, > (Bat), and a direct 
i=l 


measure of the evaluation of an action. Thus, 
if a certainty of response measure does reflect 
the degree to which a subject places confi- 
dence in his probability estimates, then a 
multiplicative certainty weight for each B; 
should improve the correlation between the 
above theoretical estimate and direct measure 
of attitude toward the act.? Denoting cer- 
tainty as C;, the question becomes: Which 
correlates more highly with a direct measure 
of attitude toward the act, (Bias) or 
=(C,Bia;)? 

However, this question ignores the issue of 
whether sum of products models generally 
provide the best estimates of directly mea- 
sured evaluation. To assess the validity of the 
expectancy model in general, it is necessary 
to compare the correlations obtained by sev- 


Note that, to the extent that some theoretical 
estimate of the attitude toward the act correlates 
with a direct measure of the attitude toward the act 
for a given behavior, the convergent validity of atti- 


tude toward the act as a hypothetical construct has 
been demonstrated, 


Jack FELDMAN 


eral different combi 
variables: expectanc) 
tion (a:), and certain 
sized, therefore, that 
most highly with dire 
toward the act, follo 
3(C;,B;), and B.. Thi ion is based on 
the assumption that t > combinations 
contain successively g amounts of 
information. 


of the following 
outcome-evyalua- 
It is hypothe- 
27) will correlate 
-asured attitude 

(Bag, Xa, 


Subjects 


Subjects were 200 male 
the ages of 18 and 50. Ap; 
into each of the following 
unemployed, black work 
unemployed, and white wo: 


inteers, between 
50 subjects fell 
lack hard-core 
hite hard-core 


Measures and Procedur: 


ehaviors were 


The evaluations of five job 


used in testing the hypothesi: ere (a) going 
to an employment agency (age b) going from 
place to place filling out applic: applications); 
(c) trying to join a training pro training), (@) 
reading want ads (ads), and nging around 
with friends until somethin up (friends). 
These behaviors had been mos tly mentioned 
by a pretest sample of blacks a: tes as typical 
job-seeking activities. g 

Following Graen (1969), iniri outcomes 0 
each behavior were also elicit m the pretest 
sample. “Intrinsic” in this case ns intrinsic 0 


i to “name 
e you (for 
ency.” Thus 
outcome of 
“filling out 
appointment,” 
omes of ea 


the behavior itself. Subjects v 
things that happen to you 
example) go to an employr 
“getting a job” would not be an i 
going to an employment agency, 
long applications,” “waiting for 
etc., might be. The five intrinsic ¢ t 
behavior most frequently mentioned by the pretes 
sample were used. : five 

Evaluations of each of the 25 outcomes and í 
behaviors were made on the nine-alternative 7a 
Kunin’s (1955) Faces Scale, presented on a flashea 5 
This scale provides an approximately edu a 
measure tapping an affective response dimensi 

Feldman, 1972). 
: 1 (subjective probability) was measured 
by asking each subject to estimate the chances 19105 
of receiving each outcome, having done the behan 4 
Subjects responded as if they were setting odds 0 
bet. No visual rating scale was used. 

*This methodology section is necessarily he > 
Interested readers are referred to Feldman 111 
where full details of the procedure, subject pop 
tion, etc. are presented, 


Certair measured after each expectancy re- 

x s were shown a flashcard with a 

three-alt è scale, ranging from “very sure this 

is right’ » “don’t know at all” (1), and chose 

the rest at best represented their confidence in 
the expe y estimate. 

Subje re familiarized with the task and given 
practice s before data collection began. Those 
who c t comprehend the task after practice 
were . Data were collected as part of a 
longer view, two to four hours long. Interviewers 
were same race and sex as the subjects and 
had be ined in neutral tones of voice and man- 
nerism ıl details of the interview procedure, 
practic ems, scales, interview schedules, and va- 
lidity on the measures can be found in Feldman 
(1972 

RESULTS AND DISCUSSION 
Sin race and social class differences are 


irrele 
perfo 
200) 
each 
corre 


nt to the hypothesis, analyses were 
ed on the combined sample (N = 
The directly measured evaluation of 
ehavior (attitude toward the act) was 
ted with each of five predictors: 


1. he sum of the expectancies of obtaining 
each of the five outcomes of each behavior 
(* 

2 the sum of the products of each ex- 
pec’ ncy times its corresponding certainty 
Wei-ht, taken across the five intrinsic out- 
cor es of each behavior [¥(C,B;) |; 

. the sum of the evaluations of the five 
inte insic outcomes of each behavior (Sa;); 
4, the sum of the products of evaluation 
of each outcome times the expectancy of ob- 
taining that outcome [3 (B;a;) | across the five 
ou comes of each behavior; 

5. the sum of the products of expectancy 
times certainty times evaluation of each out- 
come, taken across the five outcomes of each 
behavior [3 (C;Bia;) J. 


Thus, the hypothesis was tested five times, 
once for each of the five job-seeking be- 
haviors. These tests are not independent, how- 
ever, since the same subjects responded to 
all items. 

As Table 1 shows, the hypothesis was not 
Supported. The best predictor of directly 
Measured attitude toward the act was the 
Simple sum of the evaluations of the intrinsic 
utcomes of each behavior, X (at). The addi- 
tion of expectancy as a multiplicative weight, 
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TABLE 1 


CORRELATIONS OF ALTERNATIVE Premecnox Moons 
with Direerty Measure EvaLvanox or Five 
Jon-SkEkine Benaviors 


Behavior 
Predictor a a 
Applica- | Train- 1 
Agency Ads | Friends 
tions ing | lt — 
XB; 18° — 420 22 is — 10 
(CA. ae —.18* =. 31000 | —,29008 | = 10 
Zai 7500" onrer sgese | saose] 35.5 
208.20 ee sjees} 2700s] 199 
(CB | 6490 4 n ae 


i 


Note. N = 200. All correlations are rounded to the second 
decimal. Negative correlations reflect the negative evaluations 
of several outcomes for cach behavior. These signs are natu- 
tally reversed when the evaluations are specifically accounted 
for in the last three prediction models, 


(Ba, reduced the obtained correlation in 
every case, significantly so for all behaviors 
except training (agency, p < .05; applica- 
tions, p< .01; ads, p < .01; friends, p< 
.05). The addition of multiplicative certainty 
weights, (CB), further reduced the cor- 
relations, though the differences between 
X(B,a;) and 3(C,B,a;) as predictors of atti- 
tude toward the act were not significant for 
any behavior.“ 

Though attitude toward the act appears to 
be a meaningful construct, predictable from 
the sum of the evaluations of the act’s in- 
trinsic outcomes, the underlying expectancy 
model may need modifications. Kanouse and 
Hanson (1971), for example, cite a number 
of studies in the risk-taking area which sup- 
port the proposition that the reinforcement 
value of a potential loss is a more important 
determinant of behavior than the probability 
of the loss itself. Since several of the out- 
comes of each behavior in the current study 
were evaluatively negative, the same process 
may be operative in decision making under 
uncertainty, That is, even where an actual 
risk is not apparent, people may act to avoid 
a potential loss (in this case, of time or op- 
portunity). This avoidance would be reflected 
in their evaluation of the behavior, as in the 
data of Table 1. 


5 These results replicate across different attitude 
scales, though some method variance is present. A 
discussion of race and socioeconomic class effects on 
these relationships is found in Feldman (1972, 1973). 


2 


o 


It appears from these data that certainty 
weighting decreases the validity of the ex- 
pectancy model. Two possible explanations 
should be mentioned. 
First, certainty may already be represented 
in the expectancy estimate. Some intermediate 
expectancy may represent maximal uncer- 
_ tainty, while deviations from this point in 
either direction represent increasing certainty. 
This would suggest the use of a bipolar mea- 
Sure such as Fishbein’s (1967), rather than a 
_ direct probability estimate, in future research. 
Second, certainty may not be relevant to 
the measurement of subjective probabilities, 
since the individual is already assumed to be 
giving his “best guess” as to the state of the 
world. Certainty might be more relevant to 
statements such as “Current economic policy 
is beneficial” than to statements of the form 
“The chances are 1 in 10 that inflation will 
decrease this month.“ 
It appears from a closer analysis of the 
data that both of the above explanations may 
be valid, the former somewhat less frequently 
_ than the latter. Polynomial regression analy- 
es were run on each of the 25 outcomes of 
the five behaviors, testing the significance of 
the linear, quadradic, and cubic components 
of variance in the regression of expectancy on 
certainty. In each of 25 cases, the linear com- 
_ ponent was significant at or beyond the 01 
level. In 10 of 25 cases, the quadradic com- 
_ ponent was significant at or beyond the Os 


= 


level, generally controlling less variance than 
the linear component. The cubic component 
was significant (p < .05) in only 3 of the 25 
cases. Generally, where the quadradic com- 
“ponent was significant, the lowest degree of 
certainty occurred between 2 and 4 on the 
_ expectancy scale, although in two cases the 
_ inflection point was close to 7.8 
Iwo general conclusions may be drawn. 
First, certainty weights for subjective prob- 
ability estimates are not likely to prove use- 
„either in terms of predictive validity or 


6 Complete data, omitted for reasons of space, may 
e obtained from the author. 
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in aiding our unde: 
tion and decision 
weights have pro. 

operationally simi jective 
judgments (i.e., S Warland, 1973), 
Second, subjective lities ( 
cies) may well be i to the formati 
of attitudes toward (and, by img 
cation, behavioral i ind actual d 


g of attitude 
Where 
l, they have 


pect 


cisions) where negat ves of a behs 
ior exist. Further st both labora 
and field settings ar: to uncover 
complex interactions by the presenti 


data. 
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D of Management and 


ingency model of lead- 

ve of the most exten- 

urrent approaches to 
ı esperimental research in 
) ild settings (Chemers & 
ek, 1972; Fiedler, 1968, 1971a, 1971c; 
edler & Chemers, 1974) has shown that in 
ertain types of ions, the two types of 
pader—task motivated, or low LPC (least 
ferred co-worker), and relationship moti- 
ated, or high LPC—are differentially effec- 


everal j ‘raham, 1968, 1973; 
65; Sashkin, 1971, 
together, suggest pos- 

hanisms for the differ- 


ible explanatory m 


: sported by the firm in which 
ed while the latter two 
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{ vere computed between a manager's Least Preferred Co-worker 
and 17 psychological measures and moderated by subordinates’ 
vith the manager in 34 offices of a nationwide brokerage firm 
ended to support the hypothesis that correlations between LPC 
ological measures should be opposite in direction for bigħ- versus 
on offices, The hypothesis was directionally confirmed for 11 of 17 
clations, and the differences were statistically significant for $ of 
The results also support Ficdler’s interpretations of effective 
or and the relation between LPC and behavior, 
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ential effectiveness of high- and low-LPC 
leaders in the different situations, 

Fiedler (1967) created a set of eight situa- 
tion types, ranging along a postulated scale 
of “favorableness for the exercise of leader- 
ship,” that is, how easy or difficult it would 
seem for a leader to get subordinates to do 
what the leader wants. When the situation is 
high in favorableness (Types 1, 2, and 3), the 
low-LPC leader performs better than the 
high-LPC leader. What seems to happen is 
that the low-LPC leader perceives the “case” 
of the situation and the liklihood of task at- 
tainment. Thus, feeling secure that his needs 
will be met, the leader feels able to attend to 
interpersonal relationship factors. This is done 
by using interpersonal, “socio-emotional” be- 
haviors to reinforce subordinate task behav- — 
iors which seem appropriate, while maintain- 
ing a normal level of task structuring or 
directive behavior. The outcome is effective 
group performance. The high-LPC leader uses 
a similar implicit logic. To this leader the 
situation “looks good,” in general, in terms 
of meeting the leader’s needs, since leader- 
member relations in very favorable situations 
are quite good. The high-LPC leader can, 
therefore, concentrate on task factors. 

This is done by using directive, “initiating,” — 
or “structuring” behaviors, while maintaining 


M. Sasuxin, F. C. Taytor, anp R. C. Tr: 


a 

anormal, general level of relationship-oriented 

- behaviors. The result is poor performance, 
since (@) the structuring behaviors are not 
rewarding for subordinates, and (5) the ap- 
propriate focus for the leader is on relation- 

_ Ship-oriented behavior, since the task is, by 

the nature of the situation, likely to be pro- 
gressing smoothly. 

In situations of moderate favorableness 
(Types 4 and 5) or moderate unfavorableness 
(Types 6 and 7), high-LPC leaders are found 
to be superior to low-LPC leaders. According 
to Fiedler (1971a), the leader in such moder- 
ate situations becomes aware that his needs 
may not be satisfied. The high-LPC leader 
responds by focusing behaviors on interper- 
sonal relationships, the primary means this 
type of leader has developed for attaining 
need satisfaction, Since a major factor in 
these moderate situations is, supposedly, prob- 
lems concerning interpersonal relations (Fied- 

ler, 197 la, p. 17), the result is good perform- 
_ ance, The low-LPC leader becomes similarly 
_ concerned, and responds analogously, by fo- 
_ cusing on task-structuring or directive behav- 
_ lors. This focus, however, is not appropriate, 
and the outcome is poor performance. 

Finally, there are situations so grossly un- 
favorable (Type 8) that only by highly direc- 

tive leadership behavior can anything at all 
be accomplished. The low-LPC leader, en- 
_ gaging in directive, perhaps authoritarian, be- 
haviors then proves more effective than high- 
LPC leaders, although the results may be only 
temporary. 
Fiedler's theory and empirical model have 
not been without criticism, on a variety of 
grounds, primarily (a) the theoretical logic 
(McMahon, 1972), (5) the methodology of 
bis research (Graen, Alvares, Orris, & Mar- 
tella, 1970), (c) the validity of his results 
_ (Graen, Orris, & Alvares, 1971), and (d) the 
utility of the model (Duffy, Kavanagh, Mac- 
Kinney, Wolins, & Lyons, 1970; Sashkin, 
1972). While these issues are to a degree in- 
terrelated, it is useful to discuss each one 
briefly. 


Theory 


i McMahon (1972) has presented an excel- 
ent and thorough review of theoretical prob- 


s 


lems. One of the 
cerns the nature 
measure. Fiedler’s 
logic, not evidence 
any claim that LPi 
ality attribute, Fie 
Chemers, 1974) ha 
that LPC yields “an 
goals and needs” o 
portant personality 
measured by the u 
[Fiedler & Chemers, 1 

The problem is that 
other psychological m. 
usual. Fiedler and oth 
tently unable to show 
tionships between LP 
wide variety of comm 
measures (Fiedler, 196 
might logically relate to 
vational hierarchy or ta 
orientation. While Mitch: 
onstrate some consistent 
tween LPC and a measure 
plexity very similar to th: 
Scott (1962), it is not cle 
complexity is, in itself, a2 
factor. Furthermore, the < 
cept of cognitive complexit, 
to the LPC measure. Going 
(1967) earliest work in 
centered on the leader’s abilii. 
among subordinates, one find } 
as concrete developmental ties between B 
and cognitive complexity. Even so, a ree 
study by Larson and Rowland (1974) fa 
to replicate Mitchell’s findings. s 

One other attempt to interpret the E 
measure is found in the report by Fishbe 
Landy, and Hatch (1969). Using an 1 
tudinal analysis, these authors seemed to l 
gest that LPC may measure an individu 
attitude toward some specific other indi 
—a least preferred co-worker—and n 
more and provided some support for this ~ 
tion. In sum, Fiedler’s (1967) note that 
were in the peculiar position of having a sce 
which seemed to correlate with nothin 
group performance [p. 46]” holds almos 
well today as when written. 


us of these} 
ing of the 


initially 

es some p 
72; 
ently 
1e hierarchy 
vidual, an 
‘which is 
ological 


hodology 


ethodological issues have been 
ertain critics of Fiedler work 
ı et al, 1970, 1971). Fiedler 
71c), however, presented detailed 
gh counterarguments, These pre- 
as well as tightly designed and 
laboratory studies (eg, Sashkin, 
2) which meet Fiedler’s (1971b) 
specifications and which yielded 
pport for the contingency model, 
o conclude that Fiedler’s research 
-the basic design and approach, not 
ly specific measures or studies—ade- 
meet the requirements for rigorous, 
research. 


third problem area deals with the em- 
validity of the contingency model. 
et al. (1970, 1971) labeled this an 
of “antecedent versus evidential” va- 
That is, the model was derived em- 
piri. ly from the numerous experimental re- 
sul: . but does it, in fact, have predictive 
val ity? While Graen et al. (1971) attempted 
to now that it does not, Fiedler (1971b) has 
pointed out serious flaws in the research de- 
sig and method which Graen et al. used in 
th ir evidential tests. The predictive test evi- 
deace presented by Fiedler (1971c) and 
others (e.g, Chemers & Skrzypek, 1972; 
Fiedler & Chemers, 1974; Sashkin, 1972) in 
field and laboratory studies indicates quite 
clearly that the model does in fact predict 
le dership effectiveness, although to varying 
degrees, depending on which of the eight situ- 
ations one refers to. 


piric 
Gra: 
issu: 
lidi: 


Utility 

Finally, there is clearly a question of 
Utility, or practicality. Even if we assume that 
the model is fully accurate and predicts pre- 
cisely as it states, the median correlations be- 
tween LPC and group performance, for each 
of the eight situations, range between +.13 
and +.59 (Fiedler & Chemers, 1974). In 
terms of the variance in performance attri- 
butable to LPC score, the percentages range 
from about 0%, at worst, to about 36% at 
t. If we assume that actual situations fall 
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in equal proportion into the eight types 
(which is, obviously, ridiculous), the model 
would, on the average, account for about 11% 
of the variance in group performance. While 
this is nothing to scorn, neither is it a radical 
breakthrough in social engineering, particu. — 
larly since Duffy et al. (1970) showed in a 
field study that the predictive validity of the 
model is seriously reduced when performance — 
is measured in terms of supervisory ratings 
rather than “hard” performance criteria. 


Tux Parsenr Srupy 
Issues and Aims 


We are concerned here primarily with the 
first of the problems reviewed above and, io 
a lesser extent, with the third and fourth. We 
shall briefly detail our approach to these 
issues. . 
What does LPC measure? Given the evi- 
dence or, more appropriately, the lack thereof, 
Fiedler’s interpretation of the LPC measure 
must be considered speculative, at best. Vet, 
LPC must measure—and mea 
for it seems reasonably clear that leaders“ 
LPC scores are predictive of group perform- 
ance to a consistent, and in many cases sig- 
nificant, degree. Examination of relationships 
between the measure of interest and a variety 
of other measures is one common approach to 
understanding the meaning of a psychological 
measure. This approach, however, has yielded 
little in the way of explanation or understand- 
ing of the LPC measure. i 

We earlier summarized the results of sev- 
eral studies which suggest explanatory mech⸗ 
anisms for Fiedler’s model and discussed in 
detail some of this research when reviewing 
the problems associated with the model, TW 
reports in particular serve as the background 
for the hypotheses to be tested in the present 
study. 

Graham (1973) suggested that the LPC 
measures a predisposition to respond behav- q 
iorally in certain consistent ways to specific 3 
situations, the predisposition being “rigg 2 
gered” or activated by the social environ- 4 
mental factors inherent to those specific situ- 
ations. This argument seems related to the 
attitudinal analysis approach of Fishbein et 
al. (1969) discussed above. 
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Our thought here is related to both of these 
pproaches but has a different focus. We 
uggest that the environment—the situation 
seli—may moderate not only the LPC-per- 
ormance relationship but the relationships 
etween LPC and other psychological mea- 
ures as well. This proposition suggests that 
ither LPC or the other measures (or both) 
ire labile. Such reasoning is quite different 
rom that proposed by Fiedler (1974), who 
usserts the long-term stability of LPC and 
seems relatively unconcerned about relation- 
ships between LPC and any variables other 
than performance. Our primary hypothesis 
in the present study is, then, that the rela- 
tionship between LPC and certain psychologi- 
cal measures is also contingent on the situa- 
tion. 
We can test this hypothesis by examining 
correlations between LPC and a variety of 
psychological measures, for a set of leaders, 
while anticipating no significant results. The 
sample can then be divided into as many situ- 
ation types as exist in terms of Fiedler’s 
(1967) definitions, and the correlations re- 
calculated for each subsample. The hypothesis 
is that the subsamples exhibit markedly dif- 
ferent correlations between LPC and the psy- 
chological measures used. Examination of the 
patterns of relationships found in the sub- 
samples might then provide greater under- 
standing of what it is that the LPC instru- 
ment measures. 

Does LPC predict performance? Since there 
exists the opportunity, in the above design, 
for examining relationships between LPC and 
performance and since there does exist a need 
for further validation studies to determine 
more precisely the validity and utility of the 
contingency model, it seems appropriate to 
ask this question as well. This is, however, a 
secondary concern in the present investigation. 


METHOD 
Setting 


The present investigation is one part of a larger 
study concerned with the psychological and organi- 
zational correlates of performance and effectiveness 
among offices of a nationwide brokerage firm (Tay- 
lor, 1972). The total sample consisted of 40 offices 
in the United States. Data from 34 offices were 
suitable for the present analysis. Offices are dis- 


tributed nationally, and the reflects this dis 


persion as well as a bimod ution of offices 
by size. There were 24 office aff sizes ranging 
between 17 and 25 account < s, with a mean 
of 20, and 16 offices with sta ranging between 


26 and 49, with a mean of 3( 


Psychological Measures 


All measures were self-ac d as part of 
a mail survey package com office man- 
agers and account executives anagers com- 
pleted the LPC instrument as a number 
of other psychological meas uding: (a) a 
shortened version of the Cali Scale (Robin- 
son & Shaver, 1969); (b) a s version of the 
Rokeach Dogmatism Scale hi & Powell, 
1965); (c) Rotter’s (1966) f internal-ex- 
ternal control attribution; (% lowe-Crowne 
Social Desirability Scale (Cr: rlowe, 1960); 
(e) Miles’s Trust Scale (Mile ie, 1968); (f 
Schroder’s Sentence Completi for Cognitive 
Complexity (Schroder, Drive: ufert, 1967); 
(g) a modification of Scott's gnitive com- 
plexity instrument (Taylor, 1‘ ı measure of 
openness with information de m the Taylor 
(1972) instrument; and (% a conflict man- 
agement style instrument (Ta 2) similar to 
that developed by Lawrence usch (1969). 
Copies of all instruments use be found in 


Taylor (1972). 

The less familiar Taylor instr 
brief description. The cogniti 
asks the respondent to list i 
portant to the respondent’s wor! 
is designated with a letter. The pondent then 
groups the letters on a separate according to 
common item characteristics, indie s the common 
characteristic for each group. provided for 
up to 26 items and up to 12 gro af letters, This 
procedure was repeated for fin bers and for 
customers. Complexity is defi ihe number of 
items which are assigned uniq ofiles of char- 
acteristics in the grouping proc 

The original lists of securities, members, and 
customers could be removed fr e questionnaire 
by the respondent. Since each item was coded wit 
a letter and since the measure requires only the list 
of groupings, the initial list with items identified was 
not needed. Respondents were instructed to remove 
these initial lists if they wished to do so. If the re- 
spondent removed the list, this was scored as being 
“closed with information.” Leaving the list in 5 
questionnaire was scored as being open.“ 

The desired conflict management style z 
uses precisely the methodology developed by 1790 
rence and Lorsch (1969). Respondents rate eac! a 
30 proverbs on a 5-point scale ranging from 1 05 
very undesirable way to handle disagreements 30 
conflicts) to § (a highly desirable way). — 
items were previously shown to form five 1 
(Taylor, 1972). Individuals are scored on pre 
for confronting, smoothing, compromising, force, 


require some 
onity measure 
ecurities im- 
h item listed 


instrument 
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wi al by summing the item scores which com- 

í pris of these factors. Note, then, that the cate- 
gory sures are independent, and an individual 
respondent could, conceivably, attain any combina- 
tion of high- and low-preference scores on the five 
fact 

| Situ nal Measures 

l 7 ce situational variables of primary signifi- 

ca the contingency model are (a) leader-mem- 

ber ins, (b) task structure, and (c) leader posi- 

l tior r, Items from the Survey of Organizations 
Qu sire (Taylor & Bowers, 1970) were used to 
me a and c. We should note that the type of 
leac ember relations measure used has varied 
wic n studies of the contingency model, for ex- 
am ociometric questionnaires, satisfaction with 
leac ales, and leaders’ or group members’ ratings 
of p atmosphere. Results obtained using leader 
ver zroup member ratings have in some cases 
sho onsiderable differences (Fiedler, 1967). These 
resu however, have generally involved leaders’ 
res} es on a group atmosphere scale which 
Mc ion (1972) has noted bears considerable re- 
sem e to the LPC measure. To avoid any 
met! logical confounding, the measure used here 
wa point rating of satisfaction with supervisor, 
as re ed by office members. This item yielded a 
considerable range of results, office means going from 
2.3) w) to 4.83 (high), with 3.88 being the over- 
all office means. For situational definition, 
sec 3.60 were designated low, which indi- 
ca r leader-member relations, while scores of 
369 and above were designated high, which indi- 
exter! er-member relations. This “split” 
G chosen such that the low-satisfaction 
cfi be certain to fit this category. That is, 
the . mean of 3.88 represents, in all likelihood, 
some ree of upward bias in responses. The 3.60 
divisic: thus represents a conservative judgment by 


the re «archers. An extensive empirically based dis- 
cussic of this issue can be found in Taylor and 
Bowen: (1970; see, particularly, chap. 5). 

The task was the same for all subordinates in all 
offices: ‘he trading of securities for clients. Applica- 
tion, by the authors, of the four criteria used by 
Fiedler (1967, p. 28), which were developed by Shaw 
to evaluate the degree of task structure, indicated 
the task to be relatively unstructured. 

Leader position power was measured by office 
members’ responses to the Survey of Organizations 
Questionnaire item, “How much say or influence does 
each of the following people have on what goes on 
in this office?” followed by separate listings of each 
Position by organizational level: top managers, office 
manager, operations manager, and account executives. 
Each position was rated on a 5-point scale, ranging 
from 1 (little or none) to 5 (a very great deal). 
f s item seems to provide a reasonably good mea- 
Sure of the structural or positional distribution of 
_ Power in an organization (Tannenbaum, 1968). All 
Office Managers were reported by subordinates to 


have relatively high influence, the range being from 
343 to 5.00, with a mean of office means equal fo 
435, 

Fiedler model calls for a high or low dichotomized 
catagorization of leader position power, These data 
indicate that while some office managers are, indeed, 
relatively higher or lower in position power than 
others, none could reasonably be considered to be in 
the low category. It was thus concluded that office 
manager position power was high in all cases. 

The above definitions of the three contingency — 
model situational variables split the 34 offices into 
two groups, fitting Situation Types 3 or 7 of the 
model, differing only in leader-member relations: 
good-Situation Type 3 and poor-Situation Type 7. 
For the rather favorable situation (Type 3), low- 
LPC leaders are predicted to be more effective than 
leaders with high-LPC scores, while for the un- 
favorable situation (Type 7) the reverse is true. The 
specific LPC-performance correlations predicted by 
the model are —.29 and +.26, respectively (Fiedler 
& Chemers, 1974). We should note, however, that 
the present study involves groups which fit more 
closely with the definition of coacting groups, “groups 
whose tasks . . involve virtually independent per- 
formance by the group members, even though there 
might be some aggregate team score [Fiedler & 
Chemers, 1974, p. 87].” For such cases, data are 
sparse for Situation Types 3 and 7 but suggest that 
the predicted correlations might be even larger 
(Fiedler, 1971c), 


Performance Measures 


formance ratio,” defined as the ratio of the sum of 
commissions actually earned by all account executives 
in the office to the sum of commissions which would j 
have been earned if each account executive had per- 
formed according to the average for persons with the 
same amount of experience in the firm, In addition, 
a measure of two-year improvement in performance 
ratio was available. Both measures were taken from 
organization performance records. 


a 
The measure of leader effectiveness was a “per- $: 
1 


RESULTS 


Before examining specific correlational re- 
sults, we must be certain that differential 
effects for the two subsamples could not be 
produced by some selection bias involving the 
predictor variable, LPC. The mean LFS 
scores for office managers in the two situa- — 
tions were virtually identical: 61.2 and 62.1 
for Situation Types 3 and 7, respectively. We 
may, therefore, conclude that high- or low- — 
LPC leaders were not, in some biased way, 
selected into one or the other situation type. 
A second concern is whether the moderator 
and predictor variables are related in any 
way which might bias the predictor-moder- 
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TABLE 1 


CORRELATIONS BETWEEN LEAST PREFERRED CO-WORKER AND OTHER M 
AS MODERATED BY SITUATIONAL FACTORS 


M. Sasuxin, F. C. TAYLOR, AND R. C. TRIPATHI 


Correlation between LPC and other 
Uther measure nes ae Sa Absolute 
6 = 34) ‘ype 3 ype lifference 
(n = 22) (n= 12 3 versus 7) 
Authoritarianism +.12 +13 +.24 Al 
Dogmatism +.01 +.09 —.12 .21 
Defensiveness (Marlowe-Crowne) +.00 —.05 +.16 2¹ 
Internal control +.17 +.14 +.35 21 
Trust in subordinates +.09 +.08 +.18 10 
Cognitive complexity 
Sentence Completion Test —.02 +.04 — 16 20 
Classifying test (Taylor) 
Securities (nonsocial) +.18 +.45 —.03 48 
Firm members (social) +.40* +.63** —.32 .95* 
Customers (social) +.31 +.37 —.20 57 
Openness with information 
Securities —.18 — .46* +.55 i.01** 
Firm members —.09 —.34 1.74 1.08** 
Customers —.23 —.42 +4.85" 127 
Desired conflict management style 
Confronting resolution +.14 +.03 +.43 40 
Smoothing resolution +.13 —.02 +.55* 57 
Compromising +.03 18 ＋ 70% .88** 
Force —.34 —.30 —.48 18 
Withdrawal +.21 +21 +.24 03 
Satisfaction with office manager —.01 —.01 +.57* 58 
Performance 
Current year +.09 +.07 +18 il 
Two-year improvement +.10 —.05 +.34 39 
Other studies* 
Interacting groups —.29 +.26 59 
Coacting task group? —.32 +.87 1.19 
_ 


Note. LPC = Least preferred co-worker. 
poata els from Fiedler and Chemers (1974). 


7 <.05. 
**p <.01. 


ator-outcome variable relationships. Again, 
such an effect can be reasonably ruled out, 
since the correlation between office managers’ 
LPC, the predictor, and office members’ satis- 
faction with leader, the moderator, was essen- 
tially zero for the 34 offices (r = +.005, ms). 


Having excluded the possibility of such sam- 


pling bias effects, we may proceed to examine 
our results for the hypothesized moderator 
effect. 


_ Situational Moderator Effects 


Table 1 presents the correlations between 
LPC scores and the scores on the various psy- 


chological measures used, for the total sample 


of offices and for the two subsamples. As call 
be seen, none of the measur showed any 
overall relationship with LPC except for cog 
nitive complexity, results conforming to those 
obtained by Mitchell (1970), and the prefer- - 
ence for a forcing conflict management style 
by low-LPC managers. Of these results, only 
social cognitive complexity as measured 
categorization profiles of firm members Was 
statistically significant. 

The correlations for each sub e i 
ever, show a pattern of significantly a 
relationships for some of the measures. bew 
only 7 of the 34 correlations were, of : 905 
selves, significant, the best direct test o 


sample, how- 


ITUATIONAL MODERATING EFFECTS 


ype which predicts significantly dif- 
leren lations between LPC and the other 
psyc} cal measures for the two subsam- 
ples, omparison of the pairs of correla- 
tions ilts showed that these differences 
betw pairs were significant in 5 of 17 
pase: the 12 remaining pairs of correla- 
tions hypothesis was directionally con- 
firme six cases, in that the correlations for 
Situ Types 3 and 7 were signed op- 
posit Overall, then, for 11 of 17 measures, 
we Í | opposite, and in 5 cases, significantly 
differ. .t, correlations between LPC and the 
part ır measure in the two situations. 


W hould note that two of the measures 


used uthoritarianism and dogmatism, are 
shor orm versions of the standard instru- 
men and might be considered inadequately 
vali ed. In two other cases, internal control 
and gnitive complexity/nonsocial, an analy- 
sis carious population statistics for the two 
sub: ples showed these two situation type 
gro. to be nonequivalent. Thus, these com- 
pari ns are somewhat questionable. Neither 
of e factors, however, seriously affects the 
bas outcome of the present analysis. We 
cou. say that of 13 comparisons, dropping 
the - in question, our hypothesis was sup- 


port: | in 9 cases directionally, and the op- 
posiie-signed correlations were significantly 
different in 5 of the 13 cases. 

Clearly, the strongest support for the hy- 
pothesized moderating effect came from three 
4 openness with information, (b) 
cy complexity, and (c) preferred con- 


flix xagement styles. We can see that on 
all ` openness measures, greater open- 
hess «as associated with lower LPC scores in 
Situs n Type 3, while the reverse was true 


in Situation Type 7. For the social measures, 
greater cognitive complexity was associated 
with higher LPC scores in Situation Type 3 
but with lower LPC in Situation Type 7. 
Desired conflict management style showed 
little relationship to LPC in Situation Type 3, 
with the exception of a tendency for high-LPC 
managers to avoid expressing a preference for 
force and an even weaker tendency to avoid 
expressing a preference for compromise. How- 
ever, for managers in Situation Type 7, LEC 
Vas rather clearly related to expressed prefer- 
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ence of conflict management style. Specifi- 
cally, in Situation Type 7 environments, 
high-LPC office managers perferred compro- 
mise, smoothing resolution, and confronting 
resolution and preferred to avoid force. 

We should also note one unexpected result: 
Satisfaction with office manager, our moderat- 
ing variable, showed no relationship to LPC 
either overall or in Situation Type 3 but was 


positively and significantly correlated with < 


LPC for the 12 Situation Type 7 offices. 


Least Preferred Co-worker Score and 
Performance 


Our secondary concern was for a test of 
the hypothesis generally investigated by 
Fiedler and his associates concerning the re- 
lationship between LPC of the leader and 
group performance, for the two situations 
identified earlier. Fiedler’s recent summary 
analyses (1971c; Fiedler & Chemers, 1974) 
showed a median correlation for interacting 
groups in field studies of —.29 for Situation 
Type 3 and +.26 for Situation Type 7. A 
single study of coacting task groups yielded 
correlations of —.32 for Situation Type 3 and 
+.87 for Situation Type 7. The correlations 
obtained in the present analysis, using per- 
formance ratio as the outcome variable, were 
+.07 and +.18 for Situation Types 3 and 7, 
respectively. Neither of these results attained 
or approached statistical significance, although 
the result for Situation Type 7 was in the 
predicted direction and reasonably approxi- 
mated the outcome predicted on the basis of 
field studies (r = +.26). 

When performance was defined as improve- 
ment over a two-year period, the results were 
even closer to the predictions of the con- 
tingency model: —.05 for Situation Type 3 
and +.34 for Situation Type 7. While, again, 
not statistically significant, the directional 
conformance with Fiedler’s reported results 


was greater in both cases, and the magnitude 
of the correlation for Situation Type 7, ＋. 34, 


versus a prediction within the range of from 
＋ 26 to +.87, provides some support for the 
predictive validity of the contingency model. 


Discussion AND CONCLUSIONS 


We are now able to attempt to answer the 
two questions postulated for this study: (@) 


9, Soi ES Ad) bes aml 


i 
; 
E 
X 
; 
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What does the LPC instrument measure? and 
(b) Does LPC predict performance in the 
situations studied? 

We have reviewed and summarized Fiedler’s 
beliefs about what the LPC measures (Fiedler, 
1971a; Fiedler & Chemers, 1974) and noted 
some alternate explanations (e.g., Fishbein 
et al., 1969; Graham, 1973). Our results sug- 
gest a rather different interpretation from any 
proposed to date. 

The data in Table 1 suggest rather strongly 
that our hypothesis—situational variables 
moderate the relationship between LPC and 
other psychological measures—is at least on 
target, if not fully confirmed. Most consistent 
in exemplifying the present hypothesis were 
the information openness results; all three 
measures, which showed no initially signifi- 
cant relationship with LPC, yielded signifi- 
cant differences when the situation was taken 
into account. The cognitive complexity mea- 
sures followed a similar pattern, as did con- 

flict management preference, with six of these 
nine measures yielding differential results con- 
sistent with the hypothesis. 

Tn conclusion, it has been shown that situa- 
tional factors do, in some cases, moderate the 
relationship between LPC and other psycho- 
logical measures. Before considering the impli- 
cations of this finding, we might briefly specu- 
late as to why our results were not even 
stronger and more consistent. 

Perhaps the most obvious factor is the 
situational moderator used, office members’ 
satisfaction with the office manager, which re- 
flects leader-member relations. While the 
nature of leader-member relations has been 

repeatedly shown to be a significant organi- 
zational variable (Argyris, 1957; Bowers & 
Seashore, 1966; Kahn, 1958; Likert, 1961; 
Morse & Reimer, 1956), it is surely not the 
only such factor and probably not always the 
most important of potential situational mod- 
erators. In other words, the larger issue at 
this point seems to be the determination of 
what the most significant moderating vari- 
ables are. This same point has been made by 
Fiedler and Chemers (1974) with respect to 
situational measures for best defining the con- 
tingency model situation types. Assuming that 
Y further research results in some general list 


M. SasnkIN, F. C. Taylor, AND R. C. TRIPATHI 


of these moderating fact: 
the implications of our re 
pretation of the LPC measu 
We have already conclu: 
in certain situations and to 
dict leader effectiveness i! 


us consider 
r the inter- 


LPC does, 
legree, pre- 
s of group 


performance (Fiedler & Ch 1974; Sash- 
kin, 1972). The present adds some 
slight weight to such a c n, at least 
for Situation Type 7. T! rmation is 
greater when performance vement is 
considered rather than cur erformance, 
Thus, the situation modera lationship 
between LPC and performa en the fact 


that LPC is, it would seen 
lated to any other psycholog 
cept perhaps cognitive 
reasonable, and perhaps es 
find that the situation cai 
relationships between LPC : 
logical measures. 

Since the LPC-performan 
strongest for Situation Typ: 
examine, in more detail, the 
tween LPC and other psycho! 
for this situation. The high 
been shown to be somewha’ 
than the low-LPC leaders. T? 
open with information and ! 
ferentiating. As compared 
leaders in this situation, high-! 
agers prefer relatively non 
management styles—compro! 
resolution, and confronting 
force. Finally, although Situa 
defined by low satisfaction 
manager, we see that even“ 
viously narrowed range of satisfaction scores 
(2.31 to 3.59) there is a strong positive rela- 
tionship between satisfaction and office man- 
agers’ LPC scores, These results suggest that 
the high-LPC office manager in this moder- 
ately unfavorable situation is, as Fiedler 
(1971a) asserted, behaviorally more relation- 
ship oriented than the low-LPC leader. 

In looking at Situation Type 3, we m 
naturally expect to find just the reverse Pa 
tern, since high-LPC leaders were predicted 
to be more effective than low-LPC office 1 
agers. Recall, however, that this was not t 5 
case. In this study, the correlation betwee 


uely unre- 
asure (ex- 
ty), it 
prising, to 
derate the 
er psycho- 


tionship is 
let us first 
onships be- 
measures 
gaders have 
- effective 
also more 
socially dif- 
low-LPC 

` office man- 
cive conflict 
smoothing 
ylution—over 


uon 


ight 


SITUATIONAL MODERATING EFFECTS 


ce and LPC in Type 3 situations 


per! 
was tially zero. Note, then, that while 
the ; n of relationships between LPC and 
the measures (see Table 1) for Situa- 
tion e 3 is reversed for openness and 
cog complexity, this is not the case for 
the ict management preference measures. 
In situations, low-LPC office managers 
did prefer smoothing or confronting to 
an) ter extent than did high-LPC leaders 
and wed only a slight tendency to prefer 
con nise, Furthermore, low-LPC leaders 


owed greater preference for forcing in 
situations. And, finally, we see that in 
ation there was no relationship be- 
LPC and satisfaction with the leader. 
iclusion, the fact that low-LPC office 
sers in this situation (Type 3) were not 
or to high-LPC leaders could possibly 
e to an inadequate relationship orienta- 


at, then, does the LPC score mean? We 
t definitely answer this question, given 
-esent data. Our results would indicate, 
as earlier suggested, that some of the 
ures employed are labile. Whether this 
as that LPC can change over situations 
or ‘hat the other psychological measures ex- 
hibit such shifts is a question for further re- 
seo rch.® It does, however, seem that our results 
support rather strongly the inference that 
effectiveness is related to behavioral differ- 
ences among leaders which, in certain circum- 
stances, are very similar to the expectations 
cf leader behavior which can be derived from 
tlie contingency model (Fiedler, 1971a; Fied- 
ler & Chemers, 1974). 

in conclusion, we would suggest that LPC 
may have no meaning outside of situational 
contingencies. Leaders’ LPC score—the de- 
gree of esteem reported for some particular 
least preferred co-worker—may have a more 
general meaning only in the context of actual 
situations. It is in these specific, defined situ- 
ations that LPC is a potentially critical and 
useful link between the leader, as defined by 
Psychological and behavioral measures, and 
the performance of the group. 

5A recent study by Stinson and Tracy (1974) 
showed far greater instability in LPC scores than 
had been assumed on the basis of earlier research 
(Fiedler, 1967). 
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elements, 


ita obtained through job analysis pro- 
res are useful for several functions re- 
to personnel management, such as in the 
e lishment of personnel specifications, 
ving, job classification including the es- 
shment of wage and salary rates, and job 
gn. Conventional essay types of job de- 
ptions can well serve useful purposes in 
nection with such functions. However, 
some personnel management functions and 
research purposes supporting such func- 
ns, some types of standardized job analy- 
procedures seem to be particularly desir- 
ble. Such a procedure preferably should 
(ulfill certain conditions; it should (a) have 
a sound theoretical base, with a dear defini- 
tion of the nature of the analysis; (5) pro- 
vide a standardized approach applicable to a 
vide range of jobs; (c) permit statistical 
quantification of the data in terms of relevant 
units of job data; (d) lend itself to reliable 
usc; and (e) be economically feasible. 
A review of the relevant German literature 
revealed no such job analysis procedure 


„ moe aes 


1 The initial research with the Position Analysis 
Questionnaire (PAQ) was reported in a monograph 
by McCormick, Jeanneret, and Mecham (1972). 
The authors of this study wanted to find out 
whether it would be possible to use a German form 
of the PAQ under German working conditions. The 
study was financed by the Bundesanstalt fiir Arbeit 
(Federal Republic of Germany). 

2 Requests for reprints should be sent to Ekkehard 
Frieling who is now at Memminger Strasse 14, 
University of Augsburg, 8900 Augsburg, Germany. 


SOME RESULTS WITH THE GERMAN FORM OF THE 
POSITION ANALYSIS QUESTIONNAIRE (PAQ)' 


E. FRIELING,’ W. KANNHEISER, axo R. LINDBERG 
University of Repemiburg, Regensburg, Germany 


is study involved the development of a German form of the Position 

alysis Questionnaire (PAQ) to serve as a structured job analysts procedeare, 
1 Phase 1, independent translations of the PAQ, made by thie groupe of 
poychologists, were incorporated inte the PAQ Bd (the German edition). 
Thirty trained graduate students analyzed 61 positions with the PAQ Bd. 
Average interrater reliability was .79; average reliability of the individeal job 
clements was .65. In Phase 2, 70 industrial paychologists rated the importance 
of 80 attributes, modified to include better definitions, for each of the 205 
job elements. Satisfactory levels of reliability were achieved. It seems possible 
to construct profiles of job requirements for positions 
Bd analyses of jobs in combination with the attribute profiles of the individual 


on the basis of the PAQ 


which might meet these conditions for use in 
the Federal Republic of Germany, However, 
the Position Analysis Questionnaire (PAQ), 
developed in the United States by McCor- 
mick, Jeanneret, and Mecham (1972), seems 
to meet certain of these conditions. The PAQ 
is a highly structured type of job analysis 
questionnaire, consisting of nearly 200 job 
elements (or items) of a “worker-oriented” 
nature (McCormick, 1959; McCormick, 
Cunningham, & Thornton, 1967), which 
yields information about job activities, their 
corresponding working conditions, and the 
specific situations which might influence the 
activities of workers.* 

The present study consisted of two phases: 


1. The translation and transformation of 
the PAQ into an adequate German form 
and the use of that form in the analysis 
of a sample of jobs, to evaluate its rele- 
vance as a job analysis instrument in 
the context of the Federal Republic of 
Germany. 

2. The development of a possible data 
base of ratings of human attributes 


required for the job elements of be 


PAQ that might be used in determining 


the psychological and physical require- — 


ments for jobs. 


3Sample job elements (without their more de- 


tailed descriptions) include: use of written materials, 
reasoning in problem solving, hand-arm manipula- 
tion, etc. 
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PHASE 1: INITIAL DEVELOPMENT AND 
EVALUATION 


German Form of the PAQ 


The initial research with the PAQ reported 
by McCormick et al. (1972) was based on 
Form A. Form B was used as the basis of the 
translation. Although reliability data are not 
available for Form B, this form presumably 
incorporates a number of improvements over 
Form A. The German edition is referred to 
as PAQ Bd. Because of linguistic, educational, 
and cultural differences between the United 
States of America and the Federal Republic 
of Germany, the following modifications 
seemed desirable: (a) Certain modifications 
of the instructions. (b) Modification of cer- 
tain rating scales. Two of the six types of 
rating scales used in rating the job elements 
Importance“ and “Extent of Use“) had to 
be translated rather freely to provide sub- 
jectively equal distances between the steps of 
the scales. (c) Modification of certain job 
elements (or items). Because of cultural dif- 
ferences certain job elements (46, 47, and 48) 
were materially altered. (d) Addition of new 
items. On the basis of pilot studies, it seemed 
advisable to add certain new job elements, 
such as “knowledge of foreign languages” 
(which is an important requirement for many 
jobs in Germany). (e) Elimination of job 
elements. The “income/pay” items were 
eliminated, along with four “open-ended” 
items that required write-in responses. 

On the basis of these modifications, three 
different groups of psychologists from the 
“Bundesanstalt für Arbeit” in Nürnberg, Ger- 
many, the “Forschungsinstitut fiir praktische 
Psychologie” in Salzburg, Austria, and the 
first author (University of Augsburg) trans- 
lated the PAQ independently of the other 
groups. The resulting translations were then 
discussed and incorporated into the PAQ Bd. 


Sample of Positions Analyzed 


Sixty-one positions were analyzed with the 
PAQ Bd. According to McCormick, very 
_ heterogenous samples of positions should be 


This translation of the PAQ Form B (McCor- 
mick, Jeanneret, & Mecham, 1969a) was authorized 
_ by the Purdue Research Foundation (Nov. 1970). 
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used in the comput: 
the PAQ in order t 
of inclusion of job 
values of the various 
numerically limited s 
sible in all cases. Ne 


he reliability of 
the probability 
resent all scale 
ments. In our 
is was not pos- 
, efforts were 


made to include as m rent types of 
positions as possible, i for example, 
those from different the electrical 
industry, the Federal the Federal 


Post, the chemical indu 

industry, and several tr 

the ratings on most job 
out over the entire rang 
0 to S. 


glass working 
t turned out, 
s did spread 

values from 


Analysts and Analysis 


Thirty analysts, rand mbined in 
pairs, analyzed these 61 Based on 
the scheme by Heyns and (1953), a 
training course for these a was devel- 
oped. The sample consisted duate stu- 
dents of industrial psyche he Univer- 
sity of Regensburg. The ani procedure 
was the same as the one us y McCor- 
mick, Jeanneret, and Mechan 69b). Spe- 
cifically for most of the job sements there 


was an appropriate scale st 
tance”, “Extent of Use”, “I t 
special scale applicable to the spe 

ment; the analyst rated each such element? 
it related to a given position er job, using in 
most instances a 6-point scale from 0 (does 


not apply) to 5 (the highest scale value). 
otomous na- 


Several elements were of a di 
ture; in such instances the analys 
the element as “0” (does not 
(does apply). Each analyst 
equivalent of about $12 (American). 


Reliability of Analyses 

In order to be able to evaluate the reliabil- 
ity of the analyses, two analysts analyze 
each position. The correlation of the ratings 
of these pairs of analysts would provide in- 
formation about their consistency. With this 
procedure it would be possible to compare our 
data with reliability data based on PAQ Form 
A, as reported by McCormick et al. (1960) 

One reliability analysis consisted of om 
lating the ratings of the job elements as mā 


i __—_---  —————— 


wo analysts of each pair who ana- 

given position, The correlation for 

sition was then made for the two 

cross all job elements. 

61 positions were analyzed by pairs 
pendent analysts, it was also possible 
ermine the reliability of the ratings for 
ob element. This was done by comput- 
correlation coefficient for each job ele- 
riving a total of 207 coefficients. Be- 
the responses to certain items were of 
s ally a nominal nature, the reliability of 
th items was determined with the “Kappa 
ient” developed by Cohen (1960). 


ts 


e reliability coefficients for the individual 
of analysts ranged from .51 to .93, with 
ıverage reliability of .79 (computed 
s z-transformed correlations). In the 
ə of the reliability coefficients of the indi- 
yi ial job elements, the average coefficient 
65. Half of the coefficients were higher 
61; the coefficients for 23 job elements 
wc insignificant (i.e., less than .35). 


cussion 


Che average reliability coefficients of the 
pairs of raters (.79) were identical with those 
reported by McCormick et al. (1969b). With 
respect to the reliability coefficients of the 
individual job elements, McCormick et al. 
found an average coefficient of .80 which is 
considerably higher than ours (.65). 

Although our interrater reliability coeffi- 
‘cients corresponded with those reported by 
McCormick et al. (1969b), there is some 
question as to why the reliability coefficients 
of the individual job elements in our study 
were somewhat lower than those reported by 
McCormick. It is of course possible that 
these differences are due to differences be- 
| tween PAQ Form A and Form B, although in 
McCormick’s opinion Form B should be at 
least as reliable as Form A. On the other 
hand, the PAQ Bd might not be as reliable 
as the PAQ Form A because of the transla- 
tion. In addition there might be intercultural 
differences with respect to the content of sev- 
eral terms included in the American or Ger- 
man items. However, the results of a later 
Pilot study covering 32 positions indicated 
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that certain modifications in the content and 
form of those low-reliability job elements 
increased overall reliability. In this study, 
the average reliability of the job elements 
was .78. Although the size of the sample used 
in this pilot study had to be limited, it seems 
probable that the reliability of the job ele- 
ments might be increased further by addi- 
tional modifications and the use of a larger — 
sample. 

In general terms, the trial use of the PAQ 
Bd was sufficiently encouraging to suggest 
that it might well serve as a structured job 
analysis procedure for various purposes, espe- 
cially since it appears that some further 
modification of selected items might enhance 
its reliability in use. 


PHASE 2: DEVELOPMENT oF Data BASE OF 
ATTRIBUTE PROFILES or Jon ELEMENTS 


The results of the first phase encouraged — | 


the investigators to attempt to derive esti- 
mates of the requirements (in terms of cer- 
tain psychological and physical attributes) of 
the individual job elements of the PAQ Bd. 
The method suggested by McCormick et al. 
(McCormick, 1959; McCormick, Cunning- 
ham, & Thornton, 1967; McCormick, Jean- 
neret, & Mecham, 1969b; Mecham & Mc- 
Cormick, 1969) seemed likely to serve as an 
appropriate procedure for obtaining estimates 
of the attributes required for the approxi- 
mately 200 individual job elements. This 
method provides for experts to rate the rele- 
vance of each of certain attributes to the in- 
dividual job elements (see Mecham, 1968; 
Mecham & McCormick, 1969). In contrast to 
the procedures followed by Theologus, Ro- 
mashko, and Fleishman (1970) and Theo- 
logus and Fleishman (1971), who sought to 
determine job requirements for relatively com- 
plex tasks through use of task anchored abil- 
ity rating scales, McCormick’s concept di- 
vides positions and tasks into “elements” for 
which requirements are estimated. Since it is 
believed that these elements are easily grasped 
and relatively unequivocal, it may be sup- 
posed that overly global ratings might be 
avoided. The ratings may be combined with 
the results of the PAQ analyses of a given 
position in order to construct or build up a 
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profile of the total job requirements for that 
Position. This profile is based on the consoli- 
dated requirements of the job elements that 
comprise the position, This procedure pre- 
sumably would be quite economical. 
The following part of this study describes 
the selection and rating of the attributes as 
related to the job elements of the PAQ Bd. 


Selection of Attributes ë 


The list of attributes used in this phase 
was one developed by Wenninger, Kann- 
heiser, Frieling, and Hoyos (1971), which 
contained 59 physical and psychological at- 
tributes. In developing this list, an extensive 
analysis of the relevant literature had been 
carried out. More than 400 sources were ana- 
lyzed for information about definitions of the 
attributes and attribute measuring devices; 
most of these sources were factor-analytic 
studies concerned with the analysis of simple 
and complex perception, psychomotor skills, 
physical abilities, intelligence, temperament 
traits, and interests. 

The selection of the attributes for use in the 
second phase of the study was carried out 
with special consideration of the following 
points which seemed extremely important: 
(a) Attributes taken into the list had to have 
Positive correlations with successful perform- 
ance in different jobs, as determined in pre- 
vious studies. (5) To meet the criterion of 
“measurability” and “methodological clear- 

ness“, only attributes based on the results of 
factor-analytic studies were taken into ac- 
count. (c) The authors also tried to obtain a 
high degree of applicability of the attributes 
by defining them as clearly and precisely as 
' possible. The attributes were defined in rela- 
tively close relationship to the primary litera- 
ture. In contrast to the attributes of the list 
Proposed by Mecham (1968), special expla- 
nations and measurement instruments similar 
to those provided by Theologus, Romashko, 
and Fleishman (1970) were given, 

A comparison should be made of the 59 
attributes used in this study as compared with 


© Sample attributes include: perceptual attributes 
(spatial orientation, flexibility of closure), physical 


1 9 (body strength), memory (visual memory), 
etc. 


E. FRIELING, W. KANNHEISER, AND R. LIND: 


the 68 used by Me cCormick, Jean- 
neret, & Mecham Iecham, 1968). 
Mecham’s attribute n two classes, 


namely those of an a 
the types of qualities 
aptitudes) and those 


ture (including 
y considered as 
ational nature 


(those characterizing as to which 
workers need to adjust vorking alone 
or adapting to repetitiv: ions). Of the 
59 attributes used in nt study, 47 
corresponded essentially ose used by 
Mecham, although the s of certain 
of these were modified. f Mecham’s 
aptitudes were omitted, others were 
added.“ Most of the 3, however, 
related to Mecham’s si attributes. 
Since these presumably lications in 
terms of personality cha s (in the 
sense of the worker havi ust to the 
work situation), some of | e replaced 
with more specific tempera d interest 
traits. It should be noted thai 1e compari- 
son of the two sets of attribut ı a limited 
number of cases one attribute ne set was 
considered to embrace two in other set, 
and vice versa. Therefore a si ne-to-one 
comparison is a bit misleading, 
Selection of Raters 

One hundred and thirty-five icustrial psy- 
chologists were chosen at random: from the 


ciation of 
hese psy- 
> as a rater 


membership list of the German 
Industrial Psychologists. Each 
chologists was asked to participat 
in our research project. Of 135 
76 psychologists were willing to c 

27 refused to cooperate; 31 did not answer at 
all. From the 76 cooperative psychologists, 70 
were randomly selected as raters. Of these 
raters, 37% worked as personnel managers, 
20% taught industrial psychology in Gere 
universities, and 16% were employed in de- 
partments for the selection and placement of 
industrial personnel. The rest (18%) of the 
raters worked in different areas of industrial 
psychology and related fields. Each rater was 
paid the equivalent of about $100 (Amer 
can). 


i the 
o Specific changes may be obtained from 
authors. 
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R Procedures 

e able to carry out the ratings, each 
ra vas provided with the following rating 
m ils: (a) a general introduction to the 
p e of the study; (5) detailed instruc- 
ti (c) a copy of the modified PAQ; (d) 
1 ing manuals with detailed definitions of 
t ttributes which were supposed to be 
r and several explanations; and (e) a 
s} inventory for the recording of socioeco- 
n data. 


h rater had to rate the importance of 
ea of 10 attributes with respect to the 205 
jo elements on a 6-point scale ranging from 
west importance) to 6 (highest impor- 
ta: e). This rating scale seemed an adequate 
tr. -lation of the one used by Mecham (1968, 
p. 9%). 
itable action was undertaken to restrict 
th influence of systematic errors in the rat- 
in »rocess as much as possible by the follow- 
in procedures: (4) The 70 groups of attri- 
I materials (10 attributes per group to be 
1 i by each of the 70 raters) were assigned 
tt raters on the basis of a latin square de- 
si to eliminate the effects of sequence. (0) 
A ributes that seemed to be logically related 
tc each other were always rated by different 
re:ers to minimize the proximity error. (c) To 
avoid the logical error, definitions and expla- 
nations about the attributes were described in 
detail to provide the raters with extensive 
knowledge about each attribute. IBM Type 
551 data sheets were used to record the rat- 
1 gs. Each rater had to finish his ratings 
_ w thin two months. 


| Results 


On the basis of the medians of the ratings, 
an “attribute profile“ for each of the 205 job 
elements of the PAQ Bd was computed. 
From these profiles, it could be deduced how 
important each of the 59 attributes had been 
judged for each job element. The medians 
were divided into two groups: 

1. Median < 3.5: the attribute is not 
important for the job 
element. 
the attribute is im- 
portant for the job 
element. 


2. Median > 3.5: 
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By summarizing the medians which were 
equal or greater than 3.5 for each attribute, a 
rough measure of the judged general impor- 
tance of this attribute for each of the job ele- 
ments of the PAQ Bd was available. 

To determine the reliability of the ratings, 
three methods were used: (a) The reliability 
of ratings for each attribute as related to 
each job element was computed on the basis 
of the method proposed by Finn (1970). 
These values may be regarded as measures of 
the reliabilities of the arithmetic means (and, 
on the assumption of a high positive correla- 
tion between arithmetic means and medians, 
as a reliability estimation for the medians). 
(b) The reliability with which all the raters 
(the number varied from 9 to 14) indepen- 
dently rated the attributes across all job ele- 
ments was computed on the basis of the intra- 
class correlation (Ebel, 1951; Winer, 1962). 


(c) The reliability of attribute ratings of job 


elements was computed for each job element 
across all attributes. Again the intraclass 
correlation method was used. These coeff- 
cients were based on anywhere from 9 to 14 
ratings, in most cases 10, 11, or 12. 

The complete matrix of medians and the 
other results are given in Frieling, Kann- 
heiser, and Hoyos (1973). The coefficients of 
reliability (by the method proposed by Finn) 
were all significant on the .05 level. The reli- 
ability coefficients of the attributes are sum- 
marized in Table 1. That table shows the 
numbers of coefficients in each of several class 
intervals, including those which generally 
corresponded to attributes used by Mecham 
as well as the mew attributes in the German 
version and of all those in the German ver- 
sion. For comparative purposes, the coeffi- 
cients obtained by Mecham are also given (in 
the first column). It can be seen that for the 
German version, 11 of the coefficients were .95 
or above, 31 were between .90 and .94, 13 
from .85 to .89, and 4 from .80 to .84. 


Discussion 


The attribute ratings generally reflect a 
satisfactory level of reliability, which is a 
prerequisite for their potential use as a basis 
for the construction of requirement profiles 
for individual jobs. They generally are compa- 


5 TABLE 1 
i Summary or RELIABILITY COEFFICIENTS OF ATTRIBUTE 
Rarixes or Jon ELEMENTS IN GERMAN VERSION OF 
THE POSITION ANALYSIS QUESTIONNAIRE ANC OF 
CORRESPONDING ELEMENTS IN THE 
‘ UNITED STATES VERSION 


| Attributes common 
to both versions 


German version 


Class in- wet 3 
terval of | = EW tee, 
wag aie German 5 
r ae coef- | Newat- | Allat- 
S cients | tributes | tributes 
95 15 9 2 11 
90 to .94 16 24 7 31 
85 to .89 11 10 3 13 
80 to .84 2 4 4 
78 to. 79 
70 to 74 1 
-65 to .69 2 
Total 47 47 12 50 


* Data from Mecham (1968) and McCormick, Jeanneret, 
and Mecham (1972). 


rable to those reported by Mecham. A 
comparison of the coefficients of the new 
individual attributes which deal with tempera- 
ment and interest characteristics (as con- 
trasted with some of the situational attributes 
of Mecham’s that were not included) may be 
interpreted as a justification of our deviation 
from Mecham’s procedure. 
The matrix of median attribute ratings 
_ (which can be thought of as reflecting the 
attribute profiles of the individual job ele- 
ments) is not given in the present study. An 
examination of it, however, shows that all 
_attributes contained in the list are relevant 
for at least some of the job elements. Thus it 
can be concluded that the list of attributes 
includes factors which may be relevant re- 
quirements for the job elements of the PAO. 
The definitions of attributes used seem to 
explain the function of each attribute so 
clearly that it was possible to avoid exces- 
_ sively global ratings. 


ne DoS we ae 


7 


D 


CONCLUSIONS 


The two phases of this study have shown 

that despite the several modifications made in 
the PAQ and in the attributes used with it; 
J the PAQ Bd yields results which are gen- 
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erally comparable to ti reported by 
McCormick et al. (1969b iability co- 
efñcients, for the most p led our ex- 
pectations. The results o ilot study 
indicate that unsatisfact ibility co- 
efñcients of the use of elements 
can be increased conside: modifying 
these items. The results o nd phase 
seem to allow several ter nclusions: 
(a) Experts are capable í the im- 
portance of human attrib dified in 
accordance with procedure hose de- 
signed by Theologus and , 1971) 
for elemental aspects of behavior 
with acceptable reliability wit cing in a 
real working situation. (b ited im- 
portance of single attributes spect to 
different job elements, varie: 

On the basis of these attri! ings (as 
reflected in the attribute prof e indi- 
vidual job elements of the PAQ t seems 
possible to construct plausible p s of job 
requirements for different inc jobs. 
This assumption may be strengt in sev- 
eral studies which are to be pu ed else 
where. In these studies it was ible to 
construct profiles of job requ nts for 
masons, signalmen (railway), electri- 
cians. 

The applicability of the PAQ P= to prob- 
lems of occupational classification bhas yet to 


be studied intensively. At this point we would 
expect that the PAQ Bd is applicable to these 
problems, especially as it seems »ssible to 
continue constructing profiles of job require- 
ments for positions on the basis of PAQ Bd 
analyses of jobs in combination with the at- 
tribute profiles of the individual elements of 
it. Thus a psychologically oriented occupa- 
tional classification may be developed. 

In summary, the dimensions of working a 
tivities in Germany seem to be comparable to 
the ones existing in the United States of 
America, since it is basically possible to use 
an American job analysis procedure success- 
fully in Germany. Cultural-specific influences 
seem to be minimal in this case. In general, it 
seems that occupational classifications para 
on instruments like the PAQ or the PAQ B 
might well serve as the basis for generating 


a!) statements about similarities of jobs in 
it countries. 
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SLEEP DEPRIVATION EFFECTS ON THE 
VESTIBULAR HABITUATION PROCESS 


PATRICK J. DOWD * 


United States Air Force School of Aerospace Medicine, 
Brooks Air Force Base, Texas 


The effects of sleep deprivation on habituation of the vestibular system in 


Í stressful situation were examined. One hundred forty three experienced pilots 
q were exposed to two tests of Coriolis acceleration after periods of sleep depriva- 
: tion (24-30 hours) or rest (6 hours). Nystagmus responses to Coriolis stimula- 
7 tion were recorded after four right-to-left tilts. Sleep deprivation resulted in 
x (a) increased sensitivity to Coriolis stimulation; (b) a decreased recovery 
A rate; and consequently (c) interference with the vestibular habituation process 
f The implications of sleep loss in increasing the hazards of flying are discussed, 
K Very little information is available on the sist of a slow phase in the opposit: ection 
= effects of moderate sleep deprivation on the of rotation with a fast phase in t} tating 
3 vestibular sensory system. Yet the interac- direction. Vertical nystagmus can b orded 
tions between sleep mechanisms and the and reduced quantitatively using simple 
vestibular system have relevance to a number two-parameter analog function amer, 
of areas of human performance involving sen- Dowd, Moore, & Collins, 1967). 
sitivity to motion stimuli. Particularly rele- Nystagmic responses to Corio! imuli 
vant is the problem of sleep loss and the per- repeated over days display atte: m of 
formance of aircraft pilots. sensitivity, with some habituation, :<bitua- 
Coriolis acceleration, induced by tilting the tion to a Coriolis stimulus means a iction 
head in the frontal plane while rotating about of nystagmic responses, subjective ations, 
a vertical axis, has been found to produce and vestibulo-autonomic reactions repeti- 
stress for both experienced pilots and non- tive stimulation. Studies (Dowd, 1 ) have 
flyers (Dowd, 1973). Coriolis is a sensation of indicated that vestibular sensitivity ich as 
rolling, banking, or pitching about a third axis the rate of recovery and the in‘: ity of 
orthogonal to the aircraft's turn axis and the nystagmus elicited by Coriolis acco tions, 
_ head-movement axis. Prolonged rotary stimu- is significantly modified by the efic: of fly- 
lation seems to overdrive the vestibular sys- ing experience. Such modifications are identi- 
tem resulting in compensatory eye movements, fied as habituation of the vestibula: system 


5 


N 


termed nystagmus. The eye movements con- 


The research reported in this article was con- 
ducted in the Enivronmental Sciences Division, 
United States Air Force School of Aerospace Medi- 
cine, Aerospace Medical Division, (AFSC), United 
States Air Force, Brooks Air Force Base, Texas. 
Further reproduction is authorized to satisfy the 
needs of the U.S. government. The voluntary in- 
formed consent of the subjects used in this research 
was obtained in accordance with Air Force Regula- 
tion 80-33. 
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Hartman and Dr. R. L. Cramer for their advice, to 
Francis A. Brogan and James Bower for their tech- 
nical assistance, and to Richard McNee for statistical 
analyses. 


Requests for reprints should be sent to Patrick J. 


_ Dowd, United States Air Force School of Aeros; 
dy pace 
Medicine/VNE, Brooks Air Force Base, Texas 78235. 


process. 

The purpose of the present study was to 
determine if moderate sleep deprivation of 
24 to 30 hours would modify the pilot’s vestib- 
ular habituation process by affecting the sen- 
sitivity and the recovery rate of the vestib- 
ulo-ocular responses to Coriolis acceleration. 


METHOD 
Subjects 


The subjects were 143 experienced Air Force flying 
personnel undergoing physical evaluations. 


Procedure 


Two Coriolis test periods were given to all sub- 
jects; 48 hours after the initial period all subjects re- 
ceived the repeat test. The subjects were divide! 
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into three ip A—44 subjects who were 
deprived < the first test and had rest for 
the repeat B—38 subjects who had rest 
for the fi ad no sleep for the repeat 
test; and 61 subjects who had rest for 
both tests jects labeled “fatigued” were tested 
after a s} rivation period of 24 to 30 hours. 
All subje i “rested” had at least 6 hours of 
sleep pric g 

A stanc testing procedure was used (Dowd, 
1966). S troencephalogram disc electrodes 
were me ıbove and below each eye, with a 
ground mounted on the forehead between 
the eyet Subjects were seated in the stimulator 


and in. (a) to keep their eyes open, (b) to 


refrain f anging their body or head positions, 
and (c) frain from blinking or talking. All 
stimulat vere conducted in a darkened room. 
Initially re the beginning of rotation, the subject 
was tilt the left, 30 degrees from the vertical. 
Rotatio always in a counterclockwise direction 
at 18 itions per minute. After 2 minutes of 
exposur constant velocity, the subject was tilted 
throug! rtical to 30 degrees to the right. This 
tilt was pleted in 3 seconds. After 2 minutes, the 
subject tilted again to his initial position. Al- 
ternatir ilts continued until a total of eight tilts 
Were co: pleted: four left-to-right and four right- 
to-left. 

Electr. tagmograms, recorded during each tilt 
and for seconds following the cessation of each 
tilt, w processed and statistically analyzed for 
each of e four tilts from right-to-left. The right- 
to-left : produces a sensation similar to that per- 
ceived | y experienced flying personnel during a roll- 


ing elin in a high-speed aircraft (Dowd, 1973). The 
left-to-r: “ht tilt stimulus was described by pilots as 
a danger. and unpracticed maneuver which they 
avoided in i ‘ht. Visceral disturbances were induced 
by the left-to-right tilt (Dowd, 1973). Thus, the 
more neutral s imulus, the right-to-left tilt, was used 
in the analysis of this study. 


Scoring 
The m 


ude of the nystagmic slow-phase devia- 
eyes in degree of displacement was ob- 
tained for each second following cessation of the 
four stimulus tilts. These were combined into average 
degrees per second for each subject. The average 
Values were then reduced to fractions of the reference 
signal and presented as arbitrary units per second, 
The exponent of the decay function was obtained 
by taking the logarithm of the average values of the 
slow-phase deviation in arbitrary units per second. 
The slope of the line generated from these data 
described the rate of recovery (8) from Coriolis 
Stimulation as a nystagmus decayed over time: The 


_ Sleeper the slope, the more rapid the recovery from 


this disorienting stimulus. 
Temporal variations in slow-phase velocity of the 
vertical nystagmus subsequent to stimulation of the 
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semicircular canals can be described by a negative 
exponential decay function: 


R. = Re, 


in which t equals seconds after cessation of stimulus, 
R: equals the response at time t, and Ro equals re- 
sponse at cessation of stimulus where t equals 0 
(maximum eye velocity), 8 equals a parameter of 
the system which governs the rate at which activity 
decays or the slope of the negative exponential func- 
tion. 

This relationship has been demonstrated repeatedly 
(Dowd, 1973). Inferentially the response during 
acceleration should develop according to: 


Rig =a S(1 — e” t), 


where S equals sensitivity of the system, tg equals 
any instant during acceleration intensity (a). The 
response (Ro) at the termination of three seconds of 
acceleration of this tilt and the onset of decay is: 


Ro =a S(1 — e). 


Ro and f can be determined from experimental 
data on decay. By substituting these values into the 
last equation, the product of the coefficients of ac- 
celeration intensity (a) and the vestibular sensitivity 
(S) can be determined. For brevity this term (a 8) 
is referred to as the sensitivity product. Also 8 is 
defined in terms of the maximum eye velocity (Ro) 
and the rate of recovery (8), so it is possible, for 
these experimentally derived values to affect the 
changes in the sensitivity coefficient (S). 

The dynamic characteristic (8) represents the 
decay of the response or rate of recovery from this 
stimulus, so the greater the numerical yalue, the 
more rapid the rate of recovery. The sensitivity 
product (œ S) describes the system’s susceptibility 
to this stimulus, and its numerical value increases as 
the sensitivity increases, with acceleration being held 
constant. 


RESULTS 


None of the subjects reported or showed 
motion sickness symptoms. Samples of electro- 
nystagmographic tracings of induced vertical 
nystagmic responses of two subjects, one in 
Group A (fatigued), the other in Group C 
(rested), to Coriolis stimulation (right-to-left 
tilts) are presented in Figure 1. Note that the 
duration of the nystagmic responses in the 
initial test of the fatigued subject continued 
for almost 20 seconds after cessation of the 
stimulus, and for almost 13 seconds in the 
repeat test. The rested subject shows a dura- 
tion of almost 13 seconds for both initial and 
repeat tests. Compare these frequencies of 
fatigued test period with the rest test periods. 


Ficure 1. Induced vertical nystagmic responses to Coriolis stimulation (right-to-le: 


two subjects: one in the rested state, the other 


Patrick J. Dowo 


test, O = cessation of stimulus, 10 and 20 = seconds.) 


Mean data and standard deviations for the 
decay function or recovery rate (8) and the 
sensitivity products (œ S) for the initial, re- 

_ peat, and repeat minus initial test periods for 
Groups A, B and C appear in Table 1. 

The effects of sleep deprivation on the 

nystagmic responses were estimated by one- 


TABLE 1 
MEANS AND STANDARD DEVIATIONS FOR 8 AND aS 


Group 
Variable 
A B Ç 
(44) (38) (61) 
8 (rate of recovery) 
Initial 
* 150 .198 -200 
SD 057 .039 .055 
Repeat 
x 180 136 251 
SD 078 .036 075 
Repeat — Initial 
Ds 030 | —.061 -050 
SD 058 043 046 
a S (sensitivity) 
Initial 
X 11.19 6.19 6.53 
SD 3.71 1.66 2.81 
Repeat 
* 8.68 | 9.50 | 5.63 
SD 3,42 2.77 2.30 
Repeat — Initial 
D =251 3.30 —.90 
SD 3.36 2.39 1.65 


aD = X repeat — X initial; columns may ni 
e i y not compute due 


of 
in the fatigued state. (I initial test, I eat 
half the difference of the differe: etween 
the repeat and the initial tests f ups A 
and B. These estimates were 2 r sen- 
sitivity (a S) and —.046 for y rate 
(8). Both of these effects wer ificant 
using F tests ( < .001). 

To determine the “learning” ei com- 
parison was made between the rep id the 
initial test periods for Group C. ificant 
decrease in à S and a significant ase in 
B (p < .001) were found. 

DISCUSSION 

Analysis of the parameters 8 and « S of the 
nystagmic measures for both the rested and 
the sleep-deprived groups indicate’ ‘hat fa- 
tigue (moderate sleep deprivation of a period 


slightly more than 24 hours) exacei 
vestibular responses to Coriolis s 
resulting in interference with the vestibular 
habituation processes developed by pilots in 
their flying experience (Dowd, Moore, & 
Cramer, 1966). 

During the repeat rested test, Group A 
showed an increased recovery rate with less 
sensitivity in their nystagmic responses. One 
could speculate that the initial test was novel, 
and these results were simply due to lack of 
sleep prior to this test. Group B, however, 
had knowledge of the test and with lack of 
sleep prior to the repeat test, their nystagmic 
responses showed decreased recovery rate I 
increased sensitivity. These factors resulted 


DEPRIVATION EFFECTS ON THE VESTIBULAR HABITUATION PROCESS 


tibular habituation process. 
i typical pilot responses with 
ease in sensitivity and increase 
ate of nystagmic responses to 


in an 
Group 
the m 
in re 


repea lis stimulation. There was no 
interí with the vestibular habituation 
proce oup C’s responses. 

Pa lies (Dowd, 1973; Dowd & 
Cra: 7) have shown that subjects re- 
peate osed to such a Coriolis stimulus 
disp! lecrement in the sensitivity prod- 
uct te of recovery. This vestibulo-cere- 
bella ituation suppresses nystagmic re- 
spon cited by Coriolis stimulation, and 
this ession is enhanced by highly ad- 
vanc ght experience (Dowd, 1973; Dowd 
& Cr , 1967). 

In study the habituation was modified 
by t tress condition of fatigue or moderate 
sleep ‘cprivation in that vestibulo-cerebellar 
sens! ity to such Coriolis accelerations had 
incre . Similar results were obtained for 
expe: ced pilots under another stressful con- 
ditio: that of Coriolis-induced motion sick- 
nes lots susceptible to motion sickness 
shox an increased sensitivity and a lesser 
tate recovery in their nystagmic responses 
to C iolis stimulations. 


R sults of this study are opposite to those 
expe ted if minimal arousal from rotation 
were ‘he only factor affecting sensitivity. 
Other studies (Collins, 1962; Collins & 
Guedry, 1962) have clearly demonstrated 
that mental relaxation (or a state of reverie) 


strongly pressed, while alert conditions 
ampliies, vestibular nystagmic responses to 
Totaticn: whereas vestibular responses to ro- 
tation, while the subjects were sleeping, 


clearly cemonstrated depression of vestibulo- 
ocular responses. It was inferred that these 
sleep effects could be “referred to some in- 
fluence exerted by the sleep mechanism on 
_ the brain stem reticular structures intercalated 
in the vestibulo-ocular reflex pathway [Jones 
 & Sugie, 1972, p. 51].” 

It appears that combined stresses from 
Coriolis acceleration and sleep deprivation are 
needed to alter the vestibular habituation 
Process of pilots, Scholander (1961) demon- 
_ Strated supportive findings in that the effects 
of combined stresses of lack of sleep with 
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“continued pressure for performance” caused 
the electrodermal response to increase, Scho- 
lander (1961) explained this effect was due 
to a delay of the habituation process by some 
central neural mechanism that more or less 
“alerts” the subject to respond to such stress. 
Scholander’s explanation of a “higher level 
of internalized attention” affecting a central 
habituation process supports the findings of 
this report. 

It appears that different neural pathways 
are effectively involved: one by a fatigue con- 
dition and the others by either an alert or 
relaxed state. Reverie does not affect the 
vestibulo-cerebellar systems’ habituation pro- 
cesses. It only does, however, influence the 
oculomotor system’s response to vestibular 
stimulation such that the individual acts as if 
distracted and inattentive to the vestibular 
stimulus. The fatigued state, on the other 
hand, modifies the neural elements associated 
in the vestibular system’s suppression or ha- 
bituation process. This, in turn, affects the 
oculomotor system’s response, and the indi- 
vidual has no conscious control over his reac- 
tions to vestibular stimuli (Kennedy, 1972; 
Wolfe, 1968, 1969, 1971). 

The role of the cerebellar areas in the 
regulation of eye movements has been ex- 
plored by various studies (Cohen, Goto, 
Shanzer, & Weiss, 1965; Reding & Fernandez, 
1968). The results support the hypothesis 
that specific areas in the cerebellum exert 
control over the vestibular nuclei and the re- 
sulting oculomotor responses. Thus, control- 
ling habituation develops through some direct 


inhibitory action or facilitory action in neural 


pathways between these nuclei (Wolfe, 1968). 

It is not known whether the processes of 
fatigue influence orientation per se, or vice 
versa, or whether they are independent of 
each other save for the common sensory 
modalities and neural pathways involved, As 
reflected by changes in both parameters, re- 
covery rate (8) and sensitivity (a S), fatigue 


or moderate sleep deprivation induces a po- 


tentially hazardous condition for flying. It 
appears that in a fatigued state, skilled pilots 
regress to a lesser state of vestibulo-cerebellar 
habituation. It can be concluded from this 
study that the vestibular system’s habituation 


Pa — 999 8 
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Ac ‘nistrative decisions based on public 
prefe es have often relied on past experi- 
ence guesswork. Planners would like to 
be a to estimate and predict public re- 
spons: to policy decisions. To do this the 
plann must know two things. First, he must 
kno. zich attributes in a proposed system 
are want to public preference and what 
thei: .elationship is to the perceived desira- 
bill! of a particular system. Second, the 
planier needs to know how these different 
attributes ibine or interact in the forma- 
tion of an overall preference. 

Two rather different modeling procedures 


were considered in this study and applied to 
concerning choice of 

t Ihe first method is gen- 
er 2s “policy capturing” (Ham- 


j. This method stems directly 
swik’s (1956) Lens Model and has 
ularly applied in a number of areas 
nology. The second method is known 
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BUS TRANSPORTATION: 
TWO MODELING APPROACHES * 
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iged probability of riding hypothetical bus systems was studied as a function 
„ frequency of service, and walking distance to the bus stop. The judg- 
nent process of 20 paid college students was modeled by two different ap- 
roaches. A policy capturing approach using regression analysis indicated that 
a linear model accounted for the data adequately. However, a polynomial 
model incorporating both linear and multiplicative components in the judgment 
policy provided a better fit. A method of scaling known as functional measure- 
ment served to reject the linear model and confirmed that a multiplying model 
was correct. Functional measurement provided subjective scales for fare, fre- 
quency of service, and proximity to the bus stop. The advantages and limita- 
tions of both approaches were discussed. 


as “functional measurement” and derives from 
Anderson’s (1974) work on information inte- 
gration theory. The similarities and differ- 
ences between these approaches have been 
discussed elsewhere in detail (Anderson, 
1972; Slovic & Lichtenstein, 1973), but are 
briefly reviewed in the present context. 

The policy-capturing approach usually pre- 
sents stimulus combinations which cover a 
“representative” range, and data are analyzed 
by regression and correlational methods. This 
means that a physical metric on each attribute 
is necessary to allow for a correlation with 
the subject’s judgment response. Linear re- 
gression models are usually used, though non- 
linear or configural models have been recently 
explored (Goldberg, 1971). 

Functional measurement constitutes a scal- 
ing theory which allows for a subjective value 
s for each physical stimulus value and an 
associated weight w which reflects the im- 
portance of that stimulus. Subjective stimulus 
scales can be derived independently of the 
physical metric. Although functional measure- 
ment has been primarily concerned with linear 
models such as adding and averaging, it has 
lately been applied to more complex con- 
figural models (Anderson, 1972; Shanteau & 
Anderson, 1972). 

Both policy capturing and functional mea- 
surement have been quite successful in model- 
ing judgment behavior, but investigators have 
enjoyed little communication on these two 
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PROXIMITY: 


W h 
MEAN JUDGED PROBABILITY 


_ portation mode as well as insight into the 
advantages and disadvantages of each ap- 
proach in a practical situation. 


METHOD 
Task 


The problem to which these two approaches were 
applied is one of particular interest to planners of 
a transportation system. In predicting public re- 
sponse to a new bus system the most desirable 
experiment would be a direct manipulation of vari- 
ous factors. Historically such experiments have been 
= impractical or impossible. An alternative, especially 
in the initial stages of a research program, is a 
_ Paper-and-pencil experiment in which persons rate 
_ the probability that they woud ride a bus system 
described by a set of characteristics, 

Hypothetical bus systems varying in fare, prox- 
imity to the bus stop, and frequency of bus service 
Were judged in terms of the probability that the 
individual would ride the described bus system. 
_ Subjects were told to make their responses in the 
= context of a trip to or from school. Each attribute 
describing the bus system could take on any one of 
_ three values. Fare was either 15 cents, 30 cents, or 
45 cents; proximity to the bus stop was 4 block, 3 

blocks, or 9 blocks; and frequency of service was 
every 15 minutes, 30 minutes, or 60 minutes. All 
possible combinations of these three attributes were 
Presented as different hypothetical bus systems in a 
3° factorial design. The 27 combinations were pre- 
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FARE: “QUENCY: 


15 30 45 15 30 60 2 3 
FARE (CENTS) FREQUENCY (MINUTES) PROXIMi OCKS) 

SUBJECTIVE SPACING 
Ficure 1. Mean judged probability of riding a bus system as a function of fare, frec of service, 
and proximity to the bus stop. (Each panel presents a two-way plot by averaging over or. Values 

on the abscissa are subjectively spaced according to a multiplying model.) 

approaches. The present study was therefore sented in randomized blocks with ler of the 
subjected to both modeling approaches. This ‘ree attributes varying nonsysten Rei 
S = z 3 nations of attributes were printed i middle o! 

comparative technique was intended to pro- 5 
vide data on the properties of choice of trans- Judgments were to be indicated! sh mark 


on a 150-millimeter horizontal lin- he lower 
half of each page. Subjects were tol the left 


end point labeled “O” represented nty that 
one would never take a bus with th icteristics 
given, and that the right end poin led “100” 
represented certainty that one would alwəys take the 
bus described. Intermediate points nted the 
“proportion of time you would choose take the 


bus described.” Responses were me 
meters and rescaled to range between © 
anchors were supplied in the instruct 
possible nonlinearity in the response 
responding close to the end points. T 
were combinations of attributes more 
those actually employed in the experi 
descriptions composing each randomized block were 
stapled together in a booklet. 


Subjects 


Twenty students drawn from the University 15 
Towa Office of Student Employment served as sub- 
jects and were paid $5 each. 


Procedure 


The experiment was administered in two 1-hour 
sessions. In a morning session subjects rated 155 
systems in three booklets at their own rate, 1 
five minutes, and rated descriptions in an additiona 
three booklets. The same procedure was followed in 
an afternoon session. The first randomized block 0 
descriptions in each session was treated as practice 
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leaving 10 blocks for each 
instructed to treat stimulus 
to proceed directly through 
erring to previous judgments. 


Ri 
nts are graphed in the panels 
function of the values of fare 

quency and fare, and prox- 
ency. It can be seen, as ex- 

with increasing fare, judged 

f riding the bus decreased. Like- 
time between buses (frequency) 
king distance to the bus stop 
increased, judgments decreased. 
st striking feature of the data in 


nel 


SULTS 


* 
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the factors was low in desirability the other 
pieces of information had a smaller effect 
on the judgment. For example in the first | 
panel, if the bus stopped 9 blocks away the 

probability of riding was only influenced 

slightly by fare; whereas, fare was quite im- 
portant in determining the judgment when the k 
bus stopped } block away. p 


Policy-capturing 


A linear model of the following form was 
initially fit to the data: 


Rijk = bo + 01. 22 bT; + bUr, 11 i 


where Rj, is the predicted response for fare 


Figur: vas the convergence of the lines to Si frequency of service T}, and proximity to 
the : (his indicated that when one of the bus stop Ux, with their respective regres- 
TABLE 1 
OBSERVED AND PREDICTED RATINGS OF Bus SysteMs For DIFFERENT MODELS 
Predicted rating 
Stimulus combination 

Observed Regression models Functional 

rating measurement 

~ multiplying 
Frequency Proximity Linear Polynomial model 

its) (in minutes) (in blocks) (R? = .948) | (R? = .994) (R? = .997) 
5 5 4 71.01 58.11 67.96 68.56 
5 15 3 59.84 50.96 58.26 59.50 
i5 15 9 23.41 33.80 25.38 26.06 
15 | 30 4 60.20 55.32 61.64 60.75 
15 | 30 3 52.14 48.17 53.22 52.73 
15 | 30 9 22.71 31.02 23.15 23.10 
2 49.00 49.74 48.99 50.00 
3 42.29 42.59 43.15 43.39 
ou 9 19.50 25.44 19.25 19.00 
15 4 40.72 43.48 43.48 42.05 
15 3 35.56 36.33 35.78 36.50 
15 9 17.63 19.18 17.04 16.00 
3 30 2 37.68 40.69 39.54 37727 
30 30 $ 32.16 33.54 32.68 32.34 
30 30 9 16.12 16.39 15.94 14.17 
30 60 4 29.66 35.11 31.67 30.66 
30 60 3 28.05 27.96 26.48 26.62 
30 60 9 13.33 10.81 13.76 11.66 
45 15 2 21.81 28.86 21.11 23.30 
45 15 3 19.38 21.71 18.81 20.22 
45 15 9 8.50 4.56 7.34 8.86 
45 30 2 20.79 26.07 20.91 20.65 
45 30 38) 19.98 18.92 18,99 17.92 
45 30 9 8.43 1.76 8.45 7.85 
45 60 2 18.82 20.40 20,50 16,99 
45 60 3 15.14 13.34 19.36 14.74 
45 60 9 5.50 —3.82 10.67 6.40 


N ote. Correlation indexes (R!) are between observed and predict 


ed ratings, 
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sion weights 5,, b2, and ba, and intercept bo. 
The estimates of the regression weights and 
intercept were —.975, —.186, —2.859, and 
76.95, respectively. Table 1 lists the observed 
and predicted judgments for the linear model. 
The correlation index (R?) of .948 might 
seem more than satisfactory, but a compari- 
son of observed and predicted judgments re- 
vealed a systematic underestimation at ex- 
treme stimulus values and overestimation at 
intermediate stimulus values. 

The linear regression model did not capture 
the configural aspect of the data noted in 
Figure 1 since it assumes that the magnitude 
of the effect of each factor is independent of 
the levels of other factors. It was assumed 
that the addition of cross product terms could 
increase the fit by incorporating multiplica- 
tive components in the judgment policy. Con- 
sequently, a third-order polynomial regression 
equation was applied with all possible cross 
product terms. The predicted judgments for 
this model are listed in Table 1 with an R? of 
994. 

The conclusion based solely on regression 
analysis was that while a linear model ac- 
counted for a large proportion of the variance, 
the subject’s judgment policy was best de- 
scribed by a combination of linear and multi- 
plicative components in a polynomial regres- 
sion model, 


Functional Measurement Analysis 


Like policy capturing, functional measure- 
ment seeks the best fitting model that de- 
scribes the data. Analysis of variance often 
helps to indicate the form of the appropriate 
model. The presence of interactions may indi- 
cate a multiplicative relationship between 
factors (Shanteau & Anderson, 1972), In the 
present case the three two-way interactions 
shown in Figure 1 were all highly significant. 
In addition, the three-way interaction of all 
factors was also significant (p < .001, in each 
case). The presence of these interactions indi- 
cated that a nonlinear model would be nec- 
essary to account for these data. The form 
of the observed interactions suggested a 
three-factor multiplying model of the follow- 
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ing form: 


Rix = (s) (i „ [2] 
where se, tj, and uy are th: ited subjec- 
tive values of the levels of requency of 
service, and proximity to s stop with 
their respective weights t nd wy. The 
weights on these factors a retical, and 
in the present case are ar due to the 
interval nature of the subj cales (An- 
derson, 1974). To allow a 1 for Equa- 
tion 2, the weights were set to 1. Addi- 
tionally, a common origin scales was 
set such that sı = $ = u. It scale esti- 
mates of the subjective valucs found by 
the method of maximum likeli! (Norman, 
1973). 

Table 1 lists the predicted ments for 
the multiplying integration Predicted 
judgments were very close observed 


997 indi- 
s well as 
This is 


values, and the correlation in: 
cated that this model fit at 
the polynomial regression 


interesting to note since the multiply- 
ing model requires the estimat less than 
half as many independent pë ers. 

A further test of the mul ing model 
was provided since it predi fan of 
straight lines when the facto: re subjec- 
tively spaced. This is shown in gte 1 where 
the abscissas in the three pa are spaced 
subjectively for fare, proximity to the bus 
stop, and frequency of ser s graphic 


since the 
the inter- 
bilinear or 


result may be statistically te: 
multiplying model predicts t} 
actions should be localized in 


trilinear components (Shanteau & Anderson, 
1972), and residual componenis should be 
nonsignificant. This pattern was obtained: 


for each interaction, the bilinear or trilinear 
component was significant (p < .001, in each 
case) and the residual was not significant (b 
> .20, in each case). 

Figure 2 plots the relationship between the 
physical scales and the subjective scales for 
each factor for the averaging and multiplying 
models. The straight lines represent the as- 
sumption of regression analysis that the 


physical and subjective scales are linearly Te 


sub. j ective 


lated. For fare and frequency, the 195 


probability of riding the bus was a nesa 


i 
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PHYSICAL SPACING 
Fic Subjective values as a function of the physical metrics of fare, frequency of service, and 
proxim to the bus stop. (Dashed lines represent a linear relationship between the subjective and physi- 


cal metr. s.) 


decele ing function of the physical values, 
that the decrease from 30 cents to 15 cents 
was bjectively greater than the decrease 


from 5 cents to 30 cents. A similar relation- 
shi held for frequency of service. For prox- 
m. y to the bus stop, subjective probability 
w=- a positively decelerating function so that 
e al increments along the physical scale lead 
tc increasing decrements in subjective proba- 
bity. 

This analysis based on functional measure- 
ment clearly showed that a linear model was 
inadequate and should be straightforwardly 
rejected. The multiplying model, on the other 
hand, provided an excellent fit of the data. 

ven that the multiplying model is the cor- 

ct model, functional measurement guaran- 
tecs that the estimated subjective values (see 
iure 2) constitute valid interval scales for 
re, frequency of service, and proximity to 
the bus stop. 


Discussion 


Planners are interested in the two questions 
posed earlier regarding the effects of system 
attributes and the integration rule. First, it is 
clear that the three attributes of fare, fre- 
quency of service, and proximity to the bus 
stop had significant effects on rating judg- 
ments. No statement about the relative im- 
portance of the three attributes can be made 
Independent of the levels of the attributes 


chosen in this experiment. However, within 
the levels chosen, fare had the greatest effect 
and frequency of service the least. Also, it 
appeared that these scales were not neces- 
sarily linear with subjective ratings. One 
would exepct that as distance to the bus stop 
increased to prohibitive levels, the subjective 
ratings would no longer change appreciably 
since they would be close to zero. 

Second, the integration model is multiplica- 
tive in form. If any attribute is at an ex- 
tremely low value the overall rating will be 
low despite the values of other attributes, but 
it is also compensatory since a high-valued 
attribute may compensate for a moderately 
low value on another attribute. Given such a 
model as a description of the integration proc- 
ess, planners may take the next step of opti- 
mizing bus ridership at minimum cost levels. 
One needs only to relate the subjective values 
to the cost of the system and find those levels 
that maximize system desirability at a pre- 
determined cost. As always, caution must be 
exercised in generalizing the results of a con- 
trived experiment to a real situation. 


Evaluation of Approaches 


Regression analysis and functional measure- 
ment differ in several fundamental assump- 
tions. Regression analysis must assume a 
particular relationship between the physical 
values and the subjective values, since physi- 


cual values are used in the prediction equation. 

Although physical metrics were available in 

the present experiment, there are many situa- 

tions in which there is no objective metric. 

Furthermore, the assumption of a linear rela- 

' tionship between the physical and subjective 

values is often unreasonable and in the pres- 

ent experiment would have led to the accep- 
~ tance of an incorrect model. 

: The policy-capturing approach produced 
confusing results as to whether the integration 
model of the subjective impressions in the 
present experiment was additive or multiplica- 

_ tive. A polynomial model provided the best 
it by incorporating both linear and nonlinear 
components in the judgment. However, it 

should be mentioned that the generality of a 

high-order polynomial may be limited. The 

coefficients of higher order terms which in- 
creased predictability in the present situation 
may be unreliable and may decrease predicta- 

_ bility if applied in other situations. Further- 

2 more, the psychological interpretation of the 

polynomial model is lacking. It is not at all 

dlear what processes or combinations of proc- 
esses the higher order coefficients reflect. 

Nevertheless, the policy-capturing approach 

% has permitted high levels of predictability 

(Slovic & Lichtenstein, 1973) and has been 

particularly useful when the data are correla- 

tional in nature and when a factorial manipu- 
lation of the variables is not feasible. How- 
ever, when it comes to diagnosing the correct 
integration model, the policy-capturing ap- 

_ proach may not fare so well. 

In contrast, functional measurement does 

P not require a physical metric and makes no 

assumption concerning the relationship be- 

tween the physical and subjective metric, Un- 
$ like regression analysis, functional measure- 
ment constitutes a theory of measurement 
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which can guarantee vali: 
the subjective stimuli (A: 
seen in Figure 2, the spaci 
tive values is not the same 
tween the respective physic: 
on analysis of variance sery 


al scales of 
1974), As 
een subjec- 
pacing be- 
s. Reliance 
ject the in- 


correct linear model and ind hat a mul- 

tiplying model was appropr he conclu- 

sion based on functional me nt is clear 
and straightforward—the dat ultiplica- 
tive in form. 
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n earlier study conducted in 1973 by Donnelly, Etzel, and Roeth concluded 
an individual's breadth of categorization is related to his purchase of new 


In a ‘nt study, Donnelly, Etzel, and 
Roeth 3) concluded that a consumer’s 
“breadt categorization” is related to his 
purchase i new products. It is our belief that 


this conc) sion is not justified by their study, 
due to a umber of methodological problems. 


Proced 


One roblem with the study is the definition 
Of ne» products, which is based on four prod- 
uct © iaracteristics or dimensions: packaging, 
phys cal appearance, required user prepara- 
tion, and manufacturer’s technological pro- 
cessing. While the four-factor model of prod- 
uct newness may ultimately prove useful, the 
render is not told how each factor was mea- 
sured and the direction of relationship of each 
factor to newness. While a radical departure 
in packaging approach would undoubtedly be 
E*recived by most housewives as representing 
ne ess, how is physical appearance sup- 
posed to operate? Is “required user prepara- 
tio supposed to be measured in time, effort 
or what? Must a product offer less required 
user preparation to be considered new? Does 
More manufacturer's technological process- 
ing” mean more newness, or is it more sophis- 
ticated processing that means more newness? 
Can the researchers accurately judge manu- 
facturer processing in such terms? Even if she 
can, is it likely that a housewife actually 
attempts to assess manufacturer’s technologi- 
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ucts [p. 337].” Questions are raised about how the products studied were 
lected, choice of subjects, and the measure of association used. It is argued 
hat the original conclusion is weakly supported. 


cal processing in identifying and/or evaluat- 
ing new products? Finally, Donnelly et al. 
(1973) give no account of how the factors 
are combined (multiplied? added?) in assess- 
ing newness, and whether or not a product 
must score a certain minimum on each factor 
to be considered new. In short, we need to 
know more about how their theory of newness 
was applied before the selection of their five 
food product groups for study can be evalu- 
ated. 

The five product groups selected for study 
were (a) frozen pudding, (5) frozen donuts, 
(c) canned pudding, (d) canned cake frost- 
ing, and (e) instant oatmeal with fruit. It 
would appear that four of these products 
represent desserts, and are new only in the 
sense of how they are preserved, that is, 
frozen or canned. Thus, according to the 
scheme developed by Robertson (1971), these 
products would appear to involve “the least 
disrupting influence on established consump- 
tion patterns. Alteration of a product is... 
involved, rather than the creation of a new 
product Ip. 7].” 

For these reasons, we believe the Donnelly 
et al. (1973) study cannot claim to have 
necessarily identified five product groups of 


significant newness as perceived by house- 
wives. 


Subjects 


In their sampling of housewives, the re- 
searchers screened for familiarity with the 
five product groups. If a housewife “exhibited 
a lack of familiarity with some or all of the 
products [p. 336],” she was dropped from the 
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sample. Since only 17% acknowledged a lack 
of familiarity with one or more of the five 
product groups, it is at least possible that a 
_ yea-saying bias exists in the dependent vari- 
able, namely, recalled purchase. Actual pur- 
chase data were not available. In this connec- 
tion the hypothesis is incorrectly stated as 
“willingness to buy” when in fact the measure 
is one of recalled behavior. 
By having screened out the 35 subjects 
who disclaimed familiarity with one or more 
of the product groups, Donnelly, et al. (1973) 
lost an opportunity to enrich the conceptual 
utility of their study. Namely, “width of 
categorization”, as a measure of cognitive 
style, may have at least as much to do with 
determining familiarity or nonfamiliarity with 
a product group as it may have to do with 
a determining purchase and nonpurchase within 
a product group. It would have been useful 
_ to relate the width of categorization measure 
lor each respondent to familiarity /nonfamil- 
larity with each product group before relating 
it to recalled purchase incidence. 
pp Analysis 
Even if we assume that the criteria of prod- 
_ uct newness and the sampling were adequate, 
the authors should have used for their analy- 
sis a statistical test of association, given their 
hypothesis, rather than chi-square, which is 
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a test of independence. T? la test would 
have been most appropri ven that the 
dependent variable is sca minally and 
causality between width oi rization and 
recalled purchase is implici e hypothesis, 
However, for frozen puddi: zen donuts, 
and instant oatmeal with he lambda 
coefficient would equal zero inned pud- 
ding and for canned cake 1g, lambda 
would equal .02 and .11, respe 7 

A less demanding, nondire measure 
of association would be cor: analysis 
calculated from the chi-squar tic. Con- 
verting the Donnelly et al. (1 ni-square 
statistics into correlations, we that they 
range from .10 to .28 across the yroducts, 
indicating a maximum of 8% of variance 
in recalled purchase behavior < nted for 
by width of categorization. Th: sing the 
authors’ own data, there is, at b ly weak 
evidence that category width ts pur- 
chase of new products. 
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1 and Tellefsen raised questions regarding some methodological pro- 


tres, including the operationalization of product newness, employed in an 
lier study by Donnelly, Etzel and Roeth. These questions are addressed in 


in attempt to clarify the procedures and conclusions of the Donnelly et al. 
investigation 
In (1974) study Ostlund and Tellef- weakness in many new product studies (Lam- 
sen severa] points in connection with bert, 1972; Ostlund, 1969; Ostlund, 1972a; 
our of new product trial (Donnelly, Peterson, 1973). It would seem that an at- 
Etzel Roeth, 1973). Their principal argu- tempt to operationalize newness should be 
ment with our attempt to operationalize embraced and, if possible, improved by re- 
produ : newness. Specifically, they question: searchers interested in new product research. 
(a) tìi- definition of “newness” used in the To overcome problems associated with the 
study, (b) the choice of products included selection of products in new product studies, 
in the - udy. In addition, they challenge the we used four dimensions—packaging, physical 
use of call as a measure of purchase and appearance, required user preparation, and 
the choice of a statistical test. Each of these manufacturer's technological processing—to 
point. “eserves a reply. operationalize product newness. These four 
0 l and Tellefsen (1974), using dimensions were selected for two reasons. 
Roß on's (1971) well-known continuum of First, they are nonsubjective in nature in that 
new products, have concluded that the prod- they are tied to the physical characteristics of 
ucts in our study are at best only slightly new the products and could therefore be easily de- 


(ceatinuous innovations). Though Robert- 
son's work has contributed significantly to 
thinking and research on new products, his 
three categories are operationally vague. For 
example, if consumption is viewed as pur- 
che he housewife buying canned cake 
frosting may view it as a very new product 
(a discontinuous innovation). However, if 
Cons:mption is viewed as the family eating 


the frosted cake, it is likely that the family 
Members would see little difference between 
this other frosted cake eaten previously, 


and the product becomes at best a continuous 
Innovation. 

A failure to define objective criteria to 
Operationalize product newness is a common 
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termined. Second, they are attributes common 
to all grocery products and are likely to be 
promoted by manufacturers and noticed by 
consumers. 

Admittedly, a stronger research design 
would have incorporated the respondents’ 
evaluation of the degree of newness of the 
products. However, this was not done for two 
reasons. First, having respondents report pur- 
chase behavior and evaluate newness could 
produce a bias. For example, respondents may 
have had a tendency to rate products they 
had purchased as newer than products not 
purchased. Second, in order to have several 
products generally viewed as new, many 
products would have had to have been in- 
cluded on the questionnaire. Given the length 
and difficulty of the Pettigrew (1958) scale, it 
was felt that the addition of a lengthy list of 
product evaluation statements would severely 
lessen respondent cooperation. 
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Ostlund and Tellefsen (1974) ask, “how 
each factor (dimension) was measured and 
the direction of the relationship of each factor 
(dimension) to newness [p. 759].” Numerous 
products were compared to the nearest pre- 
viously available substitutes and were classi- 
fied as being either different or not different 
on each dimension. The products included in 
the study were judged to be significantly dif- 
ferent on all four dimensions. 

Tt is surprising to us that Ostlund and 
Tellefsen (1974) could find a difference on 
only one dimension (how the products are 
preserved, or in our terminology, packaging) 
for the five products in the study. Our ob- 
servations are quite different. To avoid re- 
dundancy, we cite only one example. 

The nearest available substitute for canned 
cake frosting is a powdered frosting mix pack- 
aged in a box. Thus, on our four dimensions 
the two products differ in packaging (can 
versus box), physical appearance (powder 
versus semiliquid), user preparation (opening 
the can and spreading versus adding ingre- 
dients and mixing), and manufacturer’s tech- 
nological processing (a semiliquid requires dif- 
ferent processing than a powder). Though we 
feel Ostlund and Tellefsen (1974) were rather 
conservative in their attempt to match the 
products and dimensions, our disagreement 
points to a more important issue. Studies of 
new products should, whenever possible, in- 
corporate consumers’ perceptions of newness 
rather than researchers’. Possibly future new 
product research should incorporate matched 
Samples, one to develop a set of products and 
the other to test the research hypotheses. 

Ostlund and Tellefsen (1974) are con- 
cerned about the direction of change: “Must 
a product offer less required use preparation 
to be considered new [p. 759]” and “does 
More ‘manufacturer’s technological processing’ 
mean more newness [p. 759].” We contend 
that the direction of change from existing 
products is irrelevant. Webster’s New World 
Dictionary (1961) defines newness as “differ- 
ent from one in the past.” We concur with this 
definition. This is consistent with many prod- 
uct changes which may or may not be product 
improvements but which manufacturers ad- 
vertise as “new.” The actual consumer bene- 
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fit derived from the cha e question- 
able; however, the pur; ) create the 
impression of newness in iers’ minds. 

Ostlund and Tellefsen ask, “How 
is physical appearance su to operate 
Ip. 759].” Physical appea simply how 
the product looks to cons Though no 
other change may be appa anging the 
physical appearance of the can create 
the impression of newness imple, in- 
stant coffee crystals diffe; powdered 
instant coffee only in ap; In our 
study, all of the products physical 
appearance different from tł reviously 
available substitutes. 

Ostlund and Tellefsen (1 question 
whether “a housewife actual! mpts to 
assess ‘manufacturer’s technol: process- 
ing’... (p. 759).” Certainly ti iains to 
be empirically determined. Hos we feel 


that though consumers may no his ter- 


minology, they are frequen formed 
through advertising about new lations, 
processing methods, and techne cal ad- 
vances. Thus, technological chan, a rele- 
vant dimension of newness. 

In our study purchase recall used as 
a surrogate measure of actual pur: se. Ost- 


that this 
dias. We 


lund and Tellefsen (1974) point < 
may have produced a “‘yea-saying 
acknowledge this possibility. H 
other new product researchers wh 
used recall (Darden & Reynolds, 
bert, 1972; Ostlund, 1972b) and . 
chase surrogates such as purchase intention 
(Ostlund, 197 2a), we found that practicality 
made such a measure necessary. We also point 
to the exploratory purpose of our study as 4 
further justification for a less than perfect 
measure. 

In regard to the analysis of the data, Ost- 
lund and Tellefsen (1974) contend that 4 
measure of association would have been mote 
appropriate than chi-square, a measure of 
independence. Our results, using chi-square, 
establish rather conclusively that the rela- 
tionship between breadth of categorization 
and purchase of selected new products 15 not 
the result of chance, Using correlation analy- k 
sis, Ostlund and Tellefsen have shown that 
breadth of categorization explains only 4 


small po 


ATTEMPTING TO OPERATIONALIZE Propuct NEWNESS 
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USING SUBORDINATE RATINGS TO ELICIT BEHA\ 
CHANGES IN SUPERVISORS * 


W. HARVEY HEGARTY? 
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The purpose of this study was to determine whether feedback of subordi 
ratings of supervisors leads to positive changes in the supervisors’ beha 
The employees of the 56 supervisors in the experimental and control g 
completed an information-opinion survey using a modified Likert scale fo 
Feedback reports were given to the experimental supervisors and a second 
vey was conducted 10 weeks later to measure change. Two independent n 
sures showed significant positive change for the experimental supervisors. 


The purpose of this study was to determine 
whether feedback of subordinates’ ratings to 
their supervisor leads to subordinates perceiving 
positive changes in their supervisor’s subsequent 
behavior. 

From a learning standpoint, the feedback report 
can provide the supervisor with knowledge of 
how well he is performing as a supervisor. He 
may change because he is motivated to become 
a better supervisor and/or the feedback report 
reveals some shortcomings that are in contrast 
to his concept of a good supervisor. 

Blum and Naylor (1968) provide a good sum- 
mary of the types of feedback based on the kind 
and source of information involved. This study 
was concerned with extrinsic feedback in which 
the cues are external to the organism. Since the 
information about the performance was from a 
source outside the task, and since there was a 
time delay between behavior and feedback, this 
feedback was secondary and summary in nature. 
Behavioral statements directly relating to super- 
visory performance were used, so feedback 
tended to be more specific than general. 

The earliest study concerning feedback of sub- 
ordinates’ ratings to their supervisors was one 
conducted by Esso Research in the late 1950s 
(Maloney & Hinrichs, 1959). Esso supplied its 
technical supervisors with personal analyses made 
by their subordinates. Another study in a mid- 
western power company used responses gathered 
on an attitude survey of subordinates to feed- 
back to supervisors (Baumgartel, 1959). Wakely 
(1964) used the feedback method in connection 
with an actual-predicted ratings approach. These 


+The data for this study were collected in con- 
junction with the author's doctoral dissertation at 
the University of North Carolina at Chapel Hill. 
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studies all reported favorable r did not 
employ control groups in their designs. 

Two studies reported by Gage ult with 
using feedback to elicit behavior > in an 
academic setting. On a retake oí teacher 
ratings, 10 of the 12 items used study 
showed changes in the hypothesize ird the 
ideal) direction. A similar study o been 
carried out using teacher ratings ir own 
principal and an ideal principal (D Gage, 
1967; Gage, 1963). 

METHOD 

The subjects for this study were 5 >rvisors 
of staff employees at the Universi North 
Carolina. Thirteen departments at the university 


s were 
e depart- 


were involved in the study. The su 
the first-level supervisors in each of th: 


ments. No supervisor with fewer than ‘ive sub- 
ordinates was included, and the average number 
of subordinates was nine. 

The 56 supervisors were randomly assigned 


group. 
< between 
imental 


to the experimental group and the c 
The control group received no feedba 
the first and second survey. The ex 
subjects received feedback on their 
weeks after the first survey. The control super- 
visors did not expect immediate feedback; the 
experimental supervisors knew they would receive 
feedback shortly after the first survey. No sub- 
ordinate knew who would be receiving feedback 
and who would not. 

The specific behaviors to be measured were 15 
4-point Likert scale statements taken from 4 
larger attitude questionnaire. The statements 
under consideration in this study were all con- 
cerned with supervisory behavior. A summary 
item measuring overall performance on 4 
7-point scale ranging from poor to excellent was 
also used. Y 

Complete anonymity was assured by asking 
each individual to identify himself by using # 
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pation r: 3% 


TABLE 1 


SUBORDINATES’ 
TROL Group AND EXPERIMENTAL 


SHort Nores 


nly his department and work 
Fa in the survey was encour- 

were told if they had 
t participating for any reason, 
rot to participate. The partici- 


RATINGS OF 


Control | Feedback 
1 group group 
(n = 28) | (n = 28) 
1. Ian advised by my 
supe s to how I am 
perf z my job. 1.768 2.00 
„ha onfidence in the 
abil Í my immediate 
supe r 1.88 2.20 
The work I do is often 
over! d, but my mis- 
takes -e highlighted. 1.38 1.49 
. My s" visor gives clear 
expla ons and instruc- 
ns 1.76 1.93 
to talk to my 
sor about my prob- 
leas, and suggestions. | 2.10 2.24 
supervisor does not give 
cre-lit where credit is due. 1.51 1.71 
7. I have a clear understand- 
ing cf what my supervisor 
expects me to achieve on 
my jab. 1.96 2.05 
8. My supervisor “breathes 
down our necks” too much. 1.92 2.12 
| 9. My supervisor brings out 
best in people. 1.57 1.78 
y supervisor is not con- 
siderate of me as a person. 1.95 2.10 
.« My supervisor handles 
things which he should dele- 
gate to others. 1.49 1.68 
- My supervisor explains the 
reasons for rejection or ac- 
ceptance of my ideas and 
suggestions. 1.73 1.85 
3. My immediate supervisor is 
quick to take care of com- 
plaints. 1.61 1.86 
My immediate supervisor is 
interested in me asa person. 1.70 1.86 
My supervisor consults us 
when inaking a decision 
that affects us. 1.91 2.05 


Nes 
acı 


Toss all items a higher score is more desirable. 


gatively worded item scores have been reversed so that 
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TABLE 2 
Postrest MEANS OF SUMMARY ITEM 
I Control | Feedback 
ae group | group 


(n = 28) | (n = 28) 


Overall, I think my supervisor’s 
performance is excellent. 
good . . fair... poor. 


a Higher score reflects a more positive rating; for both groups 
p <.02. 


The experimenter went through a standardized 
procedure in explaining the results of the feed- 
back to the supervisors when they received their 
feedback reports. Approximately 60 to 90 min- 
utes was spent with each supervisor in going over 
his individual report. 

Ten weeks after the supervisors were given the 
feedback reports (12 weeks after the first 
survey), their employees were asked to partici- 
pate in a second survey. The format of the 
second survey was the same as the first. 


RESULTS AND DISCUSSION 


Table 1 shows the results of comparing the 
control group with the feedback group. The sign 
test was used on the posttest scores of the 15 
behavioral statements. The experimental group 
had a higher rating on all 15 items (p < .001). 
The sign test was also used to pretest scores be- 
tween the two groups; there was not a significant 
difference at this time. 

A ¢ test was used to determine if the experi- 
mental group was superior to the control group 
on the summary item (see Table 2). The experi- 
mental group was superior (p < .02). There was 
no significant difference on this item or pretest 
scores. 

The major finding as a consequence of this 
study was that giving supervisors feedback re- 
ports improves the supervisors’ performance in 
the eyes of their subordinates. This was sup- 
ported by two independent measures of perform- 
ance as noted above. 

It is significant to note that there was a posi- 
tive change across all 15 behavioral items, The 
lack of independence among these items pre- 
cluded a test of significance for each item, How- 
ever, when a ¢ test is applied, the six items with 
the greatest amount of change had a p<.10 
or better. 

The focus of the study reported here is on the 
subordinates’ perceptions of their Supervisor’s 
behavior. A previous study (Hegarty, 1973) of 
their own change showed that supervisors also 


eo ee 


_ thought that they had become better supervisors 
“as a result of the feedback process. 

In summation, this project offers strong evi- 

= dence that positive change does occur when 


their behavior. Some questions that might be 
addressed in the future along these lines include: 


1, How often should the feedback process be 

used to keep supervisory performance high over 

A longer period of time? 

2. What gains can be made in better super- 

= visory performance if a training program accom- 

panies the feedback process? 

_ 3. Are there other measures of performance 
that could be used to provide feedback (e.g., 

ratings, ratings by the supervisor’s superior, 

work group productivity) ? 

What are the comparative effects of feed- 
from different sources? 

ja > 

This researcher feels that these and other rele- 

t questions should be intensively studied be- 

of the potential value of the feedback 

rocess to an organization. Maximizing the po- 


~ 
4 


SN 


tential of feedback rests 
most judicious way to inc 
the continuing growth and 
supervisor. 
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The S isory Behavior Description (SBD) 
question: first described by Fleishman (1951, 
1953, 195 1972) has received widespread appli- 
cation as measure of the well-known leader 
behavior mensions of Initiating Structure and 
Considers on (Stogdill & Coons, 1957). The 
original m of the SBD was based on a factor 
analysis of leader behavior description items 
administered to industrial employees who de- 
scribed. their supervisors. Tscheulin (1973) re- 
cent! ed a replication of Fleishman’s fac- 
tor of the SBD items for a sample of 
indus! ‘al supervisors and managers in Germany. 
He co cluded that the two main factors of Con- 


Siders ion and Initiating Structure were con- 
firmed as relevant dimensions for describing the 
leader behavior in organizations in West Ger- 
many. 

The purpose of this present study is to report 
a replication of Fleishman’s factor analysis for a 
predominantly female sample in a nonprofit en- 
vironment. The results are compared with those 
presented by Fleishman and Tscheulin. 


METHOD 


Fleishman’s SBD questionnaire was admin- 
istered to 1,161 employees of a major medical 
center located in the Midwest, who used the 
questionnaire to describe their own supervisors. 
Administration of the questionnaire was con- 
 cucted during regular working hours at the cen- 
ter. The sample consisted of professional nurses, 
therapists, technologists, technicians, secretaries, 
d service workers, and administrative em- 

Ployees. Of the respondents 79.6% were female 


5 Requests for reprints should be sent to Henry P. 
Sims, Jr., Room 630-Business, Indiana University, 
Bloomington, Indiang 47401. 


SAMPLE STABILITY OF THE SUPERVISORY BEHAVIOR 
DESCRIPTION QUESTIONNAIRE 


HENRY P. SIMS, Ju 


Indiana University 


tor analysis was performed on the item intercorrelations obtained from 
administration of Fleishman’s Supervisory Behavior Description (SBD) to 
redominately female sample in a nonprofit organization. Factor congruence 
ref were calculated in order to compare the results from this sample 
ith previous leader behavior studies. The SBD measures of consideration and 
itiating structure were found to be stable across time, sex differences, type 
organization, culture, and analysis technique. 


employees, For each item in the SBD, a 5-point 
Likert scale response categorization (e.g., always, 
often, occasionally, seldom, never) is used to 
indicate the frequency with which the supervisor 
engages in that particular behavior, 


RESULTS 


The responses to the 1,161 questionnaires were 
subjected to a principal-axis factor analysis, 
using the subprogram FACTOR from the Statis- 
tical Package for the Social Sciences (SPSS) 
(Nie, Bent, & Hull, 1970). The method of fac- 
toring was PA 2, principal factoring with itera- 
tion, with orthogonal rotation using the varimax 
method.? The factor loadings for the two-factor 
solution are presented as Sample C in Tables 1 
and 2. Also presented in Tables 1 and 2 are the 
original factor loadings obtained by Fleishman 
(Sample A) and the recent study by Tscheulin 
(Sample B). Split-half reliabilities, corrected by 
the Spearman-Brown formula, were .93 for Con- 
sideration, and .89 for Initiating Structure. The 
two factors explained 39.6% of the variance. 

The results indicated a high degree of corre- 
spondence between each of the factors in the 
three studies. In order to quantify the similari- 
ties of the results of each of the studies, factor 
congruency coefficients (phi-coefficients) were 
computed, using the Wrigley and Neuhaus pro- 
cedure described in Harman (1970, pp. 256-259). 
The results, reported in Table 3, reveal phi- 
coefficients above .90 for the Consideration scale 
and above .80 for the Initiating Structure scale, 
The correspondence between the German sample 


The original method of factor analysis used by 
Fleishman (1953) was the Wherry-Winer iterative 
procedure (1953); Tscheulin (1973) used the or- 
thogonal varimax rotation method. 
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TABLE 1 


Comparison OF SUPERVISORY BEHAVIOR DESCRIPTION QuEsTIONN Al: 
Trem FACTOR LOADINGS Across SAMPLES 


x 
; Consideration (Factor I) | St Factor II) 
} Consideration items Sample* 
y 
A B ¢ A 
1. He/she refuses to give in when people disagree 
4 with him/her. —.68 | —38 | —A7 06 
4. He/she does personal favors for the employees 
under him/her. 40 65 27 06 
5. He/she expresses appreciation when one of us 
does a good job. 70 55 72 419 | 
7. He/she is easy to understand. 70 63 73 13 
9. He/she demands more than we can do. —40 — 45 — 58 —.08 | 
10. He/she helps his/her people with their personal | 
problems. 32 54 54 05 
II. He/she criticizes his/her people in front of others. —.49 —.60 —.61 03 | 
15. He/she stands up for his/her people even though | 
it makes him/her unpopular. 54 71 wpe 08 | 
. He/she insists that everything be done his/her | 
way. 52 38 60 01 
. He/she sees that an employee is rewarded for a 
job well done. 70 40 63 05 
He / she rejects suggestions for changes. 62 54 65 06 
- He/she changes the duties of people under him 
without first talking it over with them. —.69 | —.56 | —.64 .09 
. He/she treats people under him/her without con- 
sidering their feelings. —.72 —.70 — 80 Atl 3 
. He/she tries to keep the employees under him in 
good standing with those in higher authority. -68 wal 49 17 -30 
- He/she resists changes in ways of doing things. =57 — —.61 -| —.16 = 
. He/she “rides” the employee who makes a mis- 
take. Saal — 51 —.67 37 43 
- He/she refuses to explain his/her actions. —.72 — AS —.73 .23 ig 
„ He/she acts without consulting his/her people 
first. —.73 —.67 A7 0¹ — ĝi 
- He/she stresses the importance of high morale 
among those under him/her. 13 34 38 —.11 54 
- He/she backs up his / her people in their actions. 62 04 16 16 18 
He / she is slow to accept new ideas. —.66 —.21 15 —.06 04 
. He/she treats all his/her people as his/her equal. .66 69 ST .28 —.08 
. He/she criticizes a specific act rather than a 
: particular individual. 63 19 18 14 —.07 
39. He/she is willing to make changes. 78 55 05 LOD mato ta) 
40, He/she makes those under him/her feel at ease 
l when talking with him/her. 80 688 07 ae 10 
> - He/she is friendly and can be easily approached, 82 “ial = 05 —.02 0⁵ 
He / she puts suggestions that are made by people 
under him/her into operation. 87 55 14 11 26 
- He/she gets the approval of his/her people on im- 
portant matters before going ahead. 62 49 05 — 0² 39 


. Sample B data from Tscheulin (1973); Sample C data from present sample. 
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TABLE 2 


COMPARISON OF SUPERVISORY BEHAVIOR DESCRIPTION QUESTIONNAIRE ITEM 
FACTOR Loapincs Across SAMPLE 


Consideration (Factor I) | Structure (Factor II) 
Initiating structure items Sample* 
A B | G A B G f 
. He encourages overtime work. 20 —.03 09 40 43 16 
. He/s > tries out his/her new ideas on employees. | —.10 35 10 42 38 33 
. He, tes with an iron hand. —.20 —37 — 34 58 58 coil 
He iticizes poor work. —.18 —18 —.15 59 63 55 
. He/si Iks about how much should be done. —.20 —.09 —32 60 55 54 
. He/st courages slow-working employees to 
greate rt. —.17 07 33 33 74 40 
14. He/she „its for his/her employees to push new 
ideas bei re he/she does. —.07 07 —.02 28 33 06 
16. He/she a -signs people under him/her to particu- 
Jar task -00 08 04 2 29 37 
20. He/she sks for sacrifices from his/her people for 
the go. of the entire department. .00 00 OL 46 50 Al 
23. He/sh- insists that his/her people follow standard 
ways of doing things in every detail. 25 —.02 —29 72 63 49 
27. He. ne sees to it that people under him are work- 
i ag up to their limits. —.17 .06 16 87 73 40 
30. fe/she offers new approaches to problems. 36 49 63 -12 30 25 
34. ‘e/she insists that he/she be informed on de- 
ions made by people under him/her. T 11 —.03 51 57 40 
37 she lets others do their work the way they 
ak best. =17 44 40 —.33 20 =.32 
38. He/she stresses being ahead of other work groups. 03 —.16 —.10 34 42 40 
41. He/she “needles” people under him/her for 
greater effort. 17 —.06 —51 50 67 AS 
42. Heh decides in detail what shall be done and 
how it shall be done. 37 —.05 12 .63 40 43 
45. He/she emphasizes meeting of deadlines. 10 04 17 68 49 A 
S people who are slow to work harder. | —.22 | —.03 | —.04 40 70 OF 
A , sme emphasizes the quantity of work. A7 — 21 01 St 69 59 


and the hospital sample (C) is exceedingly 
bizh, with coefficients of .95 and .93 for the Con- 
sideration and Structure factors, respectively. The 
low phi-coefficients between these factors, within 
and across samples, confirm the independence of 

these factors as separate dimensions of leader 
behavior. 


CoNCLUSION 


The similarity of results of this present study 
(with a predominantly female population in a 


Sample A data from Fleishman (1953) ; Sample B data from Tscheulin (1973) ; Sample C data from present sample, 


nonprofit environment) with those reported by 
Fleishman and Tscheulin lend increased confi- 
dence in the cross-sample stability of the SBD 
questionnaire and its use in the study of leader- 
ship in a variety of organizational contexts, In 
particular, the results of the three studies indi- 
cate that the construct validity of the SBD ques- 
tionnaire is stable across time, sex differences, 
type of organization, culture, and analysis tech- 
nique. 


TABLE 3 


CONGRUENCY COEFFICIENTS FOR SUPERVISORY 
BEHAVIOR DESCRIPTION QUESTIONNAIRE 


Sample A Sample C 


Con- 


z Struc- | > Struc- 
sider- | ture | sider- | ture 
aton ation 
OL | .05 95) | =.05 
01 | Bt —.14 93 
ML | 02 

—.09 80 


de A data from Fleishman (1953); Sample B data 
lin (1973); Sample C data from the present sample 
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ON BETWEEN FOREMEN’S LEADERSHIP ATTITUDES 


ND THE SKILL LEVEL OF THEIR WORK GROUPS 


IRIS CHING-YANG HSU? ann RICHARD R. NEWTON 


General Motors Corporation, Detroit, Michigan 


tion between foremen’s leadership attitudes, as measured by the Leader- 
Opinion Questionnaire, and the skill level of their work groups was studied, 
ibjects were 243 foremen and 66 general foremen in a manufacturing plant 
in the Midwest. The results indicated that the skill level of the foreman’s work 
group is significantly related ( < 01) to his structure attitude but not to his 


F 101 
gee 
consideration attitude toward these workers. 
i \uational variable which may be re- 
Flate the leadership style of the supervisor 


is kill Jevel of the work group he super- 
Vise vorkers have generally received 
more trainine ng have gained higher pres- 


Uge tua workers. In addition, the 
je workers may call for more 
ini and autonomy than those of un- 
Ski en workers. Supervisors of skilled workers 
n less need to structure workers’ roles 
a ctivities than supervisors of unskilled 


workers. Differences in skill level may also 
relate the consideration shown by the 
1 in terms of the rapport established, 
umount of two-way communication engaged 
in, etc. 

“he present study compares the leadership 
attitudes of structure and consideration, as 
measured by the Leadership Opinion Ques- 
tionnaire (LOQ) developed by Fleishman 
(1969), for foremen supervising groups of 


sited and unskilled workers. 


METHOD 


Subjects used in this study were 243 foremen, 
representing 85% of the foremen in the manufactur- 
ing area of a plant in the Midwest. Each foreman 
supervised about 25 hourly workers, averaged 42.2 
years of age and 11.4 years of education. 

Hourly workers were classified into skilled and 

killed groups. The former had served an ap- 
Prcaticeship leading to jobs as machinists, tool 


makers, electricians, etc. The latter had not served 
a: apprenticeship and worked as assemblymen, 
pain: 

skilled 


stockmen, ete. Foremen of skilled and un- 
r groups did not differ significantly in 


cts for reprints should be sent to Chris 


Chi 
4 ing- Yang “su, 12-201 General Motors Building, 
eii „ Michigan 48202. 


mean age, education, or number of workers super- — 


vised. 


The Leadership Opinion Questionnaire was ad- f E 


ministered at a location outside of the plant to gen- 
eral foremen and foremen in groups of approxi- 
mately 80 persons as part of a manpower planning 
test battery. The subjects were told that the test 


results would be confidential and would not be 


available to their supervisors. The LOQ consists of 
40 items, and it is purported to measure how a per- 
son thinks he should behave as a supervisor. The 
LOQ provides a score for structure and consideration 
(for detailed deñnitions of these leadership dimen- 
sions see Fleishman & Harris, 1962). 

Since Fleishman (1953a) had previously shown 
that the leadership attitudes of the foreman’s own 
boss might affect the attitudes of the foreman toward 
his workers, a check was made on the possible in- 
fluence of this variable. The LOQ was administered 
to 66 general foremen who supervised the primary 
sample of foremen. Each general foreman supervised 
about four foremen and averaged 44.2 years of age 
and 12.3 years of education. 


RESULTS 


The average structure score of foremen was 
51.08, and the consideration score was 50.78. 
The average structure score of general fore- 
men was 52.58, and the consideration score 
was 54.30. General foremen have significantly 
(p< .01) higher consideration scores than 
foremen. 

The correlations between structure and con- 
sideration scores for foremen and general 
foremen were —.07 and —.10, confirming the 
independence of these two scales reported 
earlier (Fleishman, 1953b). 

Although general foremen of the unskilled 
group tend to have higher structure scores 
and lower consideration scores than general 
foremen of skilled workers, the differences 
were not statistically significant. Thus, both 
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TABLE 1 
COMPARISON or MEAN LEADERSHIP PINION QUES- 
TIONNAIRE SCORES FOR FOREMEN OF SKILLED AND 
UNSKILLED Work Groups 
No. 
LOQ scores | Groups fore- M SD | ttest 
men 
3 | Skilled 1 | 47.88 | 7.56 
Structure 9 202 8172 7.69 2.92 
ion | Skilled 41 | 50.73 | 5.95 
Consideration 202 5075 | 6.59 02 


| Unskilled 


Note, LOQ = Leadership Opinion Questionnaire. 
„ <0. 


groups were reasonably well matched on this 
climate variable. However, foremen super- 
vising the unskilled workers showed signifi- 
cantly (p < .01) higher mean structure scores 
than foremen of skilled workers (see Table 
1). No difference in consideration score be- 
_ tween foremen of the unskilled and skilled 
groups was found. 

As a partial replication of the earlier Fleish- 
man (1953a) findings, the attitudes of fore- 
men working under general foremen with 
different leadership attitudes were compared. 
High consideration or structure attitudes were 
defined as those falling more than one half a 
standard deviation above the mean, while low 
consideration or structure attitudes were those 
falling more than one half a standard devia- 
tion below the mean. 

Although the trends were in the direction of 
confirming the earlier findings (Fleishman, 
1953a), that foremen’s attitudes on considera- 
tion and structure are related to those of 


SHort Notes 


their own supervisors, tl 
plementary analysis did 
significance, 


of this sup. 
statistical 


ConcLusi 


The primary finding was 
of the foreman’s work gr 
significantly related to the for: 
attitude toward his workers. the other 
hand, the skill level of the fo. man’s work 
group was not related to the fo man's con- 
sideration attitude. 

Korman (1966) suggested thc evelopment 
of “a systematic conceptualiza ion of situa- 
tional variance as it might relate 0 leadership 


he skill level 
a variable 
s structure 


behavior and a research program ‘esigned to 

test derivations from such a concept. alization 

.. Ip. 355].” The present results suggest 

one such situational variable whic} Huld be 

given further study in leadershi; rch. 
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CONTENT ANALYSIS AS AN AID TO PREDICTING 


rs OF ADVERTISING 


MICROCOMMUNICATIONS 


DONALD W. HENDON 


\ Columbus College 


analysis of 204 posters was conducted to predict the percentage 


who remembered seeing an outdoor advertising message. The 


atiables were 21 mechanical and 6 content elements; the de- 
les were recognition scores. The predictive validity of 20 equa- 
ow to be of value to advertising communicators. The correla- 
predicted and actual scores were low (.03 to .41, p< .05) 
iependent variables varied only slightly as a result of the 


’s simplicity. 


I | Trade Commission 
to substantiate every 
f the privilege of puffery has 
from the advertiser’s list of con- 
bles, it is probable that content 
masized, while graphics (the ads’ 
ariables) are emphasized, The graph- 
who encodes the advertiser’s origi- 
io order to attract more attention, 
rease in relative importance to the 
sev. This specialist, who is usually the art 
avout designer, has been found to be out of 
touch “h ihe receivers of advertising com- 
m me, Studies (Brinton, 1961; Tannen- 
cobson, & Morris, 1964) have reported 
that the aesthetic judgments of graphic experts 
t shared by ordinary readers. 
he phics specialist’s job is to microcom- 
municate the advertiser’s original macrocom- 
munication to the receivers of his message by 
mbellishing the original content with memorable 
or attention-getting mechanical features. The 
ose of this study is to provide this member 
the encoding team with guidelines to follow 
in the microcommunication process by indicating 
which mechanical variables aid in accomplishing 
the sender’s goals—especially “recognition” of 
the message. Since both mechanical and content 
variables are used in the present context, a new 
term, “mechano-content analysis,” is suggested 
as a substitute for “content analysis” in future 
studies where mixed variables are examined. 
Outdoor poster ads were chosen because they 
are tie simplest of all ad media, with the short- 
and simplest verbal and visual messages. It 
Was elt that if the simplest advertising com- 
Munication channel were used, the results might 
be applied to more complicated ad media, possi- 
bly with a relatively high degree of predictive 
a Tracy, 


req ers 
adve 

been 
trollable va 
will be deer 
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A review of the literature concerning the the- 


ory of the contribution of graphics to memory 
of ads revealed 53 separate studies suggesting 66 


specific findings which would affect recall, recog- 


nition, or attention either positively or negatively — 


(Hendon, 1973). For example, Twedt (1952) 
found in a study of trade journals that size of 
illustration, size and area of copy, number of 
colors, number of type sizes, number of illustra- 
tion units, and number of copy units were signifi- 
cantly correlated with recognition scores in a 
positive direction. 

This review led to the working hypothesis that 
in mass-communicated advertising messages via 
the outdoor medium, the Starch advertising re- 
search organization’s recognition scores (Starch, 
1962) are significantly correlated in a positive 
direction with the factors of brevity, simplicity, 
and boldness (or sensualness). Furthermore, a 
positive correlation is hypothesized between rec- 
ognition and seven variables: (1) tallest product 
illustration letter height/poster height, (2) color 
area/poster area, (3) illustration area/poster 
area, (4) product illustration area/poster area, 
(5) product illustration area/illustration area, 
(6) product illustration area/illustration and 
product identification symbol (PID) areas, and 
(7) product illustration and PID areas/poster 
area. A negative correlation is hypothesized be- 
tween recognition and 20 variables: (8) number 
of copy styles, (9) number of copy sizes, (10) 
average copy height/poster height, (11) tallest 
copy letter height/poster height, (12) number of 
copy lines, (13) number of copy units, (14) 
number of PID copy styles, (15) number of 
PID copy sizes, (16) average PID letter height/ 
poster height, (17) number of colors, (18) num- 
ber of illustration units, (19) PID area/poster 
area, (20) PID area/illustration and PID areas, 
(21) copy area/poster area, (22) number of copy 
letters, (23) number of copy words, (24) num- 
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lustration words, (26) number of copy sentences, 


i 
ber of PID words, (25) number of product il- 
> 


and (27) Robert P. Gunning’s “Fog Index” (Mc- 
Kee, 1967). Variables 22-27 are the six content 

~ variables. 

MetHop 


Each of the independent variables were mea- 
Sured separately by two coders using color 
reproductions of the posters, A compensating 
polar planimeter was used to measure the areas 

< which are included in the definitions of nine 
Vvariables—2-7 and 19-21. 
Each recognition score (dependent variable) 
was changed from the raw recognition score to 
a “copy performance index” (CPI) according to 
a technique developed by Starch (1962). Starch’s 
Scores were expressed in five different forms: 
(a) total CPI, (b) male CPI, (c) female CPI, 
(d) male CPI adjusted for inherent interest dif- 
_ ferences among products, and (e) female CPI 
“similarly adjusted. All five CPI scores express 
the percentage of respondents who remembered 
_ seeing an outdoor ad. 
here were 37 advertisers represented in three 
"product categories: (a) gasoline/automobile ac- 
cessories (89 posters), (b) food (68 posters), 
and (c) alcoholic beverages (47 posters). Thus, 
four equations were required for each of the five 
CPI forms—one for each product category and 
a fourth for all 204 posters grouped together. 
The final sample, chosen on the basis of 
availability, covered 1956-1958 and 1961-1962. 
A cross-validation methodology was used by di- 
viding each of the four product sets into four 
analysis subsets (Group A) and four validation 
"subsets (Group B). A stepwise multiple regres- 
Sion and correlation technique was used to ex- 
mine the relationship between recognition 
“scores and the message elements in Group A to 
obtain predictive equations, and between Group 
B predicted and actual scores to test the predic- 
tive equations. The relationships between Group 
A predicted and actual scores and Group B scores 
and message elements were not measured, 

A combination of factor and regression analy- 
ses was used to test the qualitative working hy- 
pothesis. Factor analysis was first used to sum- 
marize the information contained in a larger set 
of variables (27) in terms of another, smaller, 

less redundant set called “factors,” Then, it was 
used in an indirect way to account for variations 
in the dependent variable in terms of the less 
‘redundant set of factors: If (a) all of the mes- 
Sage elements which were significantly correlated 
with recognition scores in one direction were 
highly loaded on only one factor, and (b) no 
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other message element was led on that 
same factor, then one cou his par- 
ticular factor was highly < h the 
Starch scores, 

ResuLTS 

Because the aforementioned 1 (b) did 
not occur, the working hypothe not sup- 
ported. Five factors appeared in th products 
and gas/auto categories: (a) copy nt brev- 
ity and mechanical simplicity, (b) oduct jl- 
lustration boldness, (c) visual boldnes (d) copy 
area boldness, and (e) PID simpli ty. Three 
more inclusive factors, brevity, sir city, and 
boldness (or sensualness) appeared modifying 
adjectives that described the five actors. Be- 
yond these three more inclusive facto: >. the rela- 
tionships between the message eleme were 
unclear. 

Using the first part of the cross-valiiation 
methodology (examination of the relation, ip be- 
tween Group A independent variables anc recog- 
nition scores), only the hypotheses corre -onding 
to mechanical variables 3 and 7 were s orted. 
These both highly loaded on the visua! dness 
factor; Twedt (1952) found this factor, which 
he labeled “pictorial-color.” No other variables 
were so highly correlated with Group A r=-ogni- 
tion scores as were these (.59 and .51, :espec- 
tively in the five gas/auto equations, p ~~ 01). 


These were the only r values over 50. 

Using the second part of the cross-validation 
methodology (examination of the relationshi 
between Group B predicted and actual reco jni- 
tion scores), the correlations between piecicted 
and actual Group B scores were quite lew, rang- 
ing from 03 to 41, These r values were signifi- 
cantly greater than zero for only two sets of 
equations: all products and gas/auto (p <.05)- 
Fairly large standard errors of the estimate oc- 
curred. The two strongest equations, both with r 
values of .41 (p< .01), were those for n-edict- 
ing the following: 


(a) Percentage of recognition by men viewing 
gas/auto posters = (1.15) [29.31 —.59 (Variable 
3) — 3.01 (Variable 8) + 1.08 (Variable i1) + 
1.39 (Variable 15) —.64 (Variable 17) — 89 
(Variable 21) + 1.22 (Variable 25)]; (b) Per- 
centage of recognition by women viewing gas 
auto posters = (.55) [67.70 — .96 (Variable 4) + 
60 (Variable 7) — 8.48 (Variable 8) — 7.64 
(Variable 24) J. 


These two equations were not applied to other 
ad media as originally intended because of low 
Predictive validity found in the outdoor medium. 


DISCUSSION 


fo! sry general guideline is sug- 
raphics specialist: The larger the 
the poster's total area that is de- 
il illustration, especially that of 
the produ d the PID (or signature), the 


ghe scores of outdoor ads, 
ily 16% of the variance in 
, pr-ducted and actual scores was ac- 


sed “or by the variables in the two strongest 
e „ the crealive interaction of the 27 
anahe, w nnot be measured, had to ac- 
Count 10 maining 84% of the variance, 
racted the influence of other varia- 

h were not measured in this study. 

reíore, jt was not possible to quantify cre- 
atvity, and the guide is probably of limited 


Because ot his 
Present siudy lie 


limitation, the value of the 
elsewhere: 


y little of the variance in outcomes for 
a visual communications system for outdoor ads 
‘s accounted for by graphics decisions, because of 
the restriction of range in the numeric values of 
è independent variables. These numeric values 
were similar, since most poster ads are similar, 
having few words, simple layouts, etc. as a 
result of their unique viewing context. This 
anding has an implication for research meth- 
Odolory—Researchers should probably not begin 
their search for universals in the simplest en- 
|. Vironment possible and extrapolate and test 
vc -resvlts"in more complicated environments; 
the opposite approach is recommended. In the 
paesent context, the search for predictable out- 
comes for a communication system has to lie in 
“iore complicated ad medium, one in which 
the messages transmitted by the medium would 


exhibit more differences than occurred in the 
outdoor sample, 

2. McLuhan’s (1970) thesis, “the medium is 
the message,” was supported. There were inti 
media differences that affected later recognition 
of ad messages carried by different media. The 
Twedt (1952) study and the present study A 
quite different equations predicting recogniti 
scores. As Klapper (1960) suggested, a me: 
is merely one influence, working amid other 
fluences, in a total situation. Therefore, the pu 
message cannot be isolated and studied by i 
There will always be contamination from otht 
forces, including the medium, which will affe 
the outcomes of a communication system. 
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EFFECTS OF ALTERNATIVE POSITIONING OF OPEN-E 
QUESTIONS IN MULTIPLE-CHOICE QUESTIONNAIR! 


W. RUSSELL JOHNSON ano NICHOLAS A. SIEVEKING °? 
Vanderbilt University 


EARL S. CLANTON III 
Fisk University 


Two studies investigated the effect of the placement of open-ended questions, 
relative to multiple-choice questions, on the responses elicited. Both studies 
found that positionings at both the beginning and the end of a predomi 
nantly multiple-choice questionnaire elicited significantly more responses tha 
did either placement alone. When the two positions were used singly, the o 
at the beginning was found to elicit more discrete ideas and response ce 
gories. This was true for black samples and white samples and for differ: 


questionnaire topics and lengths. 


Most questions asked in opinion research are 
multiple-choice because such items often are 
easier to answer, require less of the respondents’ 
time, and permit direct quantification of answers. 
However, these questions are often supplemented 
with open-ended questions in order to elicit un- 
anticipated responses. 

Open-ended questions are frequently positioned 
among other questions wherever convenient or 
logical, without considering that responses to 
these questions could be influenced by their 
placement. Placement at the end of a question- 
naire might elicit both the respondent’s initial 
ideas and those suggested to him during the ques- 

tionnaire. Alternatively, such placement might 
cause response attenuation due to fatigue. Al- 
though authors have given attention to item con- 
struction (e.g., Wood, 1960) and to the use and 
arrangement of questions in an interview (e.g., 
Kahn & Cannell, 1957), no studies of the effects 
of various placements of open-ended questions 
have been conducted. Due to the cost of ad- 
ministering and quantifying open-ended ques- 
‘tions, it seems advisable to determine which 
placements produce more responses. The present 
investigation compared responses to open-ended 
questions placed at the beginning of written 
questionnaires and at the end, after intervening 
multiple-choice items. The problem of open- 


17The authors wish to express their thanks to 
Elaine Johnson, who served as one of the judges, 
and to Lynn Cocke, who assisted with data analyses 
and preparation of the manuscript. 
2 Requests for reprints should be sent to Nicholas 
Sieveking, Interuniversity Psychological and Counsel- 
ing Center of Vanderbilt University, George Peabody 
College and Scarritt College, 300 Oxford House, 1313 
21st Avenue South, Nashville, Tennessee 37212. 


ended question placement within an interview was 
not addressed. 


METHOD 
To investigate the generality ing 
effects, two separate studies vw ed. 
These involved different questionnair tent, 
different experimenters, and different su ject 


samples. The media study, so named because its 
items related to the news media, was g en in 
class by a black experimenter to 110 bla x high 
school and college students. An open-ende | ques- 
tion was placed at both the beginning (Item 1) 
and the end (Item 62) of the questionnaire 
(“first and last” condition), after 60 multiple- 
choice items dealing with the quality of Jae! 
news media. The question asked: “What changes 


would you like to see in the news presentations 
by television, radio, and newspapers? E 
The second study, the counseling in- 


Pa) 
cluded items dealing with various attitudes about 
counseling and counseling methods. The counsel- 
ing study questionnaire was administered by à 
white experimenter during two introduct? 0°" 
chology classes to 156 white college students. ‘the 
open-ended question asked: “What questions 
about yourself would you like to have answ ered 
if counseling were made available to your There 
were two conditions in the counseling study. In 
one of these, 75 randomly chosen cis kes 
ceived the open-ended question at both the be- 
ginning (Item 1) and the end (Item 18) of the 
questionnaire (first and last condition). The 1m- 
tervening 16 multiple-choice items dealt W>. 
less personal matters such as attitudes ionar 
various methods of counseling. In e ,econd con- 
dition, 81 subjects received the open-ended 41 85 
tion in the last position (Item 18) only (“las 
only” condition). 


776 


rid -AN 
E SHorT NOTES 777 
TABLE 1 
A Mean NUMBER OF Distinct IDEAS Given PER RESPONDENT 
Open-ended question position 
| Total: Last 
First | Last First and Last (Subjects giving 
: ail subjects) | (all subjects) (all subjects) additions) 
| w | sp | n | Mw sp | » | «| sp | » | ae SDa 
— — = | ESA DERN S T E 
X af 
y Fir 1 2.33 1.48 110 131 328 1103.64 2.27 57 2.53 | 1.60 
uoun 
; 5 75 | 190 | 140] 75 | St] e 75 | 281 | 140 29 | 1.26 60 
ly aie a | ee be 811.36 96% oan 
ci | 1 1 ~t 
* 7 9 5 
y Subjects who received the open-ended question 52% did give additional responses, and these sub- 
tt inning (Item 1) were asked not to jects averaged 2.53 additional distinct ideas per 
heir responses to that question once they response. The corresponding average for the 39% 
vad bo un answering the multiple-choice items. of counseling study subjects who gave additional 
t Since those subjects were not aware that the responses was 1.26. ‘ 5 
pen question asked at the beginning would be In answering the question in the last only posi- 
ed, the first positioning within the first and tion, counseling study subjects gave significantly — 
jast condition also constituted a “first only” con- 
dition. 


Two judges, blind to question positions, evalu- 
a te? each subject’s responses to the open-ended 
questions in terms of the number of separate 
thoughts expressed, producing a distinct ideas 
ore for each respondent. The judges also sorted 
l responses into broader response categories which 
were a^ stracted from subjects’ answers by group- 
ing related distinct ideas. Independent interjudge 
agreement for distinct ideas was 90% in the 
media study and 94% in the counseling study. 
Corresponding percentages for response categories 
were 69% and 62%. Disagreements were settled 

by discussion between judges. 


RESULTS AND DISCUSSION 


Distinct ideas. Subjects in both studies added 

- _ simificantly (f = 5.06, df = 218, p < .001, media 
study; = 2.49, df= 148, p<.01, counseling 
study) to the number of distinct ideas given to 
tlie open-ended question in the first position when 
they were given a second opportunity to answer 
sac same question (see Table 1). The additional 
dis:ci ideas given by media study subjects 
@veraged 1.31 and those given by counseling study 
su S averaged 51, These averages are based 
on each entire subject sample, though not all sub- 
jects offered additional responses when given a 
second opportunity, Of the media study subjects, 


per subject than when asked the same question 
in the first position and significantly (¢= 4.95, 
df =154, p < .001) fewer than when asked the 
same question twice. Thus, in terms of eliciting 
a maximum number of distinct ideas per subject, 
the first only position appeared more productive 
than the last only position, and the first and last 
position was more productive than both first 
only and last only positions. 

Response categories. The same pattern of re- 
sults emerged with response categories as did 
with comparisons of distinct ideas. The 400 and 
175 distinct ideas given by subjects in the media 
and counseling studies, respectively, were placed 
into 43 and 18 categories. Both studies demon- 
strated that respondents used significantly more 
categories per subject in the first and last posi- 
tioning than in either the first only (t= 6.46, 
df =218, p <.001, media study; £= 2.04, 1 
148, p < .05, counseling study) or the last only 
position (f = 3.90, df=154, p< 001) (see 
Table 2). Also, a first only position elicited sig- 
nificantly (f = 2.20, df= 154, p< 05) more re- 
sponse categories than a last only position. 

It might have been expected that repetition of 
an open-ended question would elicit repetition of 
previous ideas or at least of ideas similar enough 
to be in the same category; however, this was not 
the case. Few respondents used categories in 


(t= 2.68, df = 154, p < .01) fewer distinct ideas ( 


TABLE 2 
MEAN NUMBER OF RESPONSE CATEGORIES USED PER RESPONDENT 


g a repeated question that they had used 
wering it the first time. Of the subjects in 
st and last condition, 47% and 21% re- 
averages of 1.61 and 1.13 categories per 
ect in the media and counseling studies, re- 
vely. The remaining subjects repeated no 
ries. Of the media study subjects, 55% 
additional categories on the repeated ques- 
n; the average number of new categories for 
subjects was 1.93. Similarly, 32% of 
counseling study subjects used an average of 1.58 
w categories. Very few of these subjects also 

— ted categories. 
These studies offer evidence that if an open- 
i ed question is included only once, the best 
ent for it is at the beginning of the ques- 
e. However, repetition of an open-ended 
tion elicited more 4% ideas per subject 


aad 


Total: La 
Last First and Last (Subjec 
(all subjects) (all subjects) addit 
7 ' ku t 
j | u 
110 1.06 | 1.25 110 2.87 1.30 61 1.93 
75 51 85 75 2.73 1.71 24 1.58 
pe = — 81 1.79 1.23 — | 


and more response categories per subject than 
did either single placement. The ‘generality o 
these results was suggested by their becuri 
with black subjects and white ts and 
different questionnaire topics ai: 

the costs of developing response codes and cate- 
gories should be similar whether a question is 
given once or repeated, the return per investment 
should be greater if open-ended questions a 
asked twice. 


lengths 
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